Accurately Precise and Precisely Accurate: An Experimental Comprehension
Shaikh. Mohiuddin1,, Zahida Karim1
1Department of Chemistry, University of Karachi, Karachi, Pakistan
Abstract
Accuracy and precision are the primary targets in chemical analysis. This article is an experimental approach to appreciate the concepts of precision and accuracy in analytical chemistry. An easy, fast and well-established spectrophotometry experiment for quantitative analysis of Fe2+ is done using 1,10-phenanthroline as a complexing or coloring agent. Real time learning approach is carried out; it means discussing the data in front of class after collection of experimental results. A little data is used to discuss variety of concepts of basic statistics in analytical chemistry. A debate regarding priority of precision over accuracy is done. The objective of using standard deviation (STDEV) and changes in its value by unit conversion or other transformation is explained. The advantage of using relative standard deviation (RSD) is also demonstrated. Several spectrophotometers are compared regarding precision. By using same data, precision of analog and digital spectrophotometers is also compared. Repeatability and reproducibility are also explained and calculated using the same data. Presence of systematic error or accuracy of experiment is statistically assessed without doing further experimentation. The whole study is optionally supported with online MS-Excel worksheets that are automated for better understanding.
Keywords: accuracy, precision, second year undergraduate, analytical chemistry, graduate education/research
World Journal of Chemical Education, 2015 3 (2),
pp 40-45.
DOI: 10.12691/wjce-3-2-3
Received March 28, 2015; Revised April 10, 2015; Accepted April 15, 2015
Copyright © 2015 Science and Education Publishing. All Rights Reserved.Cite this article:
- Mohiuddin, Shaikh., and Zahida Karim. "Accurately Precise and Precisely Accurate: An Experimental Comprehension." World Journal of Chemical Education 3.2 (2015): 40-45.
- Mohiuddin, S. , & Karim, Z. (2015). Accurately Precise and Precisely Accurate: An Experimental Comprehension. World Journal of Chemical Education, 3(2), 40-45.
- Mohiuddin, Shaikh., and Zahida Karim. "Accurately Precise and Precisely Accurate: An Experimental Comprehension." World Journal of Chemical Education 3, no. 2 (2015): 40-45.
Import into BibTeX | Import into EndNote | Import into RefMan | Import into RefWorks |
1. Introduction
Statistics is an essential tool for quality analytical chemistry [1]. The prime target of using statistics is to have accurate and precise outcomes of analysis. Stories about accuracy and precision are not new, people are always worried about it [2, 3]. This is well-known concept that closeness of data is precision and closeness of results or its mean value to the true or actual value is accuracy. Several attempts have been made in past to nourish the concept of accuracy and precision in the field of chemical education (4). Experiments and pedagogical contents have been reported for good understanding of the concept of basic statistics, especially accuracy and precision [5, 6]. Other chemical analysis [7, 8] as well as disciplines covering other than science subjects are also emphasis on accuracy-precision differentiation even in topic of articles [9, 10, 11].
1.1. Real Time Teaching and LearningThis is an approach to discuss the concepts within lab or during experiment. In this experiment, a collective data is used and students are gathered to discuss data and do calculation to appreciate concepts. At each step students are asked reason(s) for any expected and unexpected matter, details are demonstrated in discussion section. This gives chance to students to actively think and practice their concepts. Most of the concepts canvassed in this article are commonly known, but it is always better to practically appreciate them.
1.2. Which Comes First? Accuracy or PrecisionAn important thing that is discussed (prior to experiment) is preference among accuracy and precision. Both are important but which one has priority, it has to be understood. Usually people have different misconceptions in treatment of data [12]. One of those is about giving preference to accuracy or precision. A website has been visited in August 2011 and also cited in this article [13] showing a poll for choosing more important among accuracy and precision. In such poll out of 22 participants, 12 gave preference to accuracy over precision. This is not necessary to have same results of such polls every time but usually the students of chemistry or analytical chemistry also give importance to accuracy over precision. That argument is exercised in current study. Details are specified in subsequent text (see section 4.1.).
1.3. Why Standard Deviation?Since precision is the closeness to the mean value of data, therefore deviation (i.e. difference of each data point with mean) can be one of the tools to know closeness of data. But problem is that approximately half of the deviation values are always negative with similar magnitude as of positive values. In order to find out the average of deviation one should always find zero value of average deviation or value very near to zero. Usually for averaging the data having both positive and negative values, a root mean square is calculated. Therefore square of deviation turns negative values to positive and then after calculating their average, square root of resultant is calculated. This value is termed as standard deviation (STDEV) in terms of statistics. That is basically root mean square value of deviation (equation 1) and holds good identity for precision of data.
(1) |
STDEV has the same unit as of each data point. If value of each data point is transformed or manipulated then the value of STDEV is also changed as well as its unit. This fact is also explained and practiced in this study. There is a way to normalize such transformation that having a value showing precision but does not change with transformation or manipulation. This is relative standard deviation (RSD) that is the ratio of STDEV with mean value of same data. RSD is a well-known statistical terminology and for all type of transformations it usually remains same. This is also well practiced and evidenced in this experiment.
1.5. Comparisons Regarding PrecisionIn current study precision of spectrophotometers is compared individually and also by categorizing them in analog and digital equipment. Comparisons of precision on different basis or categories (like among students, gender etc.) are also possible in this experiment but not considered in this study. Criterion for precision comparison is the value of STDEV. Lesser the value of STDEV means better precision.
1.6. Repeatability and ReproducibilityThese are also useful statistical terminologies; repeatability refers to the precision of replicate analysis while reproducibility refers to the precision between different sets of analysis. For repeatability, all experimental conditions for each replicate are considered to be identical and also all runs are at same slot of time. In case of reproducibility there must be difference in one or more of the conditions like equipment, time or instance, analyst, method etc. Within-run precision is repeatability and between-run precision is reproducibility. These are also estimated in this experiment to practice such concepts [14].
1.7. Regarding AccuracyAccuracy is related to the systematic errors in the data. Statistical confidence level is good for identifying systematic error in analysis. Important is that the true value ‘μ’ of analyte in sample must be known. Number of replicates ‘n’, their mean ‘’ and STDEV ‘s’ are used to find out confidence interval (equation 2) corresponding to value ‘t’ that is tabulated statistically [14].
(2) |
In case that true value lies in the confidence interval (equation 2) of x+ and x-. One can state that at the certain level of confidence (for which ‘t’ is tabulated) there is no systematic error in data or analysis.
1.8. Brief of Chemistry in AnalysisA well-established and simple spectrophotometric determination of Fe2+ is carried out in this study [15]. Complexing agent (1,10-phenanthroline) forms fairly colored complex with Fe2+. This complex shows highest absorbance approximately at 510nm. Complex is stable under a certain pH range. Acetic acid/ Sodium acetate buffer is used, with support of hydroxylamine solution as reducing agent in order to avoid unnecessary oxidation of Fe2+. Although this study can also be done far easier and quicker way, by analyzing readily colored analytes (like CuSO4, KMnO4 solution etc.).
2. Materials and Methods
2.1. Pre-labThe scope, objectives, experimental details and hazards related to this study are instructed to students before experiment. Priority among precision and accuracy is also discussed before experiment.
2.2. Preparations by InstructorDistilled water is used throughout the analysis. A stock solution of 20ppm Fe2+ is prepared from analytical grade ammonium iron (II) sulfate hexahydrate in 500mL volumetric flask as a collective sample. This stock sample solution is acidified with 5mL of 98% sulfuric acid before dilution and was made available to student in a good quality 50mL graduated burette. In order to avoid any biased result, concentration of this stock sample is not given to students. This concentration is informed to students after all experimental outcomes are entered in Table 1 at classroom’s board.
Approximately 100 g of hydroxylamine hydrochloride is dissolved and diluted up to 1 liter with distilled water. Sodium acetate solution is prepared by dissolving ~200g anhydrous sodium acetate in 1 liter of distilled water. 2.5g of 1, 10-phenanthroline monohydrate is dissolved in 1 liter of ethanol.
2.3. Preparations by StudentsStudents are instructed to dilute (1:10) the above stock sample in three 25mL volumetric flasks (for triplicate analysis) with addition of hydroxylamine hydrochloride, sodium acetate and 1,10-phenanthroline solutions (2.5mL of each) before making up of volume.
Each group of students is instructed to make appropriate stock Fe2+ calibration standard solution and acidify accordingly with sulfuric acid (as mentioned earlier in preparation of stock sample solution). One working calibration standard of Fe2+ is diluted within the range of 0.02 to 0.06mM from stock Fe2+ calibration standard. 2.5mL of each of hydroxylamine hydrochloride, sodium acetate and 1,10-phenanthroline solutions are also added prior to making up of volume of working calibration standard. Students of group ‘1’ choose 0.04mM Fe2+ as working calibration standard (Table 3). Blank solution is prepared by diluting 2.5mL of hydroxylamine hydrochloride, sodium acetate and 1, 10-phenanthroline solutions in 25mL volumetric flask.
Table 1. Consolidated Data of Entire Class and Outcomes of Calculation. (Portion in gray background is reproduced from classroom board after experiment)
Six spectrophotometers of two types are employed. Three Erma® AE-200 analog and three Thermo® Genesys-6 digital spectrophotometers are used. All instruments are assigned codes as mentioned in Table 1. Students are distributed into 24 groups and four groups are assigned to do their experiment at each instrument (Table 1). Absorbance measurements are carried out at 510nm in 10mm optical grade plastic cells against blank solution. Portion of Table 1 with gray background is made at the classroom board. Each group of students entered the data in respective rows at board after experiment.
3. Hazards
No extremely hazardous substance is used in this experiment. Care should be taken to avoid skin contact and wear eye protection. Acidification of stock Fe2+ solution must be carried out in well-ventilated area preferably in fume hood.
4. Discussion
This experiment is done by 72 students. They were divided in 24 groups such that each group has three students. This experiment is repeated in six semesters in three years. Around 150 students have participated all-over. Currently, experiment that has done in one semester is reported only. This is a collective experiment therefore whole class did most of the calculations altogether. For some calculations, only data or results of group 1 are given as an example. MS Excel™ worksheets as online support for this study are available online at https://1drv.ms/1LFKvYp. See details in section of ‘Available Supporting Information’ at the end of this article. This article is also understandable without such support. In this case ignore brackets [ ] in following text.
4.1. Accuracy Vs. PrecisionFirst of all a question is asked to students about preference between accuracy and precision. Approximately 70% of students were in favor of accuracy, 25% with precision and 5% had opinion that both have equal importance. This is further discussed with the help of examples in which this is asked that if outcomes of a titration X are 2.9, 17.1, 8.9, 11.1mL with average of 10.0mL. In another similar titration Y having endpoints 9.5, 9.6, 9.5, 9.4mL with average of 9.5mL. Which titration is looking fine? Almost all replied that Y is looking good. The only reason is good precision. If the true or real value of end point is given (e.g. 10.0mL) then it seems that X is better regarding accuracy. Since average of analysis matches the true value. But still most of the students choose Y again just because of precision. Another important fact is that most of the time the true value for samples is not known in real quantitative analysis. Therefore mostly there is no idea about accuracy. But precision is always there, unless you don’t have replicate analysis. Fortunately they all agreed that precision has more importance.
4.2. Real-time LearningThe better way of learning is to call all the students at classroom’s board, after students have finished their experiment and entered outcomes at board. The portion of the Table 1 having gray background is reproduced from the classroom board. This approach of real-time learning is beneficial for students as this is a good exercise of their concepts. In preceding text, it has already been discussed that precision has priority over accuracy regarding importance. Therefore precision is assessed before accuracy in the remaining section. Further aspects as real-time teaching and learning are continued in subsequent text.
4.3. Difference in the Values of Standard DeviationPrior to do any calculations, students are asked that what they expect about STDEV values of columns (Absorbance, concentration in mM and ppm). Whether they will be same or not? This question creates groups in students having opinion yes and no.
Then students are asked to calculate the STDEV (using equation 1) for first two columns (Absorbance and concentration in mM units). The values of STDEV are different, i.e. 0.0397 [‘Table 1’!E33] and 0.0058 [‘Table 1’!F33] (Table 1) for absorbance and concentration (in mM), respectively. The reason for this difference is asked to the students. This is because these are the STDEV of different values therefore these are not same. Then it is also asked that what about the unit of STDEV? Answer is that the STDEV has the unit same as each individual data has. For example, unit for STDEV of molar concentration values is molarity.
Then it is asked to students that how they can estimate the STDEV for concentration (mM) from the value of STDEV of Absorbance. It means as they have calculated the value for concentration (mM) using single standard calibration for their sample (equation (3), where A is absorbance, C is for concentration; subscript u and k represents unknown and known concentrations).
(3) |
Similarly they have to estimate the value for STDEV of concentration (mM) by placing the value of STDEV of sample’s absorbance 0.0397 (from column statistics of Table 1) in place of average absorbance Au. Therefore the equation 3 is transformed to the following equation (4), where ‘s’ is STDEV.
(4) |
The result of this calculation (using equation 4) is an estimated STDEV for concentration (mM) that is 0.0027mM. Absorbance and concentration of working standard are used from data obtained by group 1 from Table 3. This value (0.0027mM) is somehow closer to the STDEV obtained directly from the column having concentrations (mM) in column statistics of Table 1 (i.e. 0.0058mM) [‘Table 1’!F32]. The reason for having difference in these values is very interesting. Since each group uses different calibration standard solutions for such estimation. Hence there must be different degree of errors in preparation and measurement of calibration standards by each group. Therefore values of estimated STDEV and directly calculated STDEV are different.
Similarly students are asked to estimate STDEV for concentration in ppm using STDEV of concentration (mM). They used the conversion factor as they have used to convert the mean concentration (mM) to ppm. In this case, the conversion factor is 55.847. Therefore, the estimated value for STDEV of concentration (in ppm) is 0.3240 that it very close to direct STDEV calculation of column in Table 1 (i.e. 0.3266) [‘Table 1’!G33]. Unlike conversion from absorbance to mM concentration, both concentration columns have similar conversion factor for each observation. Therefore the estimated and directly calculated STDEV values are closer. Sometimes a little difference in these values may appear. This may be due to the difference in selection of significant figures in calculation by different groups.
4.4. How relative Standard Deviation is Advantageous?RSD is the ratio of the STDEV and mean of replicates. Therefore it normalizes the deviation effect. RSD values for absorbance and both kind of concentration values are expected to be same. But there are interesting facts in column statistics of Table 1 that RSD for absorbance is different from RSD of corresponding concentration values. But RSD values for both concentration columns are same. This is again discussed with students that the absorbance does not have effect of calibration standard. Since different calibration standards are used in groups hence different calibration factors are used for each data point. Therefore the RSD values are different. In case of both concentration units, these are same because for each observation a fixed conversion factor is used. Unlike collective data in Table 1, in Table 3 the data is from single group [‘Table 3’!E19 to G19]. The RSD values are same for all columns. Since single group uses standard of fixed concentration of working standard, it means same calibration factor is employed for all replicates. Most of the times RSD does not change with conversions hence this seems to be a better and more comparable tool to estimate precision than STDEV.
4.5. Comparing EquipmentData is organized in categories of groups that have used same equipment. Then precision for each instrument is calculated. STDEV for absorbance and concentration in mM and ppm units are mentioned in Table 1 under the column heading of STDEV. Regarding instrument’s performance it is better to do comparison of absorbance instead of concentrations. In case of concentration columns, the use of calibration standards compensates the errors. So, one could not perfectly find the precision profile of standalone equipment. The lowest of STDEV value for absorbance 0.01307 [‘Table 1’!H19] or highest precision is of instrument ‘D’ that is digital equipment. The poorest in precision is equipment ‘C’ (i.e. analog) with STDEV for absorbance value 0.05502 [‘Table 1’!H15].
Interesting thing is that if the STDEV for concentration in mM or ppm from each instrument is compared then still the best is ‘D’ but the worst is now ‘F’ (digital) [‘Table’1!I27]. Again the fact behind this observation is that the ratio of absorbance values in calculation of concentration (using equation 2) compensates the errors. Therefore it is always recommended in analysis, to run fresh calibration standard along with the sample.
4.6. Analog vs. DigitalOut of which first three (coded: A, B and C) are analog and others are digital. According to Table 2, precision for analog equipment is less or STDEV (0.0479) [‘Table 2’!D6] is higher than digital equipment (0.0316) [‘Table 2’!D7]. This may be due to the manual reading of absorbance displayer at analog spectrometers (i.e. a galvanometric needle). Therefore chance of uncertainty is higher. Others (D, E, and F) are having digital displayer that has less chance of human error in observing absorbance values. Also several techniques for signal to noise ratio improvement are being used in modern or digital equipment [16].
It has been mentioned in previous text that within-run precision is repeatability and between-run precision is reproducibility. Students are asked to identify the cases of within and between runs. Data in Table 3 is observed by single group and also represents within-run system. Since six spectrophotometers are used in this experiment Therefore one can estimate repeatability that is STDEV for the replicates run by each group. For group 1 the repeatability (i.e. STDEV) in absorbance measurement is 0.0030 [‘Table 3’!E18].
For estimation of reproducibility, conditions between the runs must be changed. This means there must be difference in at least one of the analysis parameters like date of analysis, analyst, temperature, equipment etc. for single sample. In this experiment different instruments and groups of analysts are involved in analysis of the same sample. Therefore the STDEV for absorbance measurements by all the groups is 0.0397 [‘Table 1’!E33] and this can be referred as reproducibility of analysis (Table 1). The values are according to expectations that repeatability is improved in comparison with reproducibility. STDEV for same equipment is usually lesser than measurements using different equipment. One can also estimate repeatability and reproducibility values for concentration (mM) and in ppm.
4.8. Accuracy ReconciliationAfter experiment, students are informed about the actual concentration of the collective sample they have analyzed (i.e. 20.0ppm Fe2+). Since this sample is diluted 10 times (see section 2.3.) therefore it became 2.0ppm Fe2+ after dilution. This is true value ‘μ’ as described in equation 1. The mean ‘’ concentration value (in ppm) for triplicate runs is 1.8430 [‘Table 3’!G17] and STDEV i.e. ‘s’ is 0.0115 [‘Table 3’!G18] from Table 3. Statistical value of ‘t’ is tabulated 4.303 at two tailed basis with confidence level ‘α’ = 0.05 for 2 degree of freedom. The calculated value of term in equation 2 is 0.0286. Finally, using equation 2 the estimated range for ‘μ’ is found 1.872 to 1.814. Since true value (μ = 2.0ppm) does not lies in this range. It can be stated statistically that at 95% confidence level there is a systematic error in the analysis performed by group 1. In case if true value lies in the estimated range then one can state that there is no systematic error in the analysis at 95% confidence level. The above range can also be calculated for other confidence levels. One of the students asked the reason about not using absorbance data for assessment of accuracy. The reason is that it is difficult or quite impossible to have true value for the absorbance measurement in this case.
5. Conclusion
This activity is based on the good understanding of basic statistical tools used in the field of analytical chemistry. The experiment is simple, easy and fast. Small amount of less harmful chemicals are used. Usual spectrophotometers are used that are mostly available in academic and other laboratories. Experiment can hold large number of students as it require less space and give chance to each student with active participation. This is collective lab for whole class. A real-time approach is used to discuss the collective and individual experimental outcomes by sharing ideas with students. A little data is used to appreciate variety of concepts.
6. Available Supporting Information
MS-Excel™-2003 worksheets are available online at https://1drv.ms/1LFKvYp (file named Support File(1).xls) to support understanding of tables and also confirm calculations in this article. This file can be used to do calculation for your own experimental data. By reason the cells in the worksheets are not locked. Therefore it is recommended to enter the values only in the cells having gray background. This will automatically do calculation in rest of the cells having formula. Reference of the cells in worksheets are given in italics within the square bracket [‘Sheet’s name’! Cell address] in the manuscript. Those who are not using this software support may ignore such brackets. This MS Excel software support and other MS Word document files (Instructor’s_Instructions[1].docx and Handouts_for_students[1].docx) are also available via Internet website https://1drv.ms/1LFKvYp. All these files can be downloaded and also edited online at internet browser.
Acknowledgment
Authors are thankful to number of teaching assistants and students to take part in this activity and their support with valuable feedback during experiment.
List of Abbreviations
STDEV: Standard Deviation
RSD: Relative Standard Deviation.
References
[1] | Paul, S. M., “The Importance and Efficacy of Using Statistics in the High School Chemistry Laboratory,” Journal of Chemical Education, 83(11). 1649-1651. Nov. 2006. | ||
In article | CrossRef | ||
[2] | Walter, R. C. “Practical precision for chemistry students,” Journal of Chemical Education, 29(7). 349-352. Jul.1952. | ||
In article | CrossRef | ||
[3] | Buehrer, T. F. and Schupp, O. E. Jr., “Student precision in quantitative analysis. I. Factors influencing precision” Journal of Chemical Education, 3(12). 1271-1276. Nov.1926. | ||
In article | CrossRef | ||
[4] | Jordan, A. D., “Which Method Is Most Precise; Which Is Most Accurate?,” Journal of Chemical Education, 84(9). 1459-1460. Sep.2007. | ||
In article | CrossRef | ||
[5] | Richard, S. T., “Precision and Accuracy in Measurements: A Tale of Four Graduated Cylinders,” Journal of Chemical Education, 75 (8). 992-995. Aug.1998. | ||
In article | CrossRef | ||
[6] | Piskulic, L., Racca, L., Bottai, H. and Leiva, M., “Accuracy and Precision in Measurements: Two Complementary Approaches,” Teaching Statistics: An International Journal for Teachers, 28(1). 14-16. Feb.2006. | ||
In article | CrossRef | ||
[7] | Hui, L., Chi-Fu, Y. and Sanjeevi, S.. “Fluorescence Axial Localization with Nanometer Accuracy and Precision,” Nano Letters, 12(7). 3731-3735. Jun.2012. | ||
In article | CrossRef PubMed | ||
[8] | Matthew, A. W., Andreas, V. and Rohit, V. P., “New Estimators for Calculating Solvation Entropy and Enthalpy and Comparative Assessments of Their Accuracy and Precision,” The Journal of Physical Chemistry. B., 114(24). 8166-8180. May.2010. | ||
In article | |||
[9] | Leonard, I. W., Tyler, B. C., and Pradeep, K. A., “Approaches for Achieving Long-Term Accuracy and Precision of δ18O and δ2H for Waters Analyzed using Laser Absorption Spectrometers,” Environmental Science and Technology, 48(2). 1123-1131. Dec.2014. | ||
In article | CrossRef PubMed | ||
[10] | Feng, Y. and Max, L., “Accuracy and Precision of a Custom Camera-Based System for 2-D and 3-D Motion Tracking during Speech and Nonspeech Motor Tasks,” Journal of Speech, Language, and Hearing Research, 57(2). 426-438. Apr.2014. | ||
In article | CrossRef PubMed | ||
[11] | Haley, S. M., Coster, W. J., Dumas, H. M., Fragala-Pinkham, M. A., Kramer, J. N., Pengsheng, Tian, F., Kao, Y., Moed, R. and Ludlow, L. H., “Accuracy and Precision of the Pediatric Evaluation of Disability Inventory Computer-Adaptive Tests (PEDI-CAT),” Developmental Medicine & Child Neurology, 53(12). 1100-1106. Dec.2011. | ||
In article | CrossRef PubMed | ||
[12] | Hartkopf A. V. “Significance and precision” Journal of Chemical Education, 64 (12), 1068, Dec.1987. | ||
In article | CrossRef | ||
[13] | Marc, T. S., “Accuracy vs. Precision - What is more important for Process Capability” Elsmar Cove Disscussion Forum, 1(6), 6. Jul.2007. | ||
In article | |||
[14] | Miller, J.C. and Miller, J.N., Statistics and chemometrics for analytical chemistry, 6th Edition, Pearson Education, Canada, 2010. | ||
In article | |||
[15] | Belcher, R. “The application of chelate compounds in analytical chemistry,” Pure and Applied Chemistry, 34(1), 13-27, Jan.1973. | ||
In article | CrossRef | ||
[16] | Skoog, D. A., Holler, F. J. and Crouch, S. R., Principles of Instrumental Analysis, 6th Edition, Thomson Brooks/Cole, Belmont, CA, 2007. | ||
In article | |||