Critique of the Guide to the expression of uncertainty in measurement method of estimating and reporting uncertainty in diagnostic assays.

BACKGROUND The Guide to the Expression of Uncertainty in Measurement (GUM) provides instructions for constructing uncertainty intervals for a measurement. This method is usually reserved for reference materials, but GUM has been recently proposed as a way to express uncertainty for commercial diagnostic assays. METHODS Using the official GUM standard and published applications of GUM to commercial diagnostic assays, I undertook an analysis to evaluate whether applying GUM to commercial diagnostic assays is warranted. RESULTS Certain important assays, such as troponin I, would not be candidates for GUM because troponin I is not a well-defined physical quantity. Unlike definitive methods, in which efforts are taken to detect and eliminate all systematic error sources, commercial assays often trade off features such as ease of use and cost with accuracy and allow systematic errors to be present as long as the overall accuracy meets the medical need goal. Laboratories are hindered in preparing GUM models because the knowledge required to specify some systematic errors is often available only to manufacturers. Some non-GUM methods to estimate uncertainty rely on observed data, which include both known and unknown sources of error. The occurrence of large, unknown errors for assays in routine use (e.g., outliers) is not unusual because diagnostic assays must be chemically specific in the presence of thousands of potentially interfering substances. There is no provision in GUM to deal with unexplained outliers, which may lead to uncertainty intervals that are not wide enough. Some clinicians assume that diagnostic assay results have little uncertainty. This situation may be made worse by including an uncertainty interval, which implies certification. CONCLUSIONS Evaluations for accuracy (total analytical error) based on describing the distribution of result differences between commercial assays and reference methods indicate that some assays have a few results with large differences (e.g., outliers). This leads to a wide accuracy interval (total analytical error limits). It is unlikely that GUM would be able to predict these wide intervals, especially because there is little or no provision for outlier treatment in GUM. Presenting too narrow GUM uncertainty intervals to clinicians would be misleading. The modeling used by practitioners of the GUM method is potentially useful in improving quality, but commercial diagnostic assays are not ready for GUM uncertainty statements.

[1]  S. Ellison,et al.  Quantifying uncertainty in analytical measurement , 2000 .

[2]  A Kallner,et al.  Does the Uncertainty of Commonly Performed Glucose Measurements Allow Identification of Individuals at High Risk for Diabetes? , 1999, Clinical chemistry and laboratory medicine.

[3]  J Kristiansen,et al.  Traceability and Uncertainty in Analytical Measurements , 1998, Annals of clinical biochemistry.

[4]  M Grabe Estimation of measurement uncertainties - an alternative to the ISO Guide , 2001 .

[5]  E. Bruck,et al.  National Committee for Clinical Laboratory Standards. , 1980, Pediatrics.

[6]  F. P. Anderson,et al.  Performance of four homogeneous direct methods for LDL-cholesterol. , 2002, Clinical chemistry.

[7]  J S Krouwer,et al.  Estimating total analytical error and its sources. Techniques to improve method evaluation. , 1992, Archives of pathology & laboratory medicine.

[8]  V. Marks,et al.  False-positive immunoassay results: a multicenter survey of erroneous immunoassay results from assays of 74 analytes in 10 donors from 66 laboratories in seven countries. , 2002, Clinical chemistry.

[9]  René Dybkaer From total allowable error via metrological traceability to uncertainty of measurement of the unbiased result , 1999 .

[10]  T Groth,et al.  Factors influencing the quality of analytical methods. A systems analysis, with use of computer simulation. , 1974, Clinical chemistry.

[11]  Julian H Barth,et al.  Wrong biochemistry results: two case reports and observational study in 5310 patients on potentially misleading thyroid-stimulating hormone and gonadotropin immunoassay results. , 2002, Clinical chemistry.

[12]  Jan S Krouwer,et al.  Setting performance goals and evaluating total analytical error for diagnostic assays. , 2002, Clinical chemistry.

[13]  A Kallner,et al.  Quality specifications based on the uncertainty of measurement. , 1999, Scandinavian journal of clinical and laboratory investigation.

[14]  R Dybkaer,et al.  Setting quality specifications for the future with newer approaches to defining uncertainty in laboratory medicine. , 1999, Scandinavian journal of clinical and laboratory investigation.

[15]  Mauro Panteghini,et al.  Performance of today's cardiac troponin assays and tomorrow's. , 2002, Clinical chemistry.

[16]  J. Kristiansen,et al.  Description of a Generally Applicable Model for the Evaluation of Uncertainty of Measurement in Clinical Chemistry , 2001, Clinical chemistry and laboratory medicine.

[17]  P O Collinson,et al.  Measurement of Cardiac Troponins , 2001, Annals of clinical biochemistry.

[18]  Ulf Örnemark,et al.  Evaluation of Uncertainty of Measurement in Routine Clinical Chemistry - Applications to Determination of the Substance Concentration of Calcium and Glucose in Serum , 2002, Clinical chemistry and laboratory medicine.

[19]  G. Klee,et al.  Analytic bias specifications based on the analysis of effects on performance of medical guidelines. , 1999, Scandinavian journal of clinical and laboratory investigation.

[20]  L. Cole,et al.  False-positive hCG assay results leading to unnecessary surgery and chemotherapy and needless occurrences of diabetes and coma. , 1999, Clinical chemistry.

[21]  N W Tietz,et al.  A model for a comprehensive measurement system in clinical chemistry. , 1979, Clinical chemistry.

[22]  L. Cole,et al.  False diagnosis and needless therapy of presumed malignant disease in women with false-positive human chorionic gonadotropin concentrations , 2000, The Lancet.