Quality of Analytical Measurements: Statistical Methods for Internal Validation

Every day millions of analytical determinations are made in thousands of laboratories all around the world. These measurements are necessary for evaluating merchandise in the commercial interchanges, supporting health care, nourishing security, quality control of water and the environment, characterization of raw materials and manufactured products, and forensic analyses. Practically, any aspect of the contemporary social activity is somehow supported in the analytical measurements. The cost of these measurements is high, but the cost of the decisions made based on incorrect results is much greater. For example, a test that wrongly shows the presence of a forbidden substance in a food destined for human consumption can result in an expensive claim, the confirmation of the presence of an abuse drug can lead to serious judicial sentences, and doping in the sport practice may result in severe sanction. The importance of providing a correct result is evident and also of being able to demonstrate that this result is correct. Once an analytical problem is posed to a laboratory and the analytical method is selected, the next step is the in-house validation of a method. This is the process of defining the analytical requirements to respond to the problem and to confirm that the considered method has performance characteristics consistent with those required. The results of the validation experiments have to be evaluated in order to ensure that the method meets the measurement required specification. This chapter focuses on statistical evaluation of the data in the context of validation of a method to show what information can, or cannot, be extracted from the experimental results. In Section 1.02.1, the link between measurement method and a random variable is explained to show how the probability is the natural form of expressing experimental uncertainty. Section 1.02.2 describes confidence intervals to measure bias and precision under the normality hypothesis. Also, a nonparametric interval on the median and the tolerance intervals, useful in evaluating the fit for purpose of a method, are described. In Section 1.02.3, decision making on the basis of experimental data, therefore affected by uncertainty, is described. In this section, the computation of the power of a test is systematically proposed as a key element to evaluate the quality of the decision at the desired significance level. A brief incursion into tests based on intervals is also made as they solve the problem of deciding whether an interval of values is acceptable, for example, a relative error smaller than 10% in absolute value. The section ends with some tests to evaluate the compatibility of a theoretical distribution with experimental data. Section 1.02.4 is dedicated to the analysis of variance (ANOVA) for both fixed and random effects, and in Section 1.02.5 some more specific questions related to the usual parameters of the process of validation of an analysis method are analyzed. Mathematical proofs are not covered in this chapter and to be operative from a practical point of view, several examples have been included so that the reader can verify the understanding of the formulas and the line of argument for their thoughtful use. This aspect is completed with the inclusion of an Appendix where some essential aspects related to the effectiveness of the statistical models and the limits laws are described. In the Appendix, in MATLAB® code, the necessary sentences to do all the proposed calculations in the chapter, including those of the power of tests, are incorporated.

[1]  Max Feinberg,et al.  Bootstrap-based tolerance intervals for application to method validation , 2007 .

[2]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[3]  L. Brown,et al.  An unbiased test for the bioequivalence problem , 1997 .

[4]  S. S. Wilks Determination of Sample Sizes for Setting Tolerance Limits , 1941 .

[5]  Abraham S. Goldin,et al.  STATISTICAL TECHNIQUES FOR COLLABORATIVE TESTS , 1968 .

[6]  V. L. Anderson,et al.  Design of Experiments: A Realistic Approach , 1974 .

[7]  Max Feinberg,et al.  Validation of analytical methods based on accuracy profiles. , 2007, Journal of chromatography. A.

[8]  Roger Willink,et al.  On using the Monte Carlo method to calculate uncertainty intervals , 2006 .

[9]  Ignacio Lira,et al.  Comparison between the conventional and Bayesian approaches to evaluate measurement data , 2006 .

[10]  Jagdish K. Patel,et al.  Tolerance limits - a review , 1986 .

[11]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[12]  Luisa T. Fernholz,et al.  Content-Corrected Tolerance Limits Based on the Bootstrap , 2001, Technometrics.

[13]  P. R. Fisk,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1971 .

[14]  S. R. Searle Linear Models , 1971 .

[15]  Stephen L. R. Ellison,et al.  Reporting measurement uncertainty and coverage intervals near natural limits. , 2006, The Analyst.

[16]  M. C. Ortiz,et al.  Robustness of the extraction step when parallel factor analysis (PARAFAC) is used to quantify sulfonamides in kidney by high performance liquid chromatography-diode array detection (HPLC-DAD). , 2004, The Analyst.

[17]  P. Hubert,et al.  Using tolerance intervals in pre-study validation of analytical methods to predict in-study results. The fit-for-future-purpose concept. , 2007, Journal of chromatography. A.

[18]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[19]  G. Kateman,et al.  Quality control in analytical chemistry , 1981 .

[20]  S. Wellek Testing Statistical Hypotheses of Equivalence , 2002 .

[21]  J. Castillo,et al.  Bioestadística para las ciencias de la salud ( , 2004 .

[22]  E. Lehmann,et al.  Testing Statistical Hypothesis. , 1960 .

[23]  H. Scheffé,et al.  The Analysis of Variance , 1960 .

[24]  H. Schouten,et al.  Sample size formula with a continuous outcome for unequal group sizes and unequal variances. , 1999, Statistics in medicine.

[25]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[26]  Eric Ziemons,et al.  Analysis of recent pharmaceutical regulatory documents on analytical method validation. , 2007, Journal of chromatography. A.

[27]  Johanna Smeyers-Verbeke,et al.  Comparison of alternative measurement methods , 1999 .

[28]  Donald J. Schuirmann A comparison of the Two One-Sided Tests Procedure and the Power Approach for assessing the equivalence of average bioavailability , 1987, Journal of Pharmacokinetics and Biopharmaceutics.

[29]  J. Wolfowitz,et al.  Tolerance Limits for a Normal Distribution , 1946 .

[30]  P. C. Meier,et al.  Statistical Methods in Analytical Chemistry , 2005 .

[31]  Günter Zech Frequentist and Bayesian confidence intervals , 2001 .

[32]  Inmaculada García,et al.  Usefulness of D-optimal designs and multicriteria optimization in laborious analytical procedures. Application to the extraction of quinolones from eggs. , 2005, Journal of chromatography. A.

[33]  Maurice G. Kendall,et al.  The Advanced Theory of Statistics, Vol. 2: Inference and Relationship , 1979 .

[34]  Giselle Limentani,et al.  Beyond the t-test: statistical equivalence testing. , 2005, Analytical chemistry.

[35]  Günther H. Mehring On optimal tests for general interval-hypotheses , 1993 .

[36]  P. Sprent,et al.  Applied nonparametric statistical methods , 1988 .

[37]  M Laurentie,et al.  Harmonization of strategies for the validation of quantitative analytical procedures. A SFSTP proposal--part II. , 2004, Journal of pharmaceutical and biomedical analysis.

[38]  A. G. González,et al.  Accuracy profiles from uncertainty measurements. , 2006, Talanta.

[39]  Testing Average Equivalence — Finding a Compromise Between Theory and Practice , 2000 .

[40]  D. Massart,et al.  Comparison of alternative measurement methods: determination of the minimal number of measurements required for the evaluation of the bias by means of interval hypothesis testing , 2000 .

[41]  Philippe Hubert,et al.  Using total error as decision criterion in analytical method transfer , 2007 .

[42]  M. Evans Statistical Distributions , 2000 .

[43]  Y. Heyden,et al.  Reappraisal of hypothesis testing for method validation : detection of systematic error by comparing the means of two methods or of two laboratories , 1995 .