Tutorial on evaluation of type I and type II errors in chemical analyses: from the analytical detection to authentication of products and process control.

Uncertainty is inherent in all experimental determinations. Nevertheless, these measurements are used to make decisions including the performance of the own measurement systems. The link between the decision and the true implicit system that generates the data (measurement system, production process, category of samples, etc.) is a representation of this uncertainty as a probability distribution. This representation leads to the probabilistic formalization of the possibility of making errors. In the context of regulations established by official agencies, it is important to use these statistical decision methods in some cases because the own norm makes them mandatory and, in general, because this is the way of reasonably evaluating whether a working hypothesis is rejected on the basis of the experimental data. The aim of the present tutorial is to introduce some ideas and basic methods for the critical analysis of experimental data. With this goal, the basic elements of the Neyman-Pearson theory of hypothesis testing are formally introduced in connection with the common problems in chemical analysis and, if this is the case, their relation to the norms of regulatory agencies. The notion of decision with 'enough quality' is modelled when explicitly considering: (1) the null, H(0), and alternative, H(1), hypotheses. (2) The significance level of the test, which is the probability, alpha, of rejecting H(0) when it is true, and the power of the test, 1-beta, beta being the probability of accepting H(0) when it is false. (3) The difference between H(0) and H(1) that has to be detected with experimental data. (4) The needed sample size. These four concepts should be explicitly defined for each problem and, under the usual assumption of normal distribution of the data, the mathematical relations among these concepts are shown, which allow the analyst to design a decision rule with pre-set values of alpha and beta. To illustrate the unifying character of this inferential methodology, several situations are exposed along the tutorial: the design of a hypothesis test to decide on the performance characteristics of analytical methods, the capability of detection of both quantitative and qualitative analytical methods (including its generalization to the case of multivariate and/or multiway signals), the analytical sensitivity with multivariate signals, the class-modelling and the process control.

[1]  M. C. Ortiz,et al.  Multivariate analytical sensitivity in the determination of selenium, copper, lead and cadmium by stripping voltammetry when using soft calibration , 2003 .

[2]  M. C. Ortiz,et al.  Sequential probability ratio test for evaluating functionality in acid–base potentiometry , 2005 .

[3]  M. C. Ortiz,et al.  Capability of discrimination : application to soft calibration methods , 2001 .

[4]  Luis A. Sarabia,et al.  How to search the experimental conditions that improve a Partial Least Squares calibration model. Application to a flow system with electrochemical detection for the determination of sulfonamides in milk , 2008 .

[5]  Douglas C. Montgomery,et al.  Introduction to Statistical Quality Control , 1986 .

[6]  Kaj Heydorn,et al.  Quality assurance of qualitative analysis in the framework of the European project ’MEQUALAN’ , 2003 .

[7]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[8]  M. C. Ortiz,et al.  A multivariate multianalyte screening method for sulfonamides in milk based on front-face fluorescence spectroscopy. , 2010, Analytica chimica acta.

[9]  M. C. Ortiz,et al.  Sensitivity and specificity of PLS-class modelling for five sensory characteristics of dry-cured ham using visible and near infrared spectroscopy , 2006 .

[10]  M. C. Ortiz,et al.  Advantages of PARAFAC calibration in the determination of malachite green and its metabolite in fish by liquid chromatography-tandem mass spectrometry. , 2008, Journal of chromatography. A.

[11]  M. C. Ortiz,et al.  DETARCHI: A program for detection limits with specified assurance probabilities and characteristic curves of detection , 1994 .

[12]  Desire L. Massart,et al.  Supervised pattern recognition: the ideal method? , 1986 .

[13]  M. C. Ortiz,et al.  Rapid determination of sulfonamides in milk samples using fluorescence spectroscopy and class modeling with n-way partial least squares. , 2007, Analytica chimica acta.

[14]  H L Pardue,et al.  The inseparable triad: analytical sensitivity, measurement uncertainty, and quantitative resolution. , 1997, Clinical chemistry.

[15]  J. M. Bernardo,et al.  Bayesian Methodology in Statistics , 2009 .

[16]  M. C. Ortiz,et al.  Pareto-optimal front as a tool to study the behaviour of experimental factors in multi-response analytical procedures. , 2008, Analytica chimica acta.

[17]  Anthony C. Davison,et al.  Bootstrap Methods and Their Application , 1998 .

[18]  Douglas C. Montgomery,et al.  Applied Statistics and Probability for Engineers, Third edition , 1994 .

[19]  Luis A. Sarabia,et al.  Vectorial optimization as a methodogical alternative to desirability function , 2006 .

[20]  J. Murdoch Shewhart Control Charts , 1979 .

[21]  M. C. Ortiz,et al.  Class-modelling techniques that optimize the probabilities of false noncompliance and false compliance , 2010 .

[22]  Luis A. Sarabia,et al.  On Pareto-optimal fronts for deciding about sensitivity and specificity in class-modelling problems , 2005 .

[23]  M. C. Ortiz,et al.  Analysis and comparison of SIMCA models for denominations of origin of wines from de Canary Islands (Spain) builds by means of their trace and ultratrace metals content , 2002 .

[24]  P. Bievre The 2007 International Vocabulary of Metrology (VIM), JCGM 200:2008 [ISO/IEC Guide 99]: Meeting the need for intercontinentally understood concepts and their associated intercontinentally agreed terms. , 2009 .

[25]  Luis A. Sarabia,et al.  Typification of vinegars from Jerez and Rioja using classical chemometric techniques and neural network methods , 1999 .

[26]  D. Massart,et al.  UNEQ: a disjoint modelling technique for pattern recognition based on normal distribution , 1986 .

[27]  R. Lyman Ott.,et al.  An introduction to statistical methods and data analysis , 1977 .

[28]  Tom Fearn,et al.  Characterising the performance of qualitative analytical methods: Statistics and terminology , 2005 .

[29]  M. C. Ortiz,et al.  Identification and quantification of ciprofloxacin in urine through excitation-emission fluorescence and three-way PARAFAC calibration. , 2009, Analytica chimica acta.

[30]  M. D. Luque de Castro,et al.  Minimum value assured by a method to determine gold in alloys by using laser-induced breakdown spectroscopy and partial least-squares calibration model , 2004 .

[31]  Luis A. Sarabia,et al.  A stochastic trained neural network for nonparametric hypothesis testing , 2002 .

[32]  C. Andrew. Clayton,et al.  Detection limits with specified assurance probabilities , 1987 .

[33]  Candin Liteanu,et al.  Statistical theory and methodology of trace analysis , 1980 .

[34]  L. A. Konopel'ko,et al.  Uncertainty of Qualitative Chemical Analysis: General Methodology and Binary Test Methods , 2004 .

[35]  A. J. Ferrer-Riquelme,et al.  Statistical Control of Measures and Processes , 2009 .

[36]  Caporal-Gautier,et al.  Guide de validation analytique. Rapport d'une commission SFSTP. I : Méthodologie , 1992 .

[37]  Lloyd A. Currie,et al.  Detection and quantification limits: origins and historical overview , 1997 .

[38]  M. C. Ortiz,et al.  Quality of Analytical Measurements: Statistical Methods for Internal Validation , 2009 .

[39]  Magni Martens,et al.  Multivariate Analysis of Quality : An Introduction , 2001 .

[40]  M. C. Ortiz,et al.  Quality of Analytical Measurements: Univariate Regression , 2020, Comprehensive Chemometrics.

[41]  Luis A. Sarabia,et al.  Response Surface Methodology , 2009 .

[42]  S. D. Jong,et al.  Handbook of Chemometrics and Qualimetrics , 1998 .

[43]  J. Sáez,et al.  Typification of alcoholic distillates by multivariate techniques using data from chromatographic analyses , 1993 .

[44]  M. C. Ortiz,et al.  Capability of detection and three-way data , 2006 .

[45]  Luis A. Sarabia,et al.  GINN (Genetic Inside Neural Network): towards a non-parametric training , 1997 .

[46]  D. Massart,et al.  Comparison of alternative measurement methods: determination of the minimal number of measurements required for the evaluation of the bias by means of interval hypothesis testing , 2000 .

[47]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[48]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[49]  David R. Cox,et al.  PRINCIPLES OF STATISTICAL INFERENCE , 2017 .

[50]  M. C. Ortiz,et al.  Detection capability of tetracyclines analysed by a fluorescence technique: comparison between bilinear and trilinear partial least squares models , 2004 .