Conditional Fisher's exact test as a selection criterion for pair-correlation method. Type I and Type II errors

Abstract The pair-correlation method (PCM) has been developed recently for discrimination between two variables. PCM can be used to identify the decisive (fundamental, basic) factor from among correlated variables even in cases when all other statistical criteria fail to indicate significant difference. These decisions are needed frequently in QSAR studies and/or chemical model building. The conditional Fisher's exact test, based on testing significance in the 2×2 contingency tables is a suitable selection criterion for PCM. The test statistic provides a probabilistic aid for accepting the hypothesis of significant differences between two factors, which are almost equally correlated with the response (dependent variable). Differentiating between factors can lead to alternative models at any arbitrary significance level. The power function of the test statistic has also been deduced theoretically. A similar derivation was undertaken for the description of the influence of Type I (false-positive conclusion, error of the first kind) and Type II (false-negative conclusion, error of the second kind) errors. The appropriate decision is indicated from the low probability levels of both false conclusions.

[1]  J J Gart,et al.  The determination of sample sizes for use with the exact conditional test in 2 x 2 comparative trials. , 1973, Biometrics.

[2]  M. Thompson Selection of Variables in Multiple Regression: Part I. A Review and Evaluation , 1978 .

[3]  Milton Abramowitz,et al.  Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables , 1964 .

[4]  I. Vincze,et al.  Mathematische Statistik mit industriellen Anwendungen , 1974 .

[5]  E. Lehmann Testing Statistical Hypotheses. , 1997 .

[6]  E. Lehmann Testing Statistical Hypotheses , 1960 .

[7]  Johanna Smeyers-Verbeke,et al.  Handbook of Chemometrics and Qualimetrics: Part A , 1997 .

[8]  D. J. Finney,et al.  Tables for Testing Significance In a 2 × 3 Contingency Table , 1963 .

[9]  K. Héberger,et al.  Rate constants for the addition of the 2-hydroxy-2-propyl radical to alkenes in solution: ADDITION OF 2-HYDROXY-2-PROPYL RADICAL , 1993 .

[10]  N. M. Faber,et al.  Determining the optimal sample size in forensic casework – with application to fibres , 1999 .

[11]  W. H. Robertson Programming Fisher's Exact Method of Comparing Two Percentages , 1960 .

[12]  D. Massart,et al.  Elimination of uninformative variables for multivariate calibration. , 1996, Analytical chemistry.

[13]  B. Bowerman Statistical Design and Analysis of Experiments, with Applications to Engineering and Science , 1989 .

[14]  K. Héberger,et al.  Rate constants for the addition of the 2-cyano-2-propyl radical to alkenes in solution , 1993 .

[15]  Granville William Anthony,et al.  The Elements of the Differential and Integral Calculus , 1905, Nature.

[16]  R. Rajkó Treatment of Model Error in Calibration by Robust and Fuzzy Procedures , 1994 .

[17]  W. J. Conover,et al.  Practical Nonparametric Statistics , 1972 .

[18]  M. Thompson Selection of Variables in Multiple Regression: Part II. Chosen Procedures, Computations and Examples , 1978 .

[19]  O. J. Dunn,et al.  Applied statistics: analysis of variance and regression , 1975 .

[20]  O. J. Dunn,et al.  Applied statistics: analysis of variance and regression (2nd ed.) , 1986 .