Theoretical Distributions of Optima for Univariate Discrimination of Random Data

Optimal linear discriminant models maximize percentage accuracy for dichotomous classifications, but are rarely used because a theoretical framework that allows one to make valid statements about the statistical significance of the outcomes of such analyses does not exist. This paper describes an analytic solution for the theoretical distribution of optimal values for univariate optimal linear discriminant analysis, under the assumption that the data are random and continuous. We also present the theoretical distribution for sample sizes up to N= 30. The discovery of a statistical framework for evaluating the performance of optimal discriminant models should greatly increase their use by scientists in all disciplines.

[1]  John M. Liittschwager,et al.  Integer Programming Solution of a Classification Problem , 1978 .

[2]  R. E. Warmack,et al.  An Algorithm for the Optimal Solution of Linear Inequalities and its Application to Pattern Recognition , 1973, IEEE Transactions on Computers.

[3]  S. M. Bajgier,et al.  AN EXPERIMENTAL COMPARISON OF STATISTICAL AND LINEAR PROGRAMMING APPROACHES TO THE DISCRIMINANT PROBLEM , 1982 .

[4]  D. Titterington,et al.  Comparison of Discrimination Techniques Applied to a Complex Data Set of Head Injured Patients , 1981 .

[5]  W. Gehrlein General mathematical programming formulations for the statistical classification problem , 1986 .

[6]  D H Singer,et al.  Heart rate variability and sudden death secondary to coronary artery disease during ambulatory electrocardiographic monitoring. , 1987, The American journal of cardiology.

[7]  Gary J. Koehler,et al.  Linear Discriminant Functions Determined by Genetic Search , 1991, INFORMS J. Comput..

[8]  Prediction of Protein Function by Discriminant Analysis , 1986 .

[9]  Robert A. Eisenbeis,et al.  PITFALLS IN THE APPLICATION OF DISCRIMINANT ANALYSIS IN BUSINESS, FINANCE, AND ECONOMICS , 1977 .

[10]  P. Jurs Pattern recognition used to investigate multivariate data in analytical chemistry. , 1986, Science.

[11]  William C. Wedley,et al.  Optimal Criterion Weights in Repetitive Multicriteria Decision-Making , 1985 .

[12]  Gary J. Koehler,et al.  Minimizing Misclassifications in Linear Discriminant Analysis , 1990 .

[13]  Toshihide Ibaraki,et al.  Adaptive Linear Classifier by Linear Programming , 1970, IEEE Trans. Syst. Sci. Cybern..