Multiple group logistic discrimination

Among the many possible approaches suggested for statistical discrimination, the logistic method can be classified midway between fully distributional solutions, of which the assumption of multivariate normality is a classical example[16l, and the distribution-free techniques, using, for instance, kernel or nearest-neighbor methods[l,32]. Therefore it is often called a partially parametric or partially distributional method[l l]. This central position of the logistic model makes it one of the most attractive and widely used tools for solving regression and discrimination problems. Indeed, since there are fewer distributional assumptions than for fully parametric models, the logistic method is applicable to a larger family of multivariate distributions involving both discrete and continuous variables. Moreover, in spite of its wide applicability and generality, the method remains feasible and easy to use, in contrast with nonparametric methods. This paper is intended to review the basic ideas and principles of logistic discrimination[415,22-25] and also to bring additional results to some of the queries raised by Professor Anderson before his death. We restrict our attention to discrimination between qualitatively distinct groups and do not envisage the case where groups are quantitatively distinct or ordered[3,12,14]. We dedicate this paper to our friend and mentor, the late Professor J. A. Anderson, for his fundamental contribution to discriminant analysis and for his continuous support of our research efforts.

[1]  R. Tibshirani,et al.  Local Likelihood Estimation , 1987 .

[2]  A Villers Ruiz,et al.  Statistical package for the social sciences (spss) , 1981 .

[3]  J. Copas Plotting p against x , 1983 .

[4]  A. Tsiatis A note on a goodness-of-fit test for the logistic regression model , 1980 .

[5]  J. Anderson Multivariate logistic compounds , 1979 .

[6]  J. Lawless,et al.  Efficient Screening of Nonnormal Regression Models , 1978 .

[7]  N. Glick Sample-Based Classification Procedures Derived from Density Estimators , 1972 .

[8]  J. Habbema,et al.  Cases of doubt in allocation problems , 1974 .

[9]  A. Wald Tests of statistical hypotheses concerning several parameters when the number of observations is large , 1943 .

[10]  P. Gill,et al.  Quasi-Newton Methods for Unconstrained Optimization , 1972 .

[11]  J. Habbema,et al.  The measurement of performance in probabilistic diagnosis. III. Methods based on continuous functions of the diagnostic probabilities. , 1978, Methods of information in medicine.

[12]  J. Aitchison,et al.  Multivariate binary discrimination by the kernel method , 1976 .

[13]  D. Hosmer,et al.  A review of goodness of fit statistics for use in the development of logistic regression models. , 1982, American journal of epidemiology.

[14]  R. Prentice,et al.  A generalization of the probit and logit methods for dose response curves. , 1976, Biometrics.

[15]  J. A. Anderson,et al.  Probit and logistic discriminant functions , 1981 .

[16]  D. Stablein,et al.  Statistical methods for determining prognosis in severe head injury. , 1980, Neurosurgery.

[17]  V. E. Kane,et al.  Assessment of fisher and logistic linear and quadratic discrimination models , 1983 .

[18]  D. Pregibon Goodness of Link Tests for Generalized Linear Models , 1980 .

[19]  J. Anderson,et al.  Regression, Discrimination and Measurement Models for Ordered Categorical Variables , 1981 .

[20]  Wojtek J. Krzanowski,et al.  Discrimination and Classification Using Both Binary and Continuous Variables , 1975 .

[21]  D. Pregibon,et al.  Graphical Methods for Assessing Logistic Regression Models , 1984 .

[22]  J. A. Anderson,et al.  7 Logistic discrimination , 1982, Classification, Pattern Recognition and Reduction of Dimensionality.

[23]  Critical evaluation of quadratic logistic discriminant analysis methods: a case study. , 1985, Computers and biomedical research, an international journal.

[24]  Calyampudi Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications , 1967 .

[25]  S. Haberman,et al.  The analysis of frequency data , 1974 .

[26]  D. Pregibon Logistic Regression Diagnostics , 1981 .

[27]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[28]  J. Anderson Regression and Ordered Categorical Variables , 1984 .

[29]  W. A. Larsen,et al.  The Use of Partial Residual Plots in Regression Analysis , 1972 .

[30]  J. A. Anderson,et al.  Diagnosis by Logistic Discriminant Function: Further Practical Problems and Results , 1974 .

[31]  D. J. Spiegelhalter,et al.  Statistical and Knowledge‐Based Approaches to Clinical Decision‐Support Systems, with an Application in Gastroenterology , 1984 .

[32]  J. Anderson,et al.  Penalized maximum likelihood estimation in logistic regression and discrimination , 1982 .

[33]  J. Cornfield Joint dependence of risk of coronary heart disease on serum cholesterol and systolic blood pressure: a discriminant function analysis. , 1962, Federation proceedings.

[34]  Víctor M. Guerrero,et al.  Use of the Box-Cox transformation with binary response models , 1982 .

[35]  David R. Cox The analysis of binary data , 1970 .

[36]  F. N. David,et al.  LINEAR STATISTICAL INFERENCE AND ITS APPLICATION , 1967 .

[37]  T. W. Anderson,et al.  An Introduction to Multivariate Statistical Analysis , 1959 .

[38]  D. Hosmer,et al.  Goodness of fit tests for the multiple logistic regression model , 1980 .

[39]  J. A. Anderson,et al.  Quadratic logistic discrimination , 1975 .

[40]  W. J. Dixon,et al.  BMDP-77, Biomedical Computer Programs, P-Series , 1979 .

[41]  Day Ne,et al.  A GENERAL MAXIMUM LIKELIHOOD DISCRIMINANT , 1967 .

[42]  A. Albert,et al.  On the existence of maximum likelihood estimates in logistic regression models , 1984 .

[43]  H. Chernoff On the Distribution of the Likelihood Ratio , 1954 .

[44]  J. Anderson Separate sample logistic discrimination , 1972 .

[45]  J. Neyman,et al.  Research Papers in Statistics. Festschrift for J. Neyman F.N. David editor, assisted by E. Fix. London, New York, Sydney, J. Wiley & Sons, 1966, VIII p. 468 p., 105/–. , 1968, Recherches économiques de Louvain.

[46]  G. Plomteux Multivariate analysis of an enzymic profile for the differential diagnosis of viral hepatitis. , 1980, Clinical chemistry.

[47]  R. Gray,et al.  Calculation of polychotomous logistic regression parameters using individualized regressions , 1984 .

[48]  M. Hills Allocation Rules and Their Error Rates , 1966 .

[49]  T R Holford,et al.  A stepwise variable selection procedure for nonlinear regression models. , 1980, Biometrics.

[50]  P. Lachenbruch An almost unbiased method of obtaining confidence intervals for the probability of misclassification in discriminant analysis. , 1967, Biometrics.

[51]  M. Silvapulle On the Existence of Maximum Likelihood Estimators for the Binomial Response Models , 1981 .

[52]  D. Titterington,et al.  Comparison of Discrimination Techniques Applied to a Complex Data Set of Head Injured Patients , 1981 .

[53]  Trevor Hastie,et al.  Graphical Methods for Assessing Logistic Regression Models: Comment , 1984 .

[54]  J. Chapelle,et al.  Stepwise probit discrimination with specific application to short-term prognosis in acute myocardial infarction. , 1981, Computers and biomedical research, an international journal.

[55]  J. A. Anderson,et al.  Logistic Discrimination and Bias Correction in Maximum Likelihood Estimation , 1979 .

[56]  J. A. Anderson,et al.  LOGISTIC DISCRIMINATION WITH MEDICAL APPLICATIONS , 1973 .