Prediction and cross-validation of neural networks versus logistic regression: using hepatic disorders as an example.

The authors developed and cross-validated prediction models for newly diagnosed cases of liver disorders by using logistic regression and neural networks. Computerized files of health care encounters from the Fallon Community Health Plan were used to identify 1,674 subjects who had had liver-related health services between July 1, 1992, and June 30, 1993. A total of 219 subjects were confirmed by review of medical records as incident cases. The 1,674 subjects were randomly and evenly divided into training and test sets. The training set was used to derive prediction algorithms based solely on the automated data; the test set was used for cross-validation. The area under the Receiver Operating Characteristic curve for a neural network model was significantly larger than that for logistic regression in the training set (p = 0.04). However, the performance was statistically equivalent in the test set (p = 0.45). Despite its superior performance in the training set, the generalizability of the neural network model is limited. Logistic regression may therefore be preferred over neural network on the basis of its established advantages. More generalizable modeling techniques for neural networks may be necessary before they are practical for medical research.

[1]  A R Henderson,et al.  Assessing Test Accuracy and its Clinical Consequences: A Primer for Receiver Operating Characteristic Curve Analysis , 1993, Annals of clinical biochemistry.

[2]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[3]  Geoffrey E. Hinton,et al.  How neural networks learn from experience. , 1992, Scientific American.

[4]  Henderson Ar,et al.  Assessing test accuracy and its clinical consequences: a primer for receiver operating characteristic curve analysis. , 1993 .

[5]  R M Centor,et al.  A Visicalc Program for Estimating the Area Under a Receiver Operating Characteristic (ROC) Curve , 1985, Medical decision making : an international journal of the Society for Medical Decision Making.

[6]  G A Diamond,et al.  Future imperfect: the limitations of clinical prediction models and the limits of clinical prediction. , 1989, Journal of the American College of Cardiology.

[7]  F. Harrell,et al.  Regression modelling strategies for improved prognostic prediction. , 1984, Statistics in medicine.

[8]  D. Mackay,et al.  Bayesian methods for adaptive models , 1992 .

[9]  Richard Lippmann,et al.  Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.

[10]  Nathaniel C. Smith Replication studies: A neglected aspect of psychological research. , 1970 .

[11]  Léopold Simar,et al.  Computer Intensive Methods in Statistics , 1994 .

[12]  R. Conant,et al.  Reproducibility of Predictor Variables from a Validated Clinical Rule , 1992, Medical decision making : an international journal of the Society for Medical Decision Making.

[13]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[14]  J. Swets ROC analysis applied to the evaluation of medical imaging techniques. , 1979, Investigative radiology.