The Effect of Errors in Diagnosis and Measurement on the Estimation of the Probability of an Event

Abstract This article investigates the effect of misclassification and measurement error in the basic data on the asymptotic bias and efficiency of the logistic regression (LR) and normal discrimination (ND) classification procedures. The effect of misclassification in a single binary independent variable on the bias and efficiency of both procedures is also presented. Typically, asymptotic bias increases and efficiency decreases as misclassification and measurement error increase. The performance of LR relative to ND is shown to be better in the presence of error than without error.

[1]  I. Bross Misclassification in 2 X 2 Tables , 1954 .

[2]  T. W. Anderson,et al.  An Introduction to Multivariate Statistical Analysis , 1959 .

[3]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[4]  Calyampudi Radhakrishna Rao,et al.  Linear Statistical Inference and its Applications , 1967 .

[5]  P. Lachenbruch Discriminant Analysis When the Initial Samples Are Misclassified , 1966 .

[6]  J. Cornfield,et al.  A multivariate analysis of the risk of coronary heart disease in Framingham. , 1967, Journal of chronic diseases.

[7]  S. James Press,et al.  Univariate and Multivariate Log-Linear and Logistic Models , 1973 .

[8]  Abdelmonem A. Afifi,et al.  Classification Based on Dichotomous and Continuous Variables , 1974 .

[9]  Calyampudi R. Rao,et al.  Linear Statistical Inference and Its Applications. , 1975 .

[10]  B. Efron The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis , 1975 .

[11]  R. Brand,et al.  Multivariate Prediction of Coronary Heart Disease in the Western Collaborative Group Study Compared to the Findings of the Framingham Study , 1976, Circulation.

[12]  R. Brand,et al.  Multivariate prediction of coronary heart disease during 8.5 year follow-up in the Western Collaborative Group Study. , 1976, The American journal of cardiology.

[13]  John Aitchison,et al.  Statistical diagnosis when basic cases are not classified with certainty , 1976 .

[14]  A statistical classification of breast cancer patients by degree of nodal metastases , 1977 .

[15]  A. Keys,et al.  Identifying subsets of major risk factors in multivariate estimation of coronary risk. , 1977, Journal of chronic diseases.

[16]  S. J. Press,et al.  Choosing between Logistic Regression and Discriminant Analysis , 1978 .