论文信息 - ONE MORE TIME ABOUT R 2 MEASURES OF FIT IN LOGISTIC REGRESSION

ONE MORE TIME ABOUT R 2 MEASURES OF FIT IN LOGISTIC REGRESSION

In logistic regression, the demand for pseudo R measures of fit is undeniable. There are at least a half dozen such measures, with little consensus on which is preferable. Two of them, both based on the maximum likelihood, are used in almost all statistical software systems. The first, R1, has been implemented in SAS and SPSS. The second, R2, (also known as McFadden’s R, RMF , the deviance RDEV and the entropy RE) is implemented in STATA and SUDAAN as well as SPSS. Until recently these two measures have been considered independent. We will show in our presentation, which is a sequel to our SUGI 25 paper, that there exists a one-to-one correspondence between R1 and R2. If we know one of them, we know the other. The relationship between these measures of fit is required to understand which of them is preferred on a theoretical basis. To make this choice we consider our ability to interpret the measure in a reasonable way, the measure’s dependence on the base rate as well as its degree of susceptibility to overdispersion. We conclude that R2 should be regarded as the standard R measure.

[1] J. Kent. Information gain and a general measure of correlation , 1983 .

[2] M Schemper,et al. Explained variation for logistic regression. , 1996, Statistics in medicine.

[3] N. Nagelkerke,et al. A note on a general definition of the coefficient of determination , 1991 .

[4] Hans C. Jessen,et al. Applied Logistic Regression Analysis , 1996 .

[5] Sara Moore,et al. WHY WE NEED AN R 2 MEASURE OF FIT (AND NOT ONLY ONE) IN PROC LOGISTIC AND PROC GENMOD , 1998 .

[6] S. Menard. Coefficients of Determination for Multiple Logistic Regression Analysis , 2000 .

[7] G. S. Maddala,et al. Limited-dependent and qualitative variables in econometrics: Two-stage estimation methods , 1983 .

[8] M Schemper,et al. Computing measures of explained variation for logistic regression models. , 1999, Computer methods and programs in biomedicine.

[9] Stanley Lemeshow,et al. Applied Logistic Regression, Second Edition , 1989 .

[10] Alan Agresti,et al. Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[11] Trevor Hastie,et al. A Closer Look at the Deviance , 1987 .