Targeting: Logistic Regression, Special Cases and Extensions

Logistic regression is a classical linear model for the logit-transformed conditional probabilities of a binary target variable. It recovers the true conditional probabilities if the joint distribution of predictors and target is of log-linear form. Weights-of-evidence is an ordinary logistic regression whose parameters equal the differences of the weights of evidence, provided all predictor variables are discrete and conditionally independent given the target variable. The hypothesis of conditional independence can be tested in terms of log-linear models. If the assumption of conditional independence is violated, applying weights-of-evidence corrupts not only the predicted conditional probabilities but also their rank transform. Logistic regression models that include interaction terms can account for the lack of conditional independence; appropriate interaction terms compensate exactly for its violation. Multilayer artificial neural nets may be seen as nested regression-like models with some sigmoidal activation function, most often the logistic function. If the net topology, i.e., its architecture, is sufficiently versatile to mimic interaction terms, artificial neural nets can account for violations of conditional independence and yield very similar results. Weights-of-evidence cannot reasonably include interaction terms, and subsequent modifications of the weights, as often suggested, cannot emulate the effect of interaction terms.
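The two central claims above can be checked numerically: under conditional independence, the weights-of-evidence posterior logit is additive and coincides with a logistic regression whose coefficients are the contrasts C_j = W+_j − W−_j; when conditional independence fails, a saturated logistic model with an interaction term reproduces the cell logits exactly. A minimal sketch, with all population probabilities chosen purely for illustration (none are from the paper):

```python
import math
from itertools import product

# Hypothetical population probabilities (assumed for illustration only):
p_y = 0.3                    # P(Y = 1), the prior
p_x1 = {1: 0.8, 0: 0.3}      # P(X1 = 1 | Y = y), for y = 1 and y = 0
p_x2 = {1: 0.6, 0: 0.2}      # P(X2 = 1 | Y = y)

def logit(p):
    return math.log(p / (1 - p))

def weights(p_x):
    """Weights of evidence for a binary predictor: W(x) = ln P(x|Y=1)/P(x|Y=0)."""
    w_pos = math.log(p_x[1] / p_x[0])               # W+ for x = 1
    w_neg = math.log((1 - p_x[1]) / (1 - p_x[0]))   # W- for x = 0
    return {1: w_pos, 0: w_neg}

w1, w2 = weights(p_x1), weights(p_x2)

# Equivalent logistic regression: coefficients are the contrasts W+ - W-,
# and the intercept absorbs the prior logit plus the negative weights.
b0 = logit(p_y) + w1[0] + w2[0]
b1 = w1[1] - w1[0]
b2 = w2[1] - w2[0]

for x1, x2 in product((0, 1), repeat=2):
    # Direct posterior logit from the conditionally independent joint
    num = p_y * (p_x1[1] if x1 else 1 - p_x1[1]) * (p_x2[1] if x2 else 1 - p_x2[1])
    den = (1 - p_y) * (p_x1[0] if x1 else 1 - p_x1[0]) * (p_x2[0] if x2 else 1 - p_x2[0])
    direct = math.log(num / den)
    # WofE is additive, and identical to the logistic-regression form
    assert math.isclose(direct, logit(p_y) + w1[x1] + w2[x2])
    assert math.isclose(direct, b0 + b1 * x1 + b2 * x2)

# Now violate conditional independence: specify P(x1, x2 | y) jointly.
# Hypothetical tables (assumed), keyed by (x1, x2):
joint = {
    1: {(0, 0): 0.1, (0, 1): 0.1, (1, 0): 0.2, (1, 1): 0.6},  # P(. | Y = 1)
    0: {(0, 0): 0.5, (0, 1): 0.2, (1, 0): 0.2, (1, 1): 0.1},  # P(. | Y = 0)
}

def cell_logit(x1, x2):
    return logit(p_y) + math.log(joint[1][(x1, x2)] / joint[0][(x1, x2)])

# A logistic model with an interaction term matches every cell logit exactly;
# the interaction coefficient c12 is nonzero precisely because CI fails here.
c0 = cell_logit(0, 0)
c1 = cell_logit(1, 0) - c0
c2 = cell_logit(0, 1) - c0
c12 = cell_logit(1, 1) - c0 - c1 - c2
for x1, x2 in product((0, 1), repeat=2):
    assert math.isclose(cell_logit(x1, x2), c0 + c1 * x1 + c2 * x2 + c12 * x1 * x2)
```

The first loop verifies the additivity identity that makes weights-of-evidence a special case of logistic regression; the second shows that for binary predictors an interaction term compensates exactly for the violation of conditional independence, since the saturated model has as many parameters as predictor cells.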
