LogitBoost with errors-in-variables

The logistic regression model is a popular tool for relating a binary outcome to a set of covariates. In many applications, the covariates of this model are measured with error. An approach to nonparametric logistic regression with covariate measurement error is presented. The estimate of the log-odds is formed using boosted regression trees. The algorithm uses gradient boosting to fit the trees, and their coefficients are determined using an estimating equation closely related to the likelihood score function. The method is examined using simulations.

[1]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[2]  David M. Rocke,et al.  A Model for Measurement Error for Gene Expression Arrays , 2001, J. Comput. Biol..

[3]  H. Robbins A Stochastic Approximation Method , 1951 .

[4]  Jianqing Fan,et al.  Nonparametric regression with errors in variables , 1993 .

[5]  P. Laake,et al.  Boosted Regression Trees with Errors in Variables , 2007, Biometrics.

[6]  D. Ruppert,et al.  Nonparametric regression in the presence of measurement error , 1999 .

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  A. K. Md. Ehsanes Saleh,et al.  Rank tests and regression rank score tests in measurement error models , 2010, Comput. Stat. Data Anal..

[9]  Greg Ridgeway,et al.  Generalized Boosted Models: A guide to the gbm package , 2006 .

[10]  Finbarr O'Sullivan,et al.  An Analysis of the Role of Positivity and Mixture Model Constraints in Poisson Deconvolution Problems , 2001 .

[11]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[12]  Scott M. Berry,et al.  Bayesian Smoothing and Regression Splines for Measurement Error Problems , 2002 .

[13]  F. Kong,et al.  A stochastic approximation algorithm with Markov chain Monte-carlo method for incomplete data estimation problems. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Lior Rokach,et al.  Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography , 2009, Comput. Stat. Data Anal..

[16]  M. Wand,et al.  ADDITIVE MODELS WITH PREDICTORS SUBJECT TO MEASUREMENT ERROR , 2005 .

[17]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[18]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .