Incorporating prior information to overcome complete separation problems in discrete choice model estimation

We describe a modified maximum likelihood method that overcomes the problem of complete or quasicomplete separation in multinomial logistic regression for discrete choice models with finite samples. The modification consists of augmenting the observed data set with artificial observations that reflect a prior distribution centered on equal choice probabilities. We demonstrate through Monte Carlo simulations of data sets as small the minimum degrees of freedom case that the modified maximum likelihood approach leads to superior parameter recovery and out of sample prediction compared to conventional maximum likelihood. We explore the role the prior weight plays on parameter recovery and out of sample prediction by varying the weight given to the prior versus the weight given to the data in the likelihood function. We demonstrate a numerical procedure to search for an appropriate weight for the prior. One application for the proposed approach is to estimate discrete choice models for single individuals using data from discrete choice experiments. We illustrate this approach with Monte Carlo simulations as well as four data sets collected using online discrete choice experiments.

[1]  M. Schemper,et al.  A solution to the problem of separation in logistic regression , 2002, Statistics in medicine.

[2]  Donald B. Rubin,et al.  Multiple Imputation of Industry and Occupation Codes in Census Public-use Samples Using Bayesian Logistic Regression , 1991 .

[3]  Nitin R. Patel,et al.  Exact logistic regression: theory and examples. , 1995, Statistics in medicine.

[4]  Rick L. Andrews,et al.  An Empirical Comparison of Logit Choice Models with Discrete versus Continuous Representations of Heterogeneity , 2002 .

[5]  Thomas J. Santner,et al.  A note on A. Albert and J. A. Anderson's conditions for the existence of maximum likelihood estimates in logistic regression models , 1986 .

[6]  Peter Goos,et al.  A Comparison of Criteria to Design Efficient Choice Experiments , 2006 .

[7]  K. Train,et al.  On the Similarity of Classical and Bayesian Estimates of Individual Mean Partworths , 2000 .

[8]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[9]  Power of Tests in Binary Response Models , 1996 .

[10]  B. Haldane THE ESTIMATION AND SIGNIFICANCE OF THE LOGARITHM OF A RATIO OF FREQUENCIES , 1956, Annals of human genetics.

[11]  David Firth,et al.  Multinomial logit bias reduction via the Poisson log-linear model , 2011 .

[12]  Warren F. Kuhfeld,et al.  Large Factorial Designs for Product Engineering and Marketing Research Applications , 2005, Technometrics.

[13]  T. P. Ryan,et al.  A Preliminary Investigation of Maximum Likelihood Logistic Regression versus Exact Logistic Regression , 2002 .

[14]  A. Albert,et al.  On the existence of maximum likelihood estimates in logistic regression models , 1984 .

[15]  Emmanuel Lesaffre,et al.  Partial Separation in Logistic Discrimination , 1989 .

[16]  J. Geweke,et al.  Contemporary Bayesian Econometrics and Statistics , 2005 .

[17]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..

[18]  D. Firth Bias reduction of maximum likelihood estimates , 1993 .

[19]  Georg Heinze,et al.  A comparative investigation of methods for logistic regression with separated or nearly separated data , 2006, Statistics in medicine.

[20]  C. Stein,et al.  Estimation with Quadratic Loss , 1992 .

[21]  Celia M. T. Greenwood,et al.  A modified score function estimator for multinomial logistic regression in small samples , 2002 .

[22]  Randall G. Chapman An Approach to Estimating Logit Models of a Single Decision Maker's Choice Behavior , 1984 .

[23]  Jerry A. Hausman,et al.  Assessing the potential demand for electric cars , 1981 .