Errors in variables in multinomial choice modeling: A simulation study applied to a multinomial logit model of travel mode choice

Modeling travel demand is a vital part of transportation planning and management. Level of service (LOS) attributes representing the performance of transportation system and characteristics of travelers including their households are major factors determining the travel demand. Information on actual choice and characteristics of travelers is obtained from a travel survey at an individual level. Since accurate measurement of LOS attributes such as travel time and cost components for different travel modes at an individual level is critical, they are normally obtained from network models. The network-based LOS attributes introduce measurement errors to individual trips thereby causing errors in variables problem in a disaggregate model of travel demand. This paper investigates the possible structure and magnitude of biases introduced to the coefficients of a multinomial logit model of travel mode choice due to random measurement errors in two variables, namely, access/egress time for public transport and walking and cycling distance to work. A model was set up that satisfies the standard assumptions of a multinomial logit model. This model was estimated on a data set from a travel survey on the assumption of correctly measured variables. Subsequently random measurement errors were introduced and the mean values of the parameters from 200 estimations were presented and compared with the original estimates. The key finding in this paper is that errors in variables result in biased parameter estimates of a multinomial logit model and consequently leading to poor policy decisions if the models having biased parameters are applied in policy and planning purposes. In addition, the paper discusses some potential remedial measures and identifies research topics that deserve a detailed investigation to overcome the problem. The paper therefore significantly contributes to bridge the gap between theory and practice in transport.

[1]  Michel Bierlaire,et al.  Route choice modeling with network-free data , 2008 .

[2]  D. Hensher Stated preference analysis of travel choices: the state of practice , 1994 .

[3]  B. Casto,et al.  Car sharing demand estimation and urban transport demand modelling using stated preference techniques , 2008 .

[4]  Juan de Dios Ortúzar,et al.  Forecasting and data aggregation: theory and practice , 1990 .

[5]  K. Train,et al.  Forecasting new product penetration with flexible substitution patterns , 1998 .

[6]  Youssef Dehghani,et al.  Models for transportation level of service , 1980 .

[7]  T. Wansbeek Measurement error and latent variables in econometrics , 2000 .

[8]  Antti Talvitie,et al.  COMPARISON OF OBSERVED AND CODED NETWORK TRAVEL TIME AND COST MEASUREMENTS , 1979 .

[9]  Cheng Hsiao,et al.  Robust estimation of generalized linear models with measurement errors , 2004 .

[10]  D. McFadden Econometric analysis of qualitative response models , 1984 .

[11]  John Bound,et al.  Measurement error in survey data , 2001 .

[12]  David Brownstone,et al.  Discrete Choice Modeling for Transportation , 2001 .

[13]  C. Kao,et al.  Errors in variables in the multinomial response model , 1987 .

[14]  Yingyao Hu Bounding parameters in a linear regression model with a mismeasured regressor using additional information , 2006 .

[15]  Zvi Griliches,et al.  Specification Error in Probit Models , 1985 .

[16]  Moshe Ben-Akiva,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1985 .

[17]  Sumeeta Srinivasan,et al.  Travel demand models in the developing world: correcting for measurement errors , 2010 .

[18]  A. Daly,et al.  Flexible substitution patterns in models of mode and time of day choice: new evidence from the UK and the Netherlands , 2007 .

[19]  A. Satorra,et al.  Measurement Error Models , 1988 .

[20]  Ming Zhang,et al.  Metrics of Urban Form and the Modifiable Areal Unit Problem , 2005 .

[21]  Kenneth E. Train,et al.  Discrete Choice Methods with Simulation , 2016 .

[22]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[23]  William Alonso,et al.  PREDICTING BEST WITH IMPERFECT DATA , 1968 .

[24]  Kang-Tsung Chang,et al.  Effects of Zoning Structure and Network Detail on Traffic Demand Modeling , 2002 .

[25]  Bharat P. Bhatta,et al.  Are intrazonal trips ignorable , 2011 .

[26]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .