Modeling the Underreporting Bias in Panel Survey Data

Panel survey data have been gaining importance in marketing. However, one challenge in estimating econometric models from panel survey data is accounting for underreporting, in which respondents fail to report behavioral incidences that actually occurred. Underreporting is especially likely in a panel survey because the data-recording mechanism is often tedious, complex, and effortful. The probability of underreporting is likely to vary across respondents and over the duration of the survey period. In this paper, we propose a model that jointly describes reported behavioral incidences and partially observed actual behavioral incidences, and we estimate it with a Bayesian approach. We treat the unobserved actual behavioral incidences as latent variables; the Gibbs sampler makes it convenient to impute the nonreported consumption incidences while making inferences on the other model parameters. Our proposed method has two advantages. First, it offers a model-based approach to removing the underreporting bias in panel survey data and therefore allows marketing researchers to make more accurate inferences about consumers' actual behavior. Second, it offers a natural way to study the factors that influence respondents' propensity to underreport. We treat the underreported behavioral incidences as not missing at random, so the underreporting propensity is allowed to vary across respondents and over time. This understanding can help marketing researchers design interventions and incentives that encourage respondents to report accurately, and hence improve the quality of survey data. The proposed model and estimation approach are tested on both synthetic data and actual panel survey data on consumer-reported beverage-drinking behavior. Our analysis suggests that underreporting can significantly mask respondents' true behavior.
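
To make the data-augmentation idea concrete, the following is a minimal sketch and not the model estimated in the paper. It assumes a deliberately stripped-down setting: a single homogeneous incidence probability p, a single homogeneous reporting probability q, and no covariates, respondent heterogeneity, or time variation. The function names, the Beta priors, and the default settings are all illustrative assumptions. A reported incidence (y = 1) implies an actual incidence (z = 1); when nothing is reported (y = 0), the actual incidence z is latent and is imputed at each Gibbs iteration.

```python
import numpy as np


def prob_actual_given_unreported(p, q):
    """P(z = 1 | y = 0): an incidence occurred but was not reported."""
    return p * (1.0 - q) / (p * (1.0 - q) + (1.0 - p))


def gibbs_underreporting(y, n_iter=2000, burn_in=500, prior_q=(8.0, 2.0), seed=0):
    """Toy Gibbs sampler for y = z * r, z ~ Bernoulli(p), r ~ Bernoulli(q).

    prior_q is an assumed informative Beta prior on the reporting
    probability; p gets a flat Beta(1, 1) prior. Both are illustrative.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=int)
    n = y.size
    p, q = 0.5, 0.5  # arbitrary starting values
    draws = np.empty((n_iter - burn_in, 2))
    unreported = y == 0
    for it in range(n_iter):
        # Step 1: impute latent actual incidences where nothing was reported.
        z = y.copy()
        z[unreported] = rng.random(unreported.sum()) < prob_actual_given_unreported(p, q)
        # Step 2: update the incidence probability given the imputed z.
        p = rng.beta(1.0 + z.sum(), 1.0 + n - z.sum())
        # Step 3: update the reporting probability; only actual incidences
        # (z = 1) carry information about whether they were reported.
        q = rng.beta(prior_q[0] + y.sum(), prior_q[1] + z.sum() - y.sum())
        if it >= burn_in:
            draws[it - burn_in] = p, q
    return draws


if __name__ == "__main__":
    # Synthetic check: true incidence 0.6, true reporting probability 0.8.
    rng = np.random.default_rng(1)
    z_true = rng.random(1000) < 0.6
    y_obs = (z_true & (rng.random(1000) < 0.8)).astype(int)
    post = gibbs_underreporting(y_obs)
    print("reported rate:", y_obs.mean())
    print("posterior mean of p, q:", post.mean(axis=0))
```

In this toy version p and q are not separately identified by the reported data alone, so the posterior leans heavily on the assumed prior for q. The abstract indicates that the full model instead draws on partially observed actual incidences and on variation across respondents and over time.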
