Sentence-Based Text Analysis for Customer Reviews

Firms collect an increasing amount of consumer feedback in the form of unstructured consumer reviews. These reviews contain text about consumer experiences with products and services that are different from surveys that query consumers for specific information. A challenge in analyzing unstructured consumer reviews is in making sense of the topics that are expressed in the words used to describe these experiences. We propose a new model for text analysis that makes use of the sentence structure contained in the reviews and show that it leads to improved inference and prediction of consumer ratings relative to existing models using data from www.expedia.com and www.we8there.com. Sentence-based topics are found to be more distinguished and coherent than those identified from a word-based analysis. Data, as supplemental material, are available at https://doi.org/10.1287/mksc.2016.0993.

[1]  Greg M. Allenby,et al.  The Dimensionality of Customer Satisfaction Survey Responses and Implications for Driver Analysis , 2013, Mark. Sci..

[2]  G. Tellis,et al.  Mining Marketing Meaning from Online Chatter: Strategic Brand Analysis of Big Data Using Latent Dirichlet Allocation , 2014 .

[3]  Bruce G. S. Hardie,et al.  A Dynamic Changepoint Model for New Product Sales Forecasting , 2004 .

[4]  Sha Yang,et al.  A Model for Observation, Structural, and Household Heterogeneity in Panel Data , 2000 .

[5]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[6]  Yi Zhao,et al.  Modeling Consumer Learning from Online Product Reviews , 2012, Mark. Sci..

[7]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[8]  David Godes,et al.  Using Online Conversations to Study Word-of-Mouth Communication , 2004 .

[9]  Xiaojin Zhu,et al.  Incorporating domain knowledge into topic modeling via Dirichlet Forest priors , 2009, ICML '09.

[10]  Peter E. Rossi,et al.  Bayesian Statistics and Marketing , 2005 .

[11]  Thomas L. Griffiths,et al.  Learning author-topic models from text corpora , 2010, TOIS.

[12]  Sha Yang,et al.  Modeling Variation in Brand Preference: The Roles of Objective Environment and Motivating Conditions , 2002 .

[13]  Rajdeep Grewal,et al.  Incentive-Aligned Conjoint Analysis , 2005 .

[14]  W. Kamakura,et al.  Modeling Preference and Structural Heterogeneity in Consumer Choice , 1996 .

[15]  Martin Wetzels,et al.  More than Words: The Influence of Affective Content and Linguistic Style Matches in Online Reviews on Conversion Rates , 2013 .

[16]  Peter E. Rossi,et al.  Bayesian Statistics and Marketing: Rossi/Bayesian Statistics and Marketing , 2006 .

[17]  Donald R. Lehmann,et al.  State-Dependence Effects in Surveys , 2012, Mark. Sci..

[18]  Jacob Goldenberg,et al.  Mine Your Own Business: Market-Structure Surveillance Through Text Mining , 2012, Mark. Sci..

[19]  Chrysanthos Dellarocas,et al.  Exploring the value of online product reviews in forecasting sales: The case of motion pictures , 2007 .

[20]  Beibei Li,et al.  Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd-Sourced Content , 2011, Mark. Sci..

[21]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[22]  Michel Wedel,et al.  Leveraging Missing Ratings to Improve Online Recommendation Systems , 2006 .

[23]  Ricardo Montoya,et al.  Dynamic Allocation of Pharmaceutical Detailing and Sampling for Long-Term Profitability , 2010, Mark. Sci..

[24]  Jonah Berger,et al.  Positive Effects of Negative Publicity: When Negative Reviews Increase Sales , 2009, Mark. Sci..

[25]  Eric T. Bradlow,et al.  Automated Marketing Research Using Online Customer Reviews , 2011 .

[26]  Wayne S. DeSarbo,et al.  Modeling Dynamic Effects in Repeated-Measures Experiments Involving Preference/Choice: An Illustration Involving Stated Preference Analysis , 2004 .

[27]  Oded Netzer,et al.  A Hidden Markov Model of Customer Relationship Dynamics , 2008, Mark. Sci..

[28]  Peter E. Rossi,et al.  Overcoming Scale Usage Heterogeneity , 2001 .

[29]  David Gal,et al.  Answering the Unasked Question: Response Substitution in Consumer Surveys , 2011 .

[30]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[31]  Susan T. Dumais,et al.  Characterizing Microblogs with Topic Models , 2010, ICWSM.

[32]  Michal Rosen-Zvi,et al.  Hidden Topic Markov Models , 2007, AISTATS.