Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd-Sourced Content

User-generated content on social media platforms and product search engines is changing the way consumers shop for goods online. However, current product search engines fail to effectively leverage information created across diverse social media platforms. Moreover, current ranking algorithms in these product search engines tend to induce consumers to focus on one single product characteristic dimension (e.g., price, star rating). This approach largely ignores consumers' multidimensional preferences for products. In this paper, we propose to generate a ranking system that recommends products that provide, on average, the best value for the consumer's money. The key idea is that products that provide a higher surplus should be ranked higher on the screen in response to consumer queries. We use a unique data set of U.S. hotel reservations made over a three-month period through Travelocity, which we supplement with data from various social media sources using techniques from text mining, image classification, social geotagging, human annotations, and geomapping. We propose a random coefficient hybrid structural model, taking into consideration the two sources of consumer heterogeneity the different travel occasions and different hotel characteristics introduce. Based on the estimates from the model, we infer the economic impact of various location and service characteristics of hotels. We then propose a new hotel ranking system based on the average utility gain a consumer receives from staying in a particular hotel. By doing so, we can provide customers with the “best-value” hotels early on. Our user studies, using ranking comparisons from several thousand users, validate the superiority of our ranking system relative to existing systems on several travel search engines. On a broader note, this paper illustrates how social media can be mined and incorporated into a demand estimation model in order to generate a new ranking system in product search engines. We thus highlight the tight linkages between user behavior on social media and search engines. Our interdisciplinary approach provides several insights for using machine learning techniques in economics and marketing research.

[1]  Gregory K. Leonard,et al.  Competitive Analysis with Differentiated Products , 1994 .

[2]  Yong Liu Word-of-Mouth for Movies: Its Dynamics and Impact on Box Office Revenue , 2006 .

[3]  Steven T. Berry Estimating Discrete-Choice Models of Product Differentiation , 1994 .

[4]  Magnus Niemann,et al.  A Personalized Hotel Selection Engine , 2006 .

[5]  Christopher R. Knittel,et al.  Estimation of Random Coefficient Demand Models: Challenges, Difficulties and Warnings , 2008 .

[6]  Lorin M. Hitt,et al.  Self Selection and Information Role of Online Product Reviews , 2007, Inf. Syst. Res..

[7]  J. M. Villas-Boas,et al.  Endogeneity in Brand Choice Models , 1999 .

[8]  Richard Schmalensee,et al.  MEASURING MARKET POWER IN THE READY-TO-EAT CEREAL INDUSTRY BY AVIV NEVO , 2001 .

[9]  Minjae Song A Hybrid Discrete Choice Model of Differentiated Product Demand with an Application to Personal Computers , 2010 .

[10]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[11]  H. Mills Marketing as a Science , 1961 .

[12]  Jeremy T. Fox,et al.  Improving the Numerical Performance of BLP Static and Dynamic Discrete Choice Random Coefficients Demand Estimation , 2009 .

[13]  Jerry A. Hausman,et al.  Valuation of New Goods Under Perfect and Imperfect Competition , 1994 .

[14]  Teppei Yamamoto,et al.  A Multinomial Response Model for Varying Choice Sets, with Application to Partially Contested Multiparty Elections , 2010 .

[15]  Eric T. Bradlow,et al.  Automatic Construction of Conjoint Attributes and Levels from Online Customer Reviews , 2007 .

[16]  Panagiotis G. Ipeirotis,et al.  The Dimensions of Reputation in Electronic Markets , 2009 .

[17]  Anindya Ghose,et al.  Internet Exchanges for Used Goods: An Empirical Analysis of Trade Patterns and Adverse Selection , 2008, MIS Q..

[18]  Peter S. Fader,et al.  Modeling Consumer Choice among SKUs , 1996 .

[19]  S. Rosen Hedonic Prices and Implicit Markets: Product Differentiation in Pure Competition , 1974, Journal of Political Economy.

[20]  Tuck Siong Chung,et al.  Marketing Models of Service and Relationships , 2006 .

[21]  Steven Berry,et al.  Limit Theorems for Estimating the Parameters of Differentiated Product Demand Systems , 2000 .

[22]  Anand V. Bodapati Recommendation Systems with Purchase Data , 2008 .

[23]  Aviv Nevo Measuring Market Power in the Ready-to-Eat Cereal Industry , 1998 .

[24]  Aviv Nevo,et al.  Identification With Imperfect Instruments , 2008, Review of Economics and Statistics.

[25]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[26]  Minjae Song,et al.  A Hybrid Discrete Choice Model of Differentiated Product Demand with an Application to Personal Computers , 2010 .

[27]  Minjae Song,et al.  Measuring consumerwelfareinthe CPU market: anapplication of the pure‐characteristics demand model , 2007 .

[28]  Sridhar Narayanan,et al.  Identifying Causal Marketing Mix Effects Using a Regression Discontinuity Design , 2011, Mark. Sci..

[29]  Steven T. Berry,et al.  Automobile Prices in Market Equilibrium , 1995 .

[30]  Vasant Dhar,et al.  Research Commentary - Sponsored Search and Market Efficiency , 2010, Inf. Syst. Res..

[31]  Justin McCrary,et al.  Manipulation of the Running Variable in the Regression Discontinuity Design: A Density Test , 2007 .

[32]  C. Kelley Solving Nonlinear Equations with Newton's Method , 1987 .

[33]  Benjamin C. Ngwudike THE 2003 NATIONAL ASSESSMENT OF ADULT LITERACY (NAAL): PERFORMANCE OF AFRICAN AMERICANS IN A NATIONAL CONTEXT , 2008 .

[34]  K. Lancaster,et al.  Consumer Demand. A New Approach. , 1972 .

[35]  R. Kohli,et al.  Internet Recommendation Systems , 2000 .

[36]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.

[37]  Joel H. Steckel,et al.  Cross-Validating Regression Models in Marketing Research , 1993 .

[38]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.

[39]  Eric T. Bradlow,et al.  Automated Marketing Research Using Online Customer Reviews , 2011 .

[40]  Anindya Ghose,et al.  Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets , 2008, Inf. Syst. Res..

[41]  Bin Gu,et al.  Do online reviews matter? - An empirical investigation of panel data , 2008, Decis. Support Syst..

[42]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[43]  Pradeep K. Chintagunta,et al.  Endogeneity and Heterogeneity in a Probit Demand Model: Estimation Using Aggregate Data , 2001 .

[44]  Carl F. Mela,et al.  E-Customization , 2003 .

[45]  Pradeep Chintagunta,et al.  Beyond the Endogeneity Bias: The Effect of Unmeasured Brand Characteristics on Household-Level Brand Choice Models , 2005, Manag. Sci..

[46]  V. Dhar,et al.  Does Chatter Matter? The Impact of User-Generated Content on Music Sales , 2007 .

[47]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[48]  Paul A. Pavlou,et al.  Can online reviews reveal a product's true quality?: empirical findings and analytical modeling of Online word-of-mouth communication , 2006, EC '06.

[49]  Oz Shy,et al.  Industrial Organization: Theory and Applications , 1995 .

[50]  Panagiotis G. Ipeirotis,et al.  Show me the money!: deriving the pricing power of product features by mining consumer reviews , 2007, KDD '07.

[51]  Z. John Zhang,et al.  From Story Line to Box Office: A New Approach for Green-Lighting Movie Scripts , 2007, Manag. Sci..

[52]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[53]  Michel Wedel,et al.  Leveraging Missing Ratings to Improve Online Recommendation Systems , 2006 .

[54]  Mike Y. Chen,et al.  Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web , 2001 .

[55]  Ernst R. Berndt,et al.  The practice of econometrics : classic and contemporary , 1993 .

[56]  Aviv Nevo A Practitioner's Guide to Estimation of Random‐Coefficients Logit Models of Demand , 2000 .

[57]  David Godes,et al.  Using Online Conversations to Study Word-of-Mouth Communication , 2004 .

[58]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[59]  Harikesh S. Nair,et al.  Accounting for Primary and Secondary Demand Effects with Aggregate Data , 2004 .

[60]  Hernán A. Bruno,et al.  Research Note - Structural Demand Estimation with Varying Product Availability , 2008, Mark. Sci..

[61]  M. Trusov,et al.  Estimating Aggregate Consumer Preferences from Online Product Reviews , 2010 .

[62]  Susan Athey,et al.  Discrete Choice Models with Multiple Unobserved Choice Characteristics , 2007 .

[63]  Michael A. Wulder,et al.  Remote sensing methods in medium spatial resolution satellite data land cover classification of large areas , 2002 .

[64]  Jacob Goldenberg,et al.  Mine Your Own Business: Market-Structure Surveillance Through Text Mining , 2012, Mark. Sci..

[65]  S. Fukuda,et al.  Support vector machine classification of land cover: application to polarimetric SAR data , 2001, IGARSS 2001. Scanning the Present and Resolving the Future. Proceedings. IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217).

[66]  Chrysanthos Dellarocas,et al.  Exploring the value of online product reviews in forecasting sales: The case of motion pictures , 2007 .

[67]  Panagiotis G. Ipeirotis,et al.  Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics , 2010, IEEE Transactions on Knowledge and Data Engineering.

[68]  Vasant Dhar,et al.  Sponsored Search and Market Efficiency , 2010 .

[69]  Roland T. Rust,et al.  My Mobile Music: An Adaptive Personalization System For Digital Audio Players , 2007 .

[70]  K. Lancaster A New Approach to Consumer Theory , 1966, Journal of Political Economy.

[71]  Arnaud De Bruyn,et al.  Offering Online Recommendations with Minimum Customer Input Through Conjoint-Based Decision Aids , 2008, Mark. Sci..

[72]  Steven Berry,et al.  The Pure Characteristics Demand Model , 2007 .