Product recommendation with latent review topics

Online customer reviews complement the information supplied by product and service providers. Provider information comes directly from the source of the product or service, whereas reviews generally come from its users. The two sets of information are therefore generated from different perspectives, possibly with different intentions. For a prospective customer, the two perspectives together form a complementary body of information that supports the purchase decision. Because perspective and incentive structure differ, information from each source tends to be biased; in particular, providers are likely to omit negative information. Moreover, customers often face information overload when trying to make sense of existing online customer reviews. We attempt to alleviate this overload by mining hidden information in online customer reviews. We use a variant of the Latent Dirichlet Allocation (LDA) model together with clustering to generate equivalent options that the customer can then use in a purchase decision. We illustrate the approach using online hotel review data.
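The general idea of combining topic modeling with clustering can be sketched as follows. This is only an illustration under stated assumptions, not the authors' method: the abstract does not specify which LDA variant or clustering algorithm is used, so the sketch substitutes plain LDA fitted by collapsed Gibbs sampling and ordinary k-means. The hotel names, review texts, function names (`lda_gibbs`, `kmeans`), and all hyperparameter values are hypothetical.

```python
import random


def lda_gibbs(docs, n_topics, n_iters=200, alpha=0.1, beta=0.01, seed=0):
    """Plain LDA via collapsed Gibbs sampling (assumption: not the paper's variant).

    docs is a list of token lists; returns one topic-proportion vector per document.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]             # tokens of doc d in topic k
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens of word v in topic k
    topic_total = [0] * n_topics                           # tokens in topic k
    assignments = []

    # Random initial topic for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            k = rng.randrange(n_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(zs)

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove the token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                # Resample its topic from the collapsed conditional.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1

    # Smoothed per-document topic proportions.
    return [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]


def kmeans(points, k, n_iters=50, seed=0):
    """Ordinary k-means on small dense vectors; returns one cluster label per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(n_iters):
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical toy data: one concatenated review text per hotel.
reviews = {
    "Hotel A": "clean room quiet room comfortable bed clean bathroom",
    "Hotel B": "quiet room clean bed comfortable room spacious bathroom",
    "Hotel C": "friendly staff great breakfast helpful staff tasty breakfast",
    "Hotel D": "helpful staff friendly service great breakfast buffet",
}
docs = [text.split() for text in reviews.values()]
theta = lda_gibbs(docs, n_topics=2)
labels = kmeans(theta, k=2)
for name, label in zip(reviews, labels):
    print(name, "-> cluster", label)
```

Hotels that land in the same cluster have similar topic profiles in their reviews and could, under these assumptions, be presented to the customer as roughly equivalent options. Gibbs sampling is stochastic, so the cluster assignments depend on the seed and on the hyperparameters.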
