A classification of user-generated content into consumer decision journey stages

In the last decades, the availability of digital user-generated documents from social media has dramatically increased. This massive growth of user-generated content has also affected traditional shopping behaviour. Customers have embraced new communication channels such as microblogs and social networks that enable them not only just to talk with friends and acquaintances about their shopping experience, but also to search for opinions expressed by complete strangers as part of their decision making processes. Uncovering how customers feel about specific products or brands and detecting purchase habits and preferences has traditionally been a costly and highly time-consuming task which involved the use of methods such as focus groups and surveys. However, the new scenario calls for a deep assessment of current market research techniques in order to better interpret and profit from this ever-growing stream of attitudinal data. With this purpose, we present a novel analysis and classification of user-generated content in terms of it belonging to one of the four stages of the Consumer Decision Journey Court et al. (2009) (i.e. the purchase process from the moment when a customer is aware of the existence of the product to the moment when he or she buys, experiences and talks about it). Using a corpus of short texts written in English and Spanish and extracted from different social media, we identify a set of linguistic patterns for each purchase stage that will be then used in a rule-based classifier. Additionally, we use machine learning algorithms to automatically identify business indicators such as the Marketing Mix elements McCarthy and Brogowicz (1981). The classification of the purchase stages achieves an average precision of 74%. The proposed classification of texts depending on the Marketing Mix elements expressed achieved an average precision of 75% for all the elements analysed.

[1]  Chunyan Miao,et al.  Context-Aware Personal Information Retrieval From Multiple Social Networks , 2014, IEEE Computational Intelligence Magazine.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Kuansan Wang,et al.  Web scale NLP: a case study on url word breaking , 2011, WWW.

[4]  Óscar Muñoz-García,et al.  Exploiting Web-based Collective Knowledge for Micropost Normalisation , 2013, Tweet-Norm@SEPLN.

[5]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[6]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[7]  Sanjukta A. Pookulangara,et al.  Cultural influence on consumers' usage of social networks and its' impact on online purchase intentions , 2011 .

[8]  Niranjan Pedanekar,et al.  Wishful Thinking - Finding suggestions and ’buy’ wishes from product reviews , 2010, HLT-NAACL 2010.

[9]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[10]  A. Rangaswamy,et al.  The Impact of New Media on Customer Relationships , 2010 .

[11]  Sarah J. S. Wilner,et al.  Networked Narratives: Understanding Word-of-Mouth Marketing in Online Communities , 2009 .

[12]  Erik Cambria,et al.  Jumping NLP Curves: A Review of Natural Language Processing Research [Review Article] , 2014, IEEE Computational Intelligence Magazine.

[13]  Diego Reforgiato Recupero,et al.  Frame-Based Detection of Opinion Holders and Topics: A Model and a Tool , 2014, IEEE Computational Intelligence Magazine.

[14]  Noah A. Smith,et al.  Movie Reviews and Revenues: An Experiment in Text Regression , 2010, NAACL.

[15]  S. Ng,et al.  The impact of negative word-of-mouth in Web 2.0 on brand equity , 2009 .

[16]  G. Lilien,et al.  A multi-stage model of word-of-mouth influence through viral marketing , 2008 .

[17]  Aditya G. Parameswaran,et al.  Blogs as Predictors of Movie Success , 2009, ICWSM.

[18]  Chrysanthos Dellarocas,et al.  The Digitization of Word-of-Mouth: Promise and Challenges of Online Feedback Mechanisms , 2003, Manag. Sci..

[19]  Neil H. Borden,et al.  The Concept of the Marketing Mix , 1964 .

[20]  Xiaojin Zhu,et al.  May All Your Wishes Come True: A Study of Wishes and How to Recognize Them , 2009, NAACL.

[21]  F. Muñoz,et al.  Demystifying Social Media , 2010 .

[22]  Chunling Yu,et al.  Social Media Peer Communication and Impacts on Purchase Intentions: A Consumer Socialization Framework , 2012 .

[23]  José Ramom Pichel Campos,et al.  A Method to Lexical Normalisation of Tweets , 2013, Tweet-Norm@SEPLN.

[24]  Pranjal Gupta,et al.  How e-WOM recommendations influence product consideration and quality of choice: A motivation to process information perspective , 2010 .

[25]  Daniel E. O'Leary,et al.  Artificial Intelligence and Big Data , 2013, IEEE Intelligent Systems.

[26]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[27]  G. Franzen,et al.  Brands & advertising : how advertising effectiveness influences brand equity , 1999 .

[28]  Lluís Padró,et al.  FreeLing 3.0: Towards Wider Multilinguality , 2012, LREC.

[29]  David C. Edelman Branding in The Digital Age You're Spending Your Money In All the Wrong Places , 2010 .