Know your customer: Detection of Customer Experience (CX) in Social Platforms using Text Categorization

Customers nowadays are one online post away from their stores, specially when it comes to post-shopping experiences. This translates to large amounts of text messages to evaluate and process for big brands that aim to maintain a good quality of service as well as a digital channel of communication for their customers. Automating the understanding of this text data poses questions such as how large the corpus should be and which are the best algorithms to discriminate whether a social media post is related or not to customer experience (CX). In order to help answering these questions, first, we get hold of posts from three different platforms: Foursquare (77K) , Twitter (153K) and Facebook (2.2M). Such posts are directed to brands ranked in the ForeSee CX Index and the Forrester CX Index rankings. Second, we build a binary classifier using different algorithms to identify customer experience posts on a social platform. The accuracy of the best performing setting is 86.4% for Facebook and 91.2% for Twitter. Third, we explore the effect of increasing the number of training samples, and how a plateau is reached after 5K posts. Finally, we conduct experiments using different combinations of n-grams as features for the text mining process. As a result we observe that uni-grams and bi-grams are the best combination when we need to choose features for a classifier discriminating customer experience social media posts on Twitter and a combination of up to four-grams on Facebook.

[1]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[2]  Ramakrishnan Ramanathan,et al.  Measuring Service Quality using SERVQUAL and AHP: an application to a Chinese IT company and comparison , 2011 .

[3]  R. Ordelman,et al.  Improved cyberbullying detection using gender information , 2012 .

[4]  Songbo Tan,et al.  Neighbor-weighted K-nearest neighbor for unbalanced text corpus , 2005, Expert Syst. Appl..

[5]  Katherine N. Lemon,et al.  Understanding Customer Experience Throughout the Customer Journey , 2016 .

[6]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[7]  Xun Xu,et al.  The antecedents of customer satisfaction and dissatisfaction toward various types of hotels: A text mining approach , 2016 .

[8]  C. Baird,et al.  From social media to social customer relationship management , 2013, IEEE Engineering Management Review.

[9]  Rodrigo Castro,et al.  Back to #6D: Predicting Venezuelan states political election results through Twitter , 2017, 2017 Fourth International Conference on eDemocracy & eGovernment (ICEDEG).

[10]  Weiguo Fan,et al.  The power of social media analytics , 2014, CACM.

[11]  P Visalakshi Latent Semantic Indexing Based SVM Model for Email Spam Classification , 2014 .

[12]  Hinrich Schütze,et al.  Introduction to Information Retrieval: Scoring, term weighting, and the vector space model , 2008 .

[13]  N. Subramanian,et al.  Role of social media in retail network operations and marketing to enhance customer satisfaction , 2017 .

[14]  Yen S. Low,et al.  Text Mining for Adverse Drug Events: the Promise, Challenges, and State of the Art , 2014, Drug Safety.

[15]  Fabrizio Sebastiani Text Categorization , 2005, Encyclopedia of Database Technologies and Applications.

[16]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[17]  Yiming Yang,et al.  A re-examination of text categorization methods , 1999, SIGIR '99.

[18]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[19]  M. E. Maron,et al.  Automatic Indexing: An Experimental Inquiry , 1961, JACM.

[20]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[21]  D. Stewart,et al.  Customer Experience Management in Retailing: Understanding the Buying Process , 2009 .

[22]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[23]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[24]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[25]  April Kontostathis,et al.  Text Mining and Cybercrime , 2010 .

[26]  Babis Theodoulidis,et al.  Analyzing Customer Experience Feedback Using Text Mining , 2014 .

[27]  Dennis McLeod,et al.  A Comparative Study for Email Classification , 2007 .

[28]  Bernardete Ribeiro,et al.  The importance of stop word removal on recall values in text categorization , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[29]  Yunming Ye,et al.  An Improved Random Forest Classifier for Text Categorization , 2012, J. Comput..

[30]  Amir Khanlari Strategic Customer Relationship Management in the Age of Social Media , 2015 .

[31]  Hakan Ferhatosmanoglu,et al.  Short text classification in twitter to improve information filtering , 2010, SIGIR.