Aggregating Customer Review Attributes for Online Reputation Generation

In this paper, we face the problem of generating reputation for movies, products, hotels, restaurants and services by mining customer reviews expressed in natural language. To the best of our knowledge, previous studies on reputation generation for online entities have primarily examined semantic and sentiment orientation of customer reviews, disregarding other useful information that could be extracted from reviews, such as review helpfulness and review time. Therefore, we propose a new approach that combines review helpfulness, review time, review attached rating and review sentiment orientation for the purpose of generating a single reputation value toward various entities. The contribution of the paper is threefold. First, we design two equations to compute review helpfulness and review time scores, and we fine-tune Bidirectional Encoder Representations from Transformers (BERT) model to predict the review sentiment orientation probability. Second, we design a formula to assign a numerical score to each review. Then, we propose a new formula to compute reputation value toward the target entity (movie, product, hotel, restaurant, service, etc). Finally, we propose a new form to visualize reputation that depicts numerical reputation value, opinion categories, top positive review and top negative review. Experimental results coming from several real-world data sets of miscellaneous domains collected from IMDb, TripAdvisor and Amazon websites show the effectiveness of the proposed method in generating and visualizing reputation compared to three state-of-the-art reputation systems.

[1]  Hsinchun Chen,et al.  Selecting Attributes for Sentiment Classification Using Feature Relation Networks , 2011, IEEE Transactions on Knowledge and Data Engineering.

[2]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[3]  Shiliang Sun,et al.  A review of natural language processing techniques for opinion mining systems , 2017, Inf. Fusion.

[4]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[5]  El Habib Nfaoui,et al.  An Unsupervised Approach for Reputation Generation , 2019, Procedia Computer Science.

[6]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[7]  Bernard Yannou,et al.  Mining customer product reviews for product development: A summarization process , 2019, Expert Syst. Appl..

[8]  Witold Pedrycz,et al.  Fusing and mining opinions for reputation generation , 2017, Inf. Fusion.

[9]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[10]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[11]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[12]  George A. F. Seber,et al.  Linear regression analysis , 1977 .

[13]  Seba Susan,et al.  Fuzzy rule based unsupervised sentiment analysis from social media posts , 2019, Expert Syst. Appl..

[14]  Francisco Javier González-Castaño,et al.  Unsupervised method for sentiment analysis in online texts , 2016, Expert Syst. Appl..

[15]  Quoc V. Le,et al.  Unsupervised Data Augmentation for Consistency Training , 2019, NeurIPS.

[16]  Alaa M. El-Halees,et al.  Feature-Based Opinion Summarization for Arabic Reviews , 2018, 2018 International Arab Conference on Information Technology (ACIT).

[17]  Henry Anaya-Sánchez,et al.  Retrieving Product Features and Opinions from Customer Reviews , 2013, IEEE Intelligent Systems.

[18]  El Habib Nfaoui,et al.  A hybrid approach for generating reputation based on opinions fusion and sentiment analysis , 2019, J. Organ. Comput. Electron. Commer..

[19]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[20]  Yang Yu,et al.  How Service-Related Factors Affect the Survival of B2T Providers: A Sentiment Analysis Approach , 2015, J. Organ. Comput. Electron. Commer..

[21]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[22]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[23]  Marc'Aurelio Ranzato,et al.  Ensemble of Generative and Discriminative Techniques for Sentiment Analysis of Movie Reviews , 2014, ICLR.

[24]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[25]  Moshe Koppel,et al.  THE IMPORTANCE OF NEUTRAL EXAMPLES FOR LEARNING SENTIMENT , 2006, Comput. Intell..

[26]  Oskar Ahlgren Research on Sentiment Analysis: The First Decade , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[27]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[28]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[29]  Erik Cambria,et al.  Adaptive two-stage feature selection for sentiment classification , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[30]  Colin Raffel,et al.  Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[31]  Kevin Gimpel,et al.  ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[32]  M. E. Maron,et al.  Automatic Indexing: An Experimental Inquiry , 1961, JACM.

[33]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[34]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[35]  D. Altman,et al.  Statistics notes: Measurement error , 1996 .

[36]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[37]  Yiming Yang,et al.  XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[38]  Yen-Liang Chen,et al.  Opinion mining from online hotel reviews - A text summarization approach , 2017, Inf. Process. Manag..

[39]  Samuel Pecar,et al.  Towards Opinion Summarization of Customer Reviews , 2018, ACL.

[40]  Sasha Blair-Goldensohn,et al.  Building a Sentiment Summarizer for Local Service Reviews , 2008 .

[41]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[42]  Shenghuo Zhu,et al.  SumView: A Web-based engine for summarizing product reviews and customer opinions , 2013, Expert Syst. Appl..

[43]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[44]  Manoj Kumar Tiwari,et al.  Mining consumer reviews to generate ratings of different product attributes while producing feature-based review-summary , 2016, Int. J. Syst. Sci..

[45]  Quoc V. Le,et al.  ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators , 2020, ICLR.

[46]  B. K. Tripathy,et al.  Investigation of recurrent neural networks in the field of sentiment analysis , 2017, 2017 International Conference on Communication and Signal Processing (ICCSP).

[47]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[48]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[49]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[50]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[51]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[52]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[53]  Christopher D. Manning,et al.  Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.

[54]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[55]  Durga Toshniwal,et al.  Feature based Summarization of Customers' Reviews of Online Products , 2013, KES.

[56]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[57]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[58]  Piotr Szymanski,et al.  Comprehensive Study on Lexicon-based Ensemble Classification Sentiment Analysis , 2015, Entropy.

[59]  Eibe Frank,et al.  A Simple Approach to Ordinal Classification , 2001, ECML.

[60]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[61]  Alec Radford,et al.  Improving Language Understanding by Generative Pre-Training , 2018 .

[62]  El Habib Nfaoui,et al.  A Logistic Regression Approach for Generating Movies Reputation Based on Mining User Reviews , 2019, 2019 International Conference on Intelligent Systems and Advanced Computing Sciences (ISACS).

[63]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[64]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[65]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[66]  Michael B. Miller Linear Regression Analysis , 2013 .

[67]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.