Automatic Approach of Sentiment Lexicon Generation for Mobile Shopping Reviews

The dramatic increase in the use of smartphones has allowed people to comment on various products at any time. The analysis of the sentiment of users’ product reviews largely depends on the quality of sentiment lexicons. Thus, the generation of high-quality sentiment lexicons is a critical topic. In this paper, we propose an automatic approach for constructing a domain-specific sentiment lexicon by considering the relationship between sentiment words and product features in mobile shopping reviews. The approach first selects sentiment words and product features from original reviews and mines the relationship between them using an improved pointwise mutual information algorithm. Second, sentiment words that are related to mobile shopping are clustered into categories to form sentiment dimensions. At each sentiment dimension, each sentiment word can take the value of 0 or 1, where 1 indicates that the word belongs to a particular category whereas 0 indicates that it does not belong to that category. The generated lexicon is evaluated by constructing a sentiment classification task using several product reviews written in both Chinese and English. Two popular non-domain-specific sentiment lexicons as well as state-of-the-art machine-learning and deep-learning models are chosen as benchmarks, and the experimental results show that our sentiment lexicons outperform the benchmarks with statistically significant differences, thus proving the effectiveness of the proposed approach.

[1]  W. Marsden I and J , 2012 .

[2]  Michael S. Bernstein,et al.  Empath: Understanding Topic Signals in Large-Scale Text , 2016, CHI.

[3]  Diana Maynard,et al.  Automatic Detection of Political Opinions in Tweets , 2011, #MSM.

[4]  M. de Rijke,et al.  UvA-DARE ( Digital Academic Repository ) Using WordNet to measure semantic orientations of adjectives , 2004 .

[5]  Rosa M. Carro,et al.  Sentiment analysis in Facebook and its application to e-learning , 2014, Comput. Hum. Behav..

[6]  Patricio Martínez-Barco,et al.  Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments , 2012, Decis. Support Syst..

[7]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[8]  Zili Zhang,et al.  Sentiment classification of Internet restaurant reviews written in Cantonese , 2011, Expert Syst. Appl..

[9]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[10]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[11]  P. Ekman An argument for basic emotions , 1992 .

[12]  Vikas Sindhwani,et al.  Document-Word Co-regularization for Semi-supervised Sentiment Analysis , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[13]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[14]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[15]  Mike Thelwall,et al.  Sentiment Analysis Is a Big Suitcase , 2017, IEEE Intelligent Systems.

[16]  Hazem M. Hajj,et al.  A Light Lexicon-based Mobile Application for Sentiment Mining of Arabic Tweets , 2015, ANLP@ACL.

[17]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[18]  Diego Reforgiato Recupero,et al.  Sentiment Analysis: Adjectives and Adverbs are Better than Adjectives Alone , 2007, ICWSM.

[19]  Hua Xu,et al.  Chinese comments sentiment classification based on word2vec and SVMperf , 2015, Expert Syst. Appl..

[20]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[21]  Hung-Yu Kao,et al.  Automatic Domain-Specific Sentiment Lexicon Generation with Label Propagation , 2013, IIWAS '13.

[22]  Mariano Sigman,et al.  Comparative study of LSA vs Word2vec embeddings in small corpora: a case study in dreams database , 2016, ArXiv.

[23]  Jin Chen,et al.  Research on Building a Chinese Sentiment Lexicon Based on SO-PMI , 2012 .

[24]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[25]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[26]  Bo Jiang,et al.  Machine Learning and Lexicon Based Methods for Sentiment Classification: A Survey , 2014, 2014 11th Web Information System and Application Conference.

[27]  G. A. Mishne,et al.  Expiriments with mood classification in blog posts , 2005, SIGIR 2005.

[28]  Yao Lu,et al.  Exploring the Sentiment Strength of User Reviews , 2010, WAIM.

[29]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[30]  Pedro M. Domingos,et al.  Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier , 1996, ICML.

[31]  Peter D. Turney Thumbs Up, Thumbs Down , 2013, Journal of Cell Science.

[32]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[33]  Fernando Enríquez,et al.  An approach to the use of word embeddings in an opinion classification task , 2016, Expert Syst. Appl..

[34]  Chun Chen,et al.  DASA: Dissatisfaction-oriented Advertising based on Sentiment Analysis , 2010, Expert Syst. Appl..

[35]  Samir Kumar Bandyopadhyay,et al.  Sentiment Analysis - A Review , 2015 .

[36]  Feiping Nie,et al.  Discriminative Embedded Clustering: A Framework for Grouping High-Dimensional Data , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Elizabeth D. Liddy,et al.  Discerning Emotions in Texts , 2004, AAAI 2004.

[38]  D. Watson,et al.  On the Dimensional and Hierarchical Structure of Affect , 1999 .

[39]  Ming Xu,et al.  Feature-level sentiment analysis for Chinese product reviews , 2011, 2011 3rd International Conference on Computer Research and Development.

[40]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[41]  Xiaodong Li,et al.  Market impact analysis via deep learned architectures , 2019, Neural Computing and Applications.

[42]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[43]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.