Evaluating the performance of sentence level features and domain sensitive features of product reviews on supervised sentiment analysis tasks

With the popularity of e-commerce, posting online product reviews expressing customer’s sentiment or opinion towards products has grown exponentially. Sentiment analysis is a computational method that plays an essential role in automating the extraction of subjective information i.e. customer’s sentiment or opinion from online product reviews. Two approaches commonly used in Sentiment analysis tasks are supervised approaches and lexicon-based approaches. In supervised approaches, Sentiment analysis is seen as a text classification task. The result depends not only on the robustness of the machine learning algorithm but also on the utilized features. Bag-of-word is a common utilized features. As a statistical feature, bag-of-word does not take into account semantic of words. Previous research has indicated the potential of semantic in supervised SA task. To augment the result of sentiment analysis, this paper proposes a method to extract text features named sentence level features (SLF) and domain sensitive features (DSF) which take into account semantic of words in both sentence level and domain level of product reviews. A word sense disambiguation based method was adapted to extract SLF. For every similarity employed in generating SLF, the SentiCircle-based method was enhanced to generate DSF. Results of the experiments indicated that our proposed semantic features i.e. SLF and SLF + DSF favorably increase the performance of supervised sentiment analysis on product reviews.

[1]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[2]  Amit Kumar Mishra,et al.  Various Aspects of Sentiment Analysis: A Review , 2019, SSRN Electronic Journal.

[3]  Jure Leskovec,et al.  Inferring Networks of Substitutable and Complementary Products , 2015, KDD.

[4]  V. Uma,et al.  Approaches to Sentiment Analysis on Product Reviews , 2019, Advances in Business Information Systems and Analytics.

[5]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[6]  Yücel Saygin,et al.  SU-Sentilab : A Classification System for Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[7]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[8]  Riyanarto Sarno,et al.  Automatic ranking system of university based on technology readiness level using LDA-Adaboost.MH , 2018, 2018 International Conference on Information and Communications Technology (ICOIACT).

[9]  Zhong Jin,et al.  Document Sentiment Classification based on the Word Embedding , 2015, ICM 2015.

[10]  Widodo Budiharto,et al.  Prediction and analysis of Indonesia Presidential election from Twitter using sentiment analysis , 2018, Journal of Big Data.

[11]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[12]  Rada Mihalcea,et al.  Unsupervised graph-based word sense disambiguation , 2009 .

[13]  Chastine Fatichah,et al.  Enhancing the performance of sentiment analysis task on product reviews by handling both local and global context , 2020, Int. J. Inf. Decis. Sci..

[14]  Reinald Kim Amplayo,et al.  Incorporating product description to sentiment topic models for improved aspect-based sentiment analysis , 2018, Inf. Sci..

[15]  Hao-Chiang Koong Lin,et al.  The asymmetric effect of review valence on numerical rating , 2019, Online Inf. Rev..

[16]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[17]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[18]  Saidah Saad,et al.  Sentiment Analysis or Opinion Mining: A Review , 2017 .

[19]  Hongwei Wang,et al.  Sentimental feature selection for sentiment analysis of Chinese online reviews , 2015, International Journal of Machine Learning and Cybernetics.

[20]  Yu-N Cheah,et al.  Exploiting sequential patterns to detect objective aspects from online reviews , 2016, 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA).

[21]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[22]  Jozef Juhár,et al.  Classification of heterogeneous text data for robust domain-specific language modeling , 2014, EURASIP J. Audio Speech Music. Process..

[23]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[24]  Yu-N Cheah,et al.  A two-fold rule-based model for aspect extraction , 2017, Expert Syst. Appl..

[25]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[26]  Kamal Eddine El Kadiri,et al.  Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis , 2018 .

[27]  Wenchao Xu,et al.  Aspect based fine-grained sentiment analysis for online reviews , 2019, Inf. Sci..

[28]  Raji Srinivasan,et al.  Social Influence Effects in Online Product Ratings , 2012 .

[29]  Okran Jeong,et al.  Social media contents based sentiment analysis and prediction system , 2018, Expert Syst. Appl..

[30]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[31]  Aitor García Pablos,et al.  W2VLDA: Almost unsupervised system for Aspect Based Sentiment Analysis , 2017, Expert Syst. Appl..

[32]  Riyanarto Sarno,et al.  Supervised Probabilistic Latent Semantic Analysis (sPLSA) for estimating technology readiness level , 2017, 2017 11th International Conference on Information & Communication Technology and System (ICTS).

[33]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[34]  Peter Z. Yeh,et al.  Semantic Interpretation of the Web without the Semantic Web: Toward Business-Aware Web Processors , 2007 .

[35]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[36]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[37]  Wen-Chin Tsao,et al.  eWOM persuasiveness: do eWOM platforms and product type matter? , 2015, Electron. Commer. Res..