Improving The SFA Algorithm by Employing Multi-Source Data

In recent years, discriminative learning methods have widely been used in various areas of Natural Language Processing (NLP). These methods achieve the best performance, when the set of training and testing samples have the same distribution. However, in many applications of NLP, the lack of labeled datasets for some domains is a serious challenge. In such conditions, we need to develop a model based on domains with rich labeled instances and apply it to the domain with no labeled instances. In this research, a method for sentiment classification of opinions into positive and negative groups, which represent the users' feelings, is offered based on multi-source transfer learning. The proposed method here employs Spectral Feature Alignment algorithm to adapt different domains. Furthermore, according to the Majority Voting, accuracy is assigned to classifications trained on different domains based on the Majority Voting Error. Ultimately, decisions are made for each classification based on the calculated error. The Amazon datasets for four different categories, each of which contains 1000 positive and 1000 negative samples, are exploited to train the proposed model. Meanwhile, each category includes unlabeled samples that are used to select pivot features. The accuracy values of 85.5%, 86.4%, 83.5% and 90.1% obtained for Electronics, DVD, Books and Kitchen domains respectively, show the effectiveness of the proposed method compared with similar methods.

[1]  Harith Alani,et al.  Automatically Extracting Polarity-Bearing Topics for Cross-Domain Sentiment Classification , 2011, ACL.

[2]  Paolo Rosso,et al.  Cross-domain polarity classification using a knowledge-enhanced meta-classifier , 2015, Knowl. Based Syst..

[3]  Xiaoli Li,et al.  Cooperative Hybrid Semi-Supervised Learning for Text Sentiment Classification , 2019, Symmetry.

[4]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[5]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[6]  Jian Hu,et al.  Using Wikipedia for Co-clustering Based Cross-Domain Text Classification , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[7]  Guodong Zhou,et al.  Semi-supervised Sentiment Classification Based on Auxiliary Task Learning , 2018, NLPCC.

[8]  Deipali. V. Gore,et al.  A Survey on Sentiment Analysis on Twitter Data Using Different Techniques , 2015 .

[9]  Xiaoming Liu,et al.  Transfer learning with one-class data , 2014, Pattern Recognit. Lett..

[10]  Fabrício Benevenuto,et al.  On the combination of "off-the-shelf" sentiment analysis methods , 2016, SAC.

[11]  Benno Stein,et al.  Cross-Lingual Adaptation Using Structural Correspondence Learning , 2010, TIST.

[12]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[13]  Danushka Bollegala,et al.  Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification , 2011, ACL.

[14]  Bogdan Gabrys,et al.  Classifier selection for majority voting , 2005, Inf. Fusion.

[15]  Rada Mihalcea,et al.  Multilingual Subjectivity Analysis Using Machine Translation , 2008, EMNLP.

[16]  Taghi M. Khoshgoftaar,et al.  A survey of transfer learning , 2016, Journal of Big Data.

[17]  Anindya Datta,et al.  Domain Adaptation for Sentiment Classification in Light of Multiple Sources , 2014, INFORMS J. Comput..

[18]  Sayyam Malik,et al.  Sentiment Analysis Approaches and Applications: A Survey , 2018, International Journal of Computer Applications.

[19]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[20]  Kerstin Denecke,et al.  Are SentiWordNet scores suited for multi-domain sentiment classification? , 2009, 2009 Fourth International Conference on Digital Information Management.

[21]  Ling Liu,et al.  Do online reviews affect product sales? The role of reviewer characteristics and temporal effects , 2008, Inf. Technol. Manag..

[22]  Bir Bhanu,et al.  Words alignment based on association rules for cross-domain sentiment classification , 2018, Frontiers of Information Technology & Electronic Engineering.

[23]  Cheng Wang,et al.  Co-training for Semi-supervised Sentiment Classification Based on Dual-view Bags-of-words Representation , 2015, ACL.

[24]  S. Bharathi,et al.  Sentiment Analysis for Effective Stock Market Prediction , 2017 .

[25]  Xuanjing Huang,et al.  Cross-Domain Sentiment Classification with Target Domain Specific Information , 2018, ACL.

[26]  Ingoo Han,et al.  The Effect of On-Line Consumer Reviews on Consumer Purchasing Intention: The Moderating Role of Involvement , 2007, Int. J. Electron. Commer..

[27]  Qiang Yang,et al.  Bridging Domains Using World Wide Knowledge for Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[28]  Tony R. Martinez,et al.  The robustness of majority voting compared to filtering misclassified instances in supervised classification tasks , 2016, Artificial Intelligence Review.