JEAM: A Novel Model for Cross-Domain Sentiment Classification Based on Emotion Analysis

Cross-domain sentiment classification (CSC) aims at learning a sentiment classifier for unlabeled data in the target domain based on the labeled data from a different source domain. Due to the differences of data distribution of two domains in terms of the raw features, the CSC problem is difficult and challenging. Previous researches mainly focused on concepts mining by clustering words across data domains, which ignored the importance of authors’ emotion contained in data, or the different representations of the emotion between domains. In this paper, we propose a novel framework to solve the CSC problem, by modelling the emotion across domains. We first develop a probabilistic model named JEAM to model author’s emotion state when writing. Then, an EM algorithm is introduced to solve the likelihood maximum problem and to obtain the latent emotion distribution of the author. Finally, a supervised learning method is utilized to assign the sentiment polarity to a given online review. Experiments show that our approach is effective and outperforms state-of-the-art approaches.

[1]  R. Plutchik Emotion, a psychoevolutionary synthesis , 1980 .

[2]  Kristen Grauman,et al.  Connecting the Dots with Landmarks: Discriminatively Learning Domain-Invariant Features for Unsupervised Domain Adaptation , 2013, ICML.

[3]  D. Rubin,et al.  A comparison of dimensional models of emotion: Evidence from emotions, prototypical events, autobiographical memories, and words , 2009, Memory.

[4]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[5]  Fuzhen Zhuang,et al.  Concept Learning for Cross-Domain Text Classification: A General Probabilistic Framework , 2013, IJCAI.

[6]  Chu-Ren Huang,et al.  Employing Personal/Impersonal Views in Supervised and Semi-Supervised Sentiment Classification , 2010, ACL.

[7]  Feiping Nie,et al.  Cross-language web page classification via dual knowledge transfer using nonnegative matrix tri-factorization , 2011, SIGIR.

[8]  H. Schlosberg Three dimensions of emotion. , 1954, Psychological review.

[9]  Haizhou Li,et al.  A cross-domain adaptation method for sentiment classification using probabilistic latent analysis , 2011, CIKM '11.

[10]  Hessel Haagsma,et al.  The 54th Annual Meeting of the Association for Computational Linguistics , 2016, ACL 2016.

[11]  Jianmin Wang,et al.  Dual Transfer Learning , 2012, SDM.

[12]  Qiang Yang,et al.  Cross-Domain Co-Extraction of Sentiment and Topic Lexicons , 2012, ACL.

[13]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[14]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.

[15]  Hongliang Yu,et al.  Identifying Sentiment Words Using an Optimization-based Model without Seed Words , 2013, ACL.

[16]  Qiang Yang,et al.  Co-clustering based classification for out-of-domain documents , 2007, KDD '07.

[17]  Fuzhen Zhuang,et al.  Triplex Transfer Learning: Exploiting Both Shared and Distinct Concepts for Text Classification , 2013, IEEE Transactions on Cybernetics.

[18]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[19]  Harith Alani,et al.  Automatically Extracting Polarity-Bearing Topics for Cross-Domain Sentiment Classification , 2011, ACL.

[20]  Chew Lim Tan,et al.  From Semantic to Emotional Space in Probabilistic Sense Sentiment Analysis , 2013, AAAI.

[21]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[22]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[23]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.