Grasp the implicit features: Hierarchical emotion classification based on topic model and SVM

Microblog post has been a hot research source for emotion classification in recent years. However, due to bloggers' free narrative style and topics' timeliness, the data from microblog post is usually implicit and imbalanced. In this paper, the problems of emotion classification in Chinese microblog posts are solved in a hierarchical way using a knowledge-based topic model and Support Vector Machine(SVM). Based on topic model, an implicit feature detection algorithm is proposed to identify the latent emotions underlying the microblog posts. Meanwhile, a hierarchical emotion structure is employed to classify emotions into 19 classes of four levels by SVM. This structure can meet different research requirements at three granularities. The experiment results validate that our model can achieve better performance in terms of precision, recall and F-scores.

[1]  Xinying Xu,et al.  Hidden sentiment association in chinese web opinion mining , 2008, WWW.

[2]  Xiaolong Wang,et al.  On-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering , 2010, J. Comput..

[3]  Christopher C. Yang,et al.  An Association-Based Unified Framework for Mining Features and Opinion Words , 2015, ACM Trans. Intell. Syst. Technol..

[4]  Zhen Hai,et al.  Implicit Feature Identification via Co-occurrence Association Rule Mining , 2011, CICLing.

[5]  Marie-Francine Moens,et al.  Automatic Sentiment Analysis in On-line Text , 2007, ELPUB.

[6]  Peng Wang,et al.  A robust framework for short text categorization based on topic model and integrated classifier , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[7]  Diana Inkpen,et al.  Prior and contextual emotion of words in sentential context , 2014, Comput. Speech Lang..

[8]  Carlo Strapparava,et al.  Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text , 2010 .

[9]  Ting Liu,et al.  Creating a Fine-Grained Corpus for Chinese Sentiment Analysis , 2015, IEEE Intelligent Systems.

[10]  Yejin Choi,et al.  Connotation Lexicon: A Dash of Sentiment Beneath the Surface Meaning , 2013, ACL.

[11]  Hua Xu,et al.  A rule-based approach to emotion cause detection for Chinese micro-blogs , 2015, Expert Syst. Appl..

[12]  Hua Xu,et al.  Implicit Feature Detection via a Constrained Topic Model and SVM , 2013, EMNLP.

[13]  Luc Van Gool,et al.  Modeling scenes with local descriptors and latent aspects , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[14]  Shuzhi Sam Ge,et al.  Emotion analysis of children's stories with context information , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[15]  Francis R. Bach,et al.  Rethinking LDA: Moment Matching for Discrete ICA , 2015, NIPS.

[16]  1993 International Joint Conference on AI , 1994, IEEE Expert.

[17]  Xiaojin Zhu,et al.  Latent Dirichlet Allocation with Topic-in-Set Knowledge , 2009, HLT-NAACL 2009.

[18]  Kim Schouten,et al.  Finding Implicit Features in Consumer Reviews for Sentiment Analysis , 2014, ICWE.

[19]  Ye Wei,et al.  A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs , 2015 .

[20]  Susumu Horiguchi,et al.  Learning to classify short and sparse text & web with hidden topics from large-scale data collections , 2008, WWW.

[21]  Matthew Purver,et al.  Predicting Emotion Labels for Chinese Microblog Texts , 2012, SDAD@ECML/PKDD.

[22]  Jingping Bi,et al.  A word distributed representation based framework for large-scale short text classification , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[23]  Qiaozhu Mei,et al.  Understanding the Limiting Factors of Topic Modeling via Posterior Contraction Analysis , 2014, ICML.

[24]  Stan Szpakowicz,et al.  Hierarchical versus Flat Classification of Emotions in Text , 2010, HLT-NAACL 2010.

[25]  Carlo Strapparava,et al.  Learning to identify emotions in text , 2008, SAC '08.

[26]  Desheng Dash Wu,et al.  A Decision Support Approach for Online Stock Forum Sentiment Analysis , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[27]  Diana Inkpen,et al.  Detecting Emotion Stimuli in Emotion-Bearing Sentences , 2015, CICLing.

[28]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[29]  Hsin-Hsi Chen,et al.  Mining opinions from the Web: Beyond relevance retrieval , 2007 .

[30]  Thomas L. Griffiths,et al.  Hierarchical Topic Models and the Nested Chinese Restaurant Process , 2003, NIPS.

[31]  Janyce Wiebe,et al.  An Investigation for Implicatures in Chinese : Implicatures in Chinese and in English are similar ! , 2014, WASSA@ACL.

[32]  Brian D. Davison,et al.  Empirical study of topic modeling in Twitter , 2010, SOMA '10.

[33]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[34]  Mengen Chen,et al.  Short Text Classification Improved by Learning Multi-Granularity Topics , 2011, IJCAI.

[35]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[36]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.

[37]  Jiun-Hung Chen,et al.  A multi-label classification based approach for sentiment classification , 2015, Expert Syst. Appl..

[38]  P. Ekman,et al.  DIFFERENCES Universals and Cultural Differences in the Judgments of Facial Expressions of Emotion , 2004 .

[39]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[40]  Hua Xu,et al.  Hierarchical emotion classification and emotion component analysis on chinese micro-blog posts , 2015, Expert Syst. Appl..

[41]  Jon Atle Gulla,et al.  Sentiment analysis in a hybrid hierarchical classification process , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[42]  Bin Gu,et al.  Incremental learning for ν-Support Vector Regression , 2015, Neural Networks.