Neural Network-Based Vector Representation of Documents for Reader-Emotion Categorization

In this paper, we propose a novel approach to reader-emotion categorization that uses word embeddings learned by neural networks together with an SVM classifier. The primary objective of such word embedding methods is to learn continuous distributed vector representations of words through neural networks. These representations can capture semantic context and syntactic cues, and can subsequently be used to infer similarity measures among words, sentences, and even documents. We test various methods of combining the word embeddings for their performance on reader-emotion categorization of a Chinese news corpus. Results demonstrate that the proposed method can achieve comparable or even better performance than several other approaches.
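As a rough illustration of what combining word embeddings into a document vector can look like, the sketch below shows two common options, an unweighted mean and a weighted mean, followed by a cosine similarity between the resulting vectors. It is not the paper's implementation: the abstract does not say which combination methods were tested, so the toy 3-dimensional vectors, the weights, and the function names `average_vector`, `weighted_vector`, and `cosine` are all assumptions made for this example. In a full pipeline, document vectors of this kind would serve as the input features of the SVM classifier.

```python
import math

# Hypothetical toy embeddings (3 dimensions). Real embeddings would be
# learned by a neural model from a large corpus and have far more dimensions.
EMBEDDINGS = {
    "happy": [0.9, 0.1, 0.0],
    "joyful": [0.8, 0.2, 0.1],
    "sad": [-0.7, 0.3, 0.1],
    "news": [0.0, 0.1, 0.9],
}


def average_vector(tokens, embeddings):
    """Combine word vectors by element-wise mean, skipping unknown tokens."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        dim = len(next(iter(embeddings.values())))
        return [0.0] * dim
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def weighted_vector(tokens, embeddings, weights):
    """Combine word vectors by a weighted mean (e.g. IDF-style weights).

    Tokens without an explicit weight default to 1.0 (an assumption).
    """
    known = [t for t in tokens if t in embeddings]
    dim = len(next(iter(embeddings.values())))
    total = sum(weights.get(t, 1.0) for t in known)
    if total == 0:
        return [0.0] * dim
    result = [0.0] * dim
    for t in known:
        w = weights.get(t, 1.0)
        for i, value in enumerate(embeddings[t]):
            result[i] += w * value
    return [value / total for value in result]


def cosine(u, v):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


doc_a = average_vector(["happy", "joyful"], EMBEDDINGS)
doc_b = average_vector(["sad"], EMBEDDINGS)
doc_c = average_vector(["joyful"], EMBEDDINGS)
# With these toy vectors, doc_a is closer to doc_c than to doc_b.
closer_to_joyful = cosine(doc_a, doc_c) > cosine(doc_a, doc_b)
```

The mean treats every word equally, while the weighted variant lets informative words dominate the document vector; which choice works better for reader-emotion categorization is an empirical question.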
