Recursive Deep Learning for Sentiment Analysis over Social Data

Sentiment analysis has now become a popular research problem to tackle in NLP field. However, there are very few researches conducted on sentiment analysis for Chinese. Progress is held back due to lack of large and labelled corpus and powerful models. To remedy this deficiency, we build a Chinese Sentiment Treebank over social data. It concludes 13550 labeled sentences which are from movie reviews. Furthermore, we introduce a novel Recursive Neural Deep Model (RNDM) to predict sentiment label based on recursive deep learning. We consider the problem of classifying one sentence by overall sentiment, determining a review is positive or negative. On predicting sentiment label at sentence level, our model outperforms other commonly used baselines, such as Naïve Bayes, Maximum Entropy and SVM, by a large margin.

[1]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[2]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[3]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[4]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[5]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[6]  Claire Cardie,et al.  Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis , 2008, EMNLP.

[7]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[8]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[9]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[10]  Vibhu O. Mittal,et al.  Comparative Experiments on Sentiment Classification for Online Product Reviews , 2006, AAAI.

[11]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[12]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[13]  Ronan Collobert,et al.  Deep Learning for Efficient Discriminative Parsing , 2011, AISTATS.

[14]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[15]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[16]  Andrew Y. Ng,et al.  Improving Word Representations via Global Context and Multiple Word Prototypes , 2012, ACL.

[17]  Nizar Y. Habash,et al.  Handbook of Natural Language Processing, Second Edition , 2010 .

[18]  TanSongbo,et al.  An empirical study of sentiment analysis for chinese documents , 2008 .

[19]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[20]  Geoffrey E. Hinton,et al.  A Scalable Hierarchical Distributed Language Model , 2008, NIPS.

[21]  Jin Zhang,et al.  An empirical study of sentiment analysis for chinese documents , 2008, Expert Syst. Appl..

[22]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[23]  Kentaro Inui,et al.  Dependency Tree-based Sentiment Classification using CRFs with Hidden Variables , 2010, NAACL.

[24]  Mirella Lapata,et al.  Composition in Distributional Models of Semantics , 2010, Cogn. Sci..

[25]  Christopher D. Manning,et al.  Learning Continuous Phrase Representations and Syntactic Parsing with Recursive Neural Networks , 2010 .