论文信息 - Document Modeling with Gated Recurrent Neural Network for Sentiment Classification

Document Modeling with Gated Recurrent Neural Network for Sentiment Classification

Document level sentiment classification remains a challenge: encoding the intrinsic relations between sentences in the semantic meaning of a document. To address this, we introduce a neural network model to learn vector-based document representation in a unified, bottom-up fashion. The model first learns sentence representation with convolutional neural network or long short-term memory. Afterwards, semantics of sentences and their relations are adaptively encoded in document representation with gated recurrent neural network. We conduct document level sentiment classification on four large-scale review datasets from IMDB and Yelp Dataset Challenge. Experimental results show that: (1) our neural model shows superior performances over several state-of-the-art algorithms; (2) gated recurrent neural network dramatically outperforms standard recurrent neural network in document modeling for sentiment classification. 1

[1] M. Bunge. Sense and reference , 1974 .

[2] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[4] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[5] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[6] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[7] Peter D. Turney. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[8] Bo Pang,et al. Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[9] Xiaojin Zhu,et al. Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .

[10] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[11] Amélie Marian,et al. Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[12] Gerhard Weikum,et al. The Bag-of-Opinions Method for Review Rating Prediction from Sparse Text Patterns , 2010, COLING.

[13] Mike Thelwall,et al. A Study of Information Retrieval Weighting Schemes for Sentiment Analysis , 2010, ACL.

[14] Mirella Lapata,et al. Composition in Distributional Models of Semantics , 2010, Cogn. Sci..

[15] Rui Xia,et al. Exploring the Use of Word Relation Features for Sentiment Classification , 2010, COLING.

[16] Claire Cardie,et al. Compositional Matrix-Space Models for Sentiment Analysis , 2011, EMNLP.

[17] Christopher Potts,et al. Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[18] Yoshua Bengio,et al. Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[19] Wei Gao,et al. Unsupervised Discovery of Discourse Relations for Eliminating Intra-sentence Polarity Ambiguities , 2011, EMNLP.

[20] Yoshua Bengio,et al. Joint Training of Deep Boltzmann Machines , 2012, ArXiv.

[21] Christopher D. Manning,et al. Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.

[22] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[23] Christopher Potts,et al. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[24] Phil Blunsom,et al. The Role of Syntax in Vector Space Models of Compositional Semantics , 2013, ACL.

[25] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[26] Andrew Y. Ng,et al. Parsing with Compositional Vector Grammars , 2013, ACL.

[27] Hod Lipson,et al. Re-embedding words , 2013, ACL.

[28] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[29] Wojciech Zaremba,et al. Learning to Execute , 2014, ArXiv.

[30] Jiwei Li,et al. Feature Weight Tuning for Recursive Neural Networks , 2014, ArXiv.

[31] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[32] Phil Blunsom,et al. A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[33] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[34] Yiqun Liu,et al. Do users rate or review?: boost phrase-level sentiment labeling with review-level sentiment classification , 2014, SIGIR.

[35] Georgiana Dinu,et al. Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors , 2014, ACL.

[36] Ming Zhou,et al. Adaptive Multi-Compositionality for Recursive Neural Models with Applications to Sentiment Analysis , 2014, AAAI.

[37] Jun Zhao,et al. Joint Opinion Relation Detection Using One-Class Deep Neural Network , 2014, COLING.

[38] Claire Cardie,et al. Deep Recursive Neural Networks for Compositionality in Language , 2014, NIPS.

[39] Alexander J. Smola,et al. Jointly modeling aspects, ratings and sentiments for movie recommendation (JMARS) , 2014, KDD.

[40] Christopher D. Manning,et al. Global Belief Recursive Neural Networks , 2014, NIPS.

[41] Ming Zhou,et al. Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification , 2014, ACL.

[42] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[43] Saif Mohammad,et al. Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[44] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[45] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.

[46] Misha Denil,et al. Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network , 2014, ArXiv.

[47] Christopher D. Manning,et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[48] Hongyu Guo,et al. Long Short-Term Memory Over Tree Structures , 2015, ArXiv.

[49] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[50] Ting Liu,et al. Learning Semantic Representations of Users and Products for Document Level Sentiment Classification , 2015, ACL.

[51] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[52] Daniel Jurafsky,et al. A Hierarchical Neural Autoencoder for Paragraphs and Documents , 2015, ACL.

[53] Yoshua Bengio,et al. Gated Feedback Recurrent Neural Networks , 2015, ICML.

[54] Han Zhao,et al. Self-Adaptive Hierarchical Sentence Model , 2015, IJCAI.

[55] Eduard H. Hovy,et al. When Are Tree Structures Necessary for Deep Learning of Representations? , 2015, EMNLP.

[56] Tong Zhang,et al. Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[57] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[58] Navneet Kaur,et al. Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[59] Philipp Koehn,et al. Synthesis Lectures on Human Language Technologies , 2016 .

[60] Lei Zhang,et al. Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.