论文信息 - Analysis with Deeply Learned Distributed Representations of Variable Length Texts

Analysis with Deeply Learned Distributed Representations of Variable Length Texts

Learning good semantic vector representations for phrases, sentences and paragraphs is a challenging and ongoing area of research in natural language processing and understanding. In this project, we survey and implement several deeplearning and deep-learning-inspired approaches and evaluate these algorithms on several sentiment-labeled datasets and analysis tasks. In doing so, we demonstrate new state-of-the-art performance on the IMDB Large Movie Review Dataset [5] using highly-tuned paragraph vectors [4], and highly competitive performance on the Stanford Sentiment Treebank dataset [8] using Deep Recursive-NNs and LSTMs for both binary and fine classification tasks. Finally, we compare and analyze each model’s performance on our selection of sentiment analysis tasks.

James Hong | Michael Fang

[1] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[2] Bo Pang,et al. Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[3] Christopher Potts,et al. Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[4] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[5] Christopher Potts,et al. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[6] Claire Cardie,et al. Deep Recursive Neural Networks for Compositionality in Language , 2014, NIPS.

[7] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.

[8] Christopher D. Manning,et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.