An In-Depth Experimental Comparison of RNTNs and CNNs for Sentence Modeling

The goal of modeling sentences is to accurately represent their meaning for different tasks. A variety of deep learning architectures have been proposed to model sentences; however, little is known about their comparative performance on common ground, across a variety of datasets, and at the same level of optimization. In this paper, we provide such a novel comparison for two popular architectures: Recursive Neural Tensor Networks (RNTNs) and Convolutional Neural Networks (CNNs). Although RNTNs have been shown to work well in many cases, they require labels for the internal nodes of their parse trees, which must be produced by intensive manual annotation because the vanishing gradient problem prevents these labels from being learned from sentence-level supervision alone. To enable an extensive comparison of the two architectures, this paper employs two methods to label the internal nodes automatically: a rule-based method and a convolutional neural network, used here not as a competing model but as a labeling component within the RNTN pipeline. This enables us to compare these RNTN models with a relatively simple CNN architecture. Experiments conducted on a set of benchmark datasets demonstrate that the CNN outperforms the RNTNs based on automatic phrase labeling, whereas the RNTN based on manual labeling outperforms the CNN. These results corroborate that CNNs already offer good predictive performance and, at the same time, that more research on RNTNs is needed to further exploit sentence structure.
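
To illustrate what rule-based automatic labeling of internal nodes might look like, the following Python sketch propagates sentiment labels from lexicon-labeled leaves up a binary parse tree. The specific propagation rule, the 5-class sentiment scale, and all names (Node, label_tree, NEUTRAL) are illustrative assumptions, not the method described in the paper.

    # Hypothetical sketch of rule-based internal-node labeling.
    # Assumption: leaf labels come from a sentiment lexicon, and each
    # internal node inherits the label of whichever child is further
    # from neutral (ties default to the right child).

    from dataclasses import dataclass
    from typing import Optional

    NEUTRAL = 2  # assumed 5-class scale: 0 (very negative) .. 4 (very positive)

    @dataclass
    class Node:
        label: Optional[int] = None   # sentiment class, known at leaves
        left: Optional["Node"] = None
        right: Optional["Node"] = None

    def label_tree(node: Node) -> int:
        """Recursively assign labels to internal nodes of a binary parse tree."""
        if node.left is None and node.right is None:
            return node.label  # leaf: label taken from the lexicon
        l = label_tree(node.left)
        r = label_tree(node.right)
        # Rule: propagate the more polar (less neutral) child's label upward.
        node.label = l if abs(l - NEUTRAL) > abs(r - NEUTRAL) else r
        return node.label

    # Example: ("not", ("very", "good")) with lexicon labels 1, 2, 3.
    tree = Node(left=Node(label=1),
                right=Node(left=Node(label=2), right=Node(label=3)))
    label_tree(tree)
    print(tree.label)  # -> 3 under this simple rule (no negation handling)

The example also shows the limitation of such rules: without handling negation, "not very good" is labeled positive, which is one reason a learned labeler (such as the CNN-based one mentioned above) can be an attractive alternative.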