Interpretable Structure-aware Document Encoders with Hierarchical Attention
暂无分享,去创建一个
Martin Jaggi | Claudiu Musat | Michael Baeriswyl | Khalil Mrini | Martin Jaggi | C. Musat | Khalil Mrini | Michael Baeriswyl
[1] J. Hanley,et al. The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.
[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[3] Jürgen Schmidhuber,et al. Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.
[4] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[5] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[6] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[7] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.
[8] Phong Le,et al. Compositional Distributional Semantics with Long Short Term Memory , 2015, *SEMEVAL.
[9] Hongyu Guo,et al. Long Short-Term Memory Over Recursive Structures , 2015, ICML.
[10] Christopher D. Manning,et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.
[11] Quoc V. Le,et al. Semi-supervised Sequence Learning , 2015, NIPS.
[12] Sanja Fidler,et al. Skip-Thought Vectors , 2015, NIPS.
[13] Wei Xu,et al. Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.
[14] Han Zhao,et al. Self-Adaptive Hierarchical Sentence Model , 2015, IJCAI.
[15] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.
[16] Eric Nichols,et al. Named Entity Recognition with Bidirectional LSTM-CNNs , 2015, TACL.
[17] Mirella Lapata,et al. Long Short-Term Memory-Networks for Machine Reading , 2016, EMNLP.
[18] Felix Hill,et al. Learning Distributed Representations of Sentences from Unlabelled Data , 2016, NAACL.
[19] Diyi Yang,et al. Hierarchical Attention Networks for Document Classification , 2016, NAACL.
[20] Florian Schmidt,et al. Neural Document Embeddings for Intensive Care Patient Mortality Prediction , 2016, NIPS 2016.
[21] Li Zhao,et al. Attention-based LSTM for Aspect-level Sentiment Classification , 2016, EMNLP.
[22] Charles Elkan,et al. Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.
[23] Peter Szolovits,et al. MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.
[24] Yoshimasa Tsuruoka,et al. Tree-to-Sequence Attentional Neural Machine Translation , 2016, ACL.
[25] Sanjeev Arora,et al. A Simple but Tough-to-Beat Baseline for Sentence Embeddings , 2017, ICLR.
[26] Zhen-Hua Ling,et al. Enhanced LSTM for Natural Language Inference , 2016, ACL.
[27] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.
[28] Holger Schwenk,et al. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data , 2017, EMNLP.
[29] Matteo Pagliardini,et al. Unsupervised Learning of Sentence Embeddings Using Compositional n-Gram Features , 2017, NAACL.
[30] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.