[1] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.
[2] Sebastian Ruder,et al. A survey of cross-lingual embedding models , 2017, ArXiv.
[3] Wenlin Chen,et al. Strategies for Training Large Vocabulary Neural Language Models , 2015, ACL.
[4] Pradeep Dubey,et al. BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies , 2015, ICLR.
[5] Georgiana Dinu,et al. Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning , 2015, ACL.
[6] Yonghui Wu,et al. Exploring the Limits of Language Modeling , 2016, ArXiv.
[7] Geoffrey E. Hinton,et al. Regularizing Neural Networks by Penalizing Confident Output Distributions , 2017, ICLR.
[8] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[9] Alon Lavie,et al. Meteor Universal: Language Specific Translation Evaluation for Any Target Language , 2014, WMT@ACL.
[10] Mauro Cettolo,et al. The IWSLT 2016 Evaluation Campaign , 2016, IWSLT.
[11] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[12] R. Gray,et al. Vector quantization , 1984, IEEE ASSP Magazine.
[13] Yoshua Bengio,et al. Quick Training of Probabilistic Neural Nets by Importance Sampling , 2003 .
[14] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[15] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.
[16] Dong Wang,et al. Normalized Word Embedding and Orthogonal Transform for Bilingual Word Translation , 2015, NAACL.
[17] Lior Wolf,et al. Using the Output Embedding to Improve Language Models , 2016, EACL.
[18] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[19] E. L. Lehmann,et al. Theory of point estimation , 1950 .
[20] Xin Jiang,et al. Neural Generative Question Answering , 2015, IJCAI.
[21] Satoshi Nakamura,et al. Neural Machine Translation via Binary Code Prediction , 2017, ACL.
[22] Noah A. Smith,et al. A Simple, Fast, and Effective Reparameterization of IBM Model 2 , 2013, NAACL.
[23] Richard Socher,et al. A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.
[24] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[25] Alexander M. Rush,et al. OpenNMT: Open-Source Toolkit for Neural Machine Translation , 2017, ACL.
[26] Omer Levy,et al. Dependency-Based Word Embeddings , 2014, ACL.
[27] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[28] Geoffrey Zweig,et al. Toward Human Parity in Conversational Speech Recognition , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[29] Quoc V. Le,et al. A Neural Conversational Model , 2015, ArXiv.
[30] J. Segura,et al. A new type of sharp bounds for ratios of modified Bessel functions , 2016, 1606.02008.
[31] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.
[32] Karin M. Verspoor,et al. Findings of the 2016 Conference on Machine Translation , 2016, WMT.
[33] Jacob Eisenstein,et al. Mimicking Word Embeddings using Subword RNNs , 2017, EMNLP.
[34] Wang Ling,et al. Two/Too Simple Adaptations of Word2Vec for Syntax Problems , 2015, NAACL.
[35] Jeff Johnson,et al. Billion-Scale Similarity Search with GPUs , 2017, IEEE Transactions on Big Data.
[36] C. Bishop. Mixture density networks , 1994 .
[37] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[38] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[39] Tomas Mikolov,et al. Enriching Word Vectors with Subword Information , 2016, TACL.
[40] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[41] Graham Neubig,et al. Stronger Baselines for Trustable Results in Neural Machine Translation , 2017, NMT@ACL.
[42] Koray Kavukcuoglu,et al. Learning word embeddings efficiently with noise-contrastive estimation , 2013, NIPS.
[43] Pascal Vincent,et al. An Exploration of Softmax Alternatives Belonging to the Spherical Loss Family , 2015, ICLR.