Efficient Estimation of Word Representations in Vector Space
暂无分享,去创建一个
Jeffrey Dean | Tomas Mikolov | Gregory S. Corrado | Kai Chen | J. Dean | G. Corrado | Kai Chen | Tomas Mikolov
[1] D. Rumelhart. Learning internal representations by back-propagating errors , 1986 .
[2] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .
[3] Geoffrey E. Hinton,et al. Distributed Representations , 1986, The Philosophy of Artificial Intelligence.
[4] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[5] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..
[6] Peter D. Turney. Measuring Semantic Similarity by Latent Relational Analysis , 2005, IJCAI.
[7] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.
[8] Pavel Krbec,et al. Language Modeling for Speech Recognition of Czech , 2006 .
[9] Holger Schwenk,et al. Continuous space language models , 2007, Comput. Speech Lang..
[10] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .
[11] Thorsten Brants,et al. Large Language Models in Machine Translation , 2007, EMNLP.
[12] Geoffrey E. Hinton,et al. Three new graphical models for statistical language modelling , 2007, ICML '07.
[13] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.
[14] Geoffrey E. Hinton,et al. A Scalable Hierarchical Distributed Language Model , 2008, NIPS.
[15] Lukás Burget,et al. Neural network based language models for highly inflective languages , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[16] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[17] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.
[18] Yoshua Bengio,et al. Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.
[19] Lukás Burget,et al. Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[20] Christopher Potts,et al. Learning Word Vectors for Sentiment Analysis , 2011, ACL.
[21] Lukás Burget,et al. Empirical Evaluation and Combination of Advanced Language Modeling Techniques , 2011, INTERSPEECH.
[22] Jeffrey Pennington,et al. Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection , 2011, NIPS.
[23] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..
[24] Lukás Burget,et al. Strategies for training large scale neural network language models , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[25] Christopher J. C. Burges,et al. The Microsoft Research Sentence Completion Challenge , 2011 .
[26] Andrew Y. Ng,et al. Improving Word Representations via Global Context and Multiple Word Prototypes , 2012, ACL.
[27] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[28] Saif Mohammad,et al. SemEval-2012 Task 2: Measuring Degrees of Relational Similarity , 2012, *SEMEVAL.
[29] Yee Whye Teh,et al. A fast and simple algorithm for training neural probabilistic language models , 2012, ICML.
[30] Vysoké Učení,et al. Statistical Language Models Based on Neural Networks , 2012 .
[31] Geoffrey Zweig,et al. Combining Heterogeneous Models for Measuring Relational Similarity , 2013, NAACL.
[32] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[33] Geoffrey Zweig,et al. Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.