暂无分享,去创建一个
[1] Lukás Burget,et al. Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Andrew Gordon Wilson,et al. Deep Kernel Learning , 2015, AISTATS.
[3] Thorsten Joachims,et al. Optimizing search engines using clickthrough data , 2002, KDD.
[4] Eric T. Nalisnick,et al. Under review as a conference paper at ICLR 2016 , 2015 .
[5] Andrew McCallum,et al. Word Representations via Gaussian Embedding , 2014, ICLR.
[6] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[7] Lukás Burget,et al. Strategies for training large scale neural network language models , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[8] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[9] Zhiyuan Liu,et al. A Unified Model for Word Sense Representation and Disambiguation , 2014, EMNLP.
[10] Tony Jebara,et al. Probability Product Kernels , 2004, J. Mach. Learn. Res..
[11] Felix Hill,et al. SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation , 2014, CL.
[12] Elia Bruni,et al. Multimodal Distributional Semantics , 2014, J. Artif. Intell. Res..
[13] Ehud Rivlin,et al. Placing search in context: the concept revisited , 2002, TOIS.
[14] David M. W. Powers,et al. Verb similarity on the taxonomy of WordNet , 2006 .
[15] Klaus-Robert Müller,et al. Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.
[16] C. Spearman. The proof and measurement of association between two things. , 2015, International journal of epidemiology.
[17] Zhiyuan Liu,et al. Topical Word Embeddings , 2015, AAAI.
[18] Evgeniy Gabrilovich,et al. A word at a time: computing word relatedness using temporal semantic analysis , 2011, WWW.
[19] Oriol Vinyals,et al. Bayesian Recurrent Neural Networks , 2017, ArXiv.
[20] Omer Levy,et al. Neural Word Embedding as Implicit Matrix Factorization , 2014, NIPS.
[21] Silvia Bernardini,et al. The WaCky wide web: a collection of very large linguistically processed web-crawled corpora , 2009, Lang. Resour. Evaluation.
[22] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.
[23] Enhong Chen,et al. A Probabilistic Model for Learning Multi-Prototype Word Embeddings , 2014, COLING.
[24] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.
[25] Zhe Gan,et al. Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling , 2016, ACL.
[26] Raffaella Bernardi,et al. Entailment above the word level in distributional semantics , 2012, EACL.
[27] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[28] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..
[29] Christopher D. Manning,et al. Better Word Representations with Recursive Neural Networks for Morphology , 2013, CoNLL.
[30] Andrew Gordon Wilson,et al. Stochastic Variational Deep Kernel Learning , 2016, NIPS.
[31] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.
[32] Ruslan Salakhutdinov,et al. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.
[33] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[34] Evgeniy Gabrilovich,et al. Large-scale learning of word relatedness with constraints , 2012, KDD.
[35] Andrew McCallum,et al. Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space , 2014, EMNLP.
[36] Andrew Gordon Wilson,et al. Learning Scalable Deep Kernels with Recurrent Structure , 2016, J. Mach. Learn. Res..
[37] Andrew Y. Ng,et al. Improving Word Representations via Global Context and Multiple Word Prototypes , 2012, ACL.
[38] Xuanjing Huang,et al. Gaussian Mixture Embeddings for Multiple Word Prototypes , 2015, ArXiv.
[39] John B. Goodenough,et al. Contextual correlates of synonymy , 1965, CACM.
[40] Aapo Hyvärinen,et al. Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..
[41] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[42] G. Miller,et al. Contextual correlates of semantic similarity , 1991 .
[43] Geoffrey E. Hinton,et al. A Scalable Hierarchical Distributed Language Model , 2008, NIPS.
[44] Yoshua Bengio,et al. Hierarchical Probabilistic Neural Network Language Model , 2005, AISTATS.