Hierarchical Distributed Representations for Statistical Language Modeling
暂无分享,去创建一个
John Blitzer | Fernando Pereira | Kilian Q. Weinberger | Lawrence K. Saul | L. Saul | Fernando C Pereira | John Blitzer
[1] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[2] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.
[3] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.
[4] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.
[5] Frederick Jelinek,et al. Statistical methods for speech recognition , 1997 .
[6] Fernando Pereira,et al. Aggregate and mixed-order Markov models for statistical language processing , 1997, EMNLP.
[7] Thomas Hofmann,et al. Mixture models for co-occurrence and histogram data , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).
[8] B. Borchers. A C library for semidefinite programming , 1999 .
[9] Henry Wolkowicz,et al. Solving Euclidean Distance Matrix Completion Problems Via Semidefinite Programming , 1999, Comput. Optim. Appl..
[10] B. Borchers. CSDP, A C library for semidefinite programming , 1999 .
[11] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[12] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..
[13] Kilian Q. Weinberger,et al. Learning a kernel matrix for nonlinear dimensionality reduction , 2004, ICML.
[14] CHENGXIANG ZHAI,et al. A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.
[15] Bernhard Schölkopf,et al. A kernel view of the dimensionality reduction of manifolds , 2004, ICML.