Learning Distributed Representations for Structured Output Prediction
[1] Pascal Vincent, et al. Representation Learning: A Review and New Perspectives, 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[2] Yoshua Bengio, et al. Word Representations: A Simple and General Method for Semi-Supervised Learning, 2010, ACL.
[3] Regina Barzilay, et al. Low-Rank Tensors for Scoring Dependency Structures, 2014, ACL.
[4] Paul Smolensky. Tensor Product Variable Binding and the Representation of Symbolic Structures in Connectionist Systems, 1990, Artificial Intelligence.
[5] Tony A. Plate, et al. Holographic reduced representations, 1995, IEEE Trans. Neural Networks.
[6] Stephen P. Boyd, et al. Semidefinite Programming, 1996, SIAM Rev.
[7] Slav Petrov, et al. A Universal Part-of-Speech Tagset, 2011, LREC.
[8] Sebastian Riedel, et al. The CoNLL 2007 Shared Task on Dependency Parsing, 2007, EMNLP.
[9] Dan Klein, et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network, 2003, NAACL.
[10] Shimon Ullman, et al. Uncovering shared structures in multiclass classification, 2007, ICML.
[11] Stephen P. Boyd, et al. Rank minimization and applications in system theory, 2004, Proceedings of the 2004 American Control Conference.
[12] Jeffrey Dean, et al. Efficient Estimation of Word Representations in Vector Space, 2013, ICLR.
[13] Honglak Lee, et al. An Analysis of Single-Layer Networks in Unsupervised Feature Learning, 2011, AISTATS.
[14] Claudio Gentile, et al. Hierarchical classification: combining Bayes with SVM, 2006, ICML.
[15] Andrew Y. Ng, et al. Semantic Compositionality through Recursive Matrix-Vector Spaces, 2012, EMNLP.
[16] Michael Collins, et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms, 2002, EMNLP.
[17] Andrew McCallum, et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001, ICML.
[18] Francis R. Bach, et al. Low-rank matrix factorization with attributes, 2006, ArXiv.
[19] Ken Lang, et al. NewsWeeder: Learning to Filter Netnews, 1995, ICML.
[20] Thomas Hofmann, et al. Large Margin Methods for Structured and Interdependent Output Variables, 2005, J. Mach. Learn. Res.
[21] Jason Weston, et al. Natural Language Processing (Almost) from Scratch, 2011, J. Mach. Learn. Res.
[22] Thorsten Brants, et al. TnT – A Statistical Part-of-Speech Tagger, 2000, ANLP.
[23] Tommi S. Jaakkola, et al. Maximum-Margin Matrix Factorization, 2004, NIPS.
[24] Stephen P. Boyd, et al. Proximal Algorithms, 2013, Found. Trends Optim.
[25] Massimiliano Pontil, et al. Multi-Task Feature Learning, 2006, NIPS.
[26] Ann Bies, et al. The Penn Treebank: Annotating Predicate Argument Structure, 1994, HLT.