Joint Feature Selection in Distributed Stochastic Learning for Large-Scale Discriminative Training in SMT
暂无分享,去创建一个
[1] Léon Bottou,et al. Stochastic Learning , 2003, Advanced Lectures on Machine Learning.
[2] James Theiler,et al. Grafting: Fast, Incremental Feature Selection by Gradient Descent in Function Space , 2003, J. Mach. Learn. Res..
[3] Ben Taskar,et al. Alignment by Agreement , 2006, NAACL.
[4] Hermann Ney,et al. Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.
[5] Anoop Sarkar,et al. Discriminative Reranking for Machine Translation , 2004, NAACL.
[6] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.
[7] Ben Taskar,et al. An End-to-End Discriminative Approach to Machine Translation , 2006, ACL.
[8] Taro Watanabe,et al. NTT statistical machine translation for IWSLT 2006 , 2006, IWSLT.
[9] Taro Watanabe,et al. Structural support vector machines for log-linear approach in statistical machine translation , 2009, IWSLT.
[10] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.
[11] Kevin Duh,et al. N-Best Reranking by Multitask Learning , 2010, WMT@ACL.
[12] Chris Dyer,et al. Using a maximum entropy model to build segmentation lattices for MT , 2009, NAACL.
[13] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[14] Khalil Sima'an,et al. A Consistent and Efficient Estimator for Data-Oriented Parsing , 2005, J. Autom. Lang. Comb..
[15] Kevin Duh,et al. Beyond Log-Linear Models: Boosted Minimum Error Rate Training for N-best Re-ranking , 2008, ACL.
[16] Stefan Riezler,et al. On Some Pitfalls in Automatic Evaluation and Significance Testing for MT , 2005, IEEvaluation@ACL.
[17] Mark Hopkins,et al. Tuning as Ranking , 2011, EMNLP.
[18] K. J. Evans,et al. Computer Intensive Methods for Testing Hypotheses: An Introduction , 1990 .
[19] Franz Josef Och,et al. Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.
[20] Alexander J. Smola,et al. Parallelized Stochastic Gradient Descent , 2010, NIPS.
[21] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[22] Tong Zhang,et al. A Discriminative Global Training Algorithm for Statistical MT , 2006, ACL.
[23] Gideon S. Mann,et al. Distributed Training Strategies for the Structured Perceptron , 2010, NAACL.
[24] Noah A. Smith,et al. The CMU-ARK German-English Translation System , 2011, WMT@EMNLP.
[25] Daniel Marcu,et al. Statistical Phrase-Based Translation , 2003, NAACL.
[26] Hermann Ney,et al. Improved Statistical Alignment Models , 2000, ACL.
[27] Phil Blunsom,et al. A Discriminative Latent Variable Model for Statistical Machine Translation , 2008, ACL.
[28] Ben Taskar,et al. Joint covariate selection and joint subspace selection for multiple classification problems , 2010, Stat. Comput..
[29] Noah A. Smith,et al. Structured Ramp Loss Minimization for Machine Translation , 2012, HLT-NAACL.
[30] Shankar Kumar,et al. Efficient Minimum Error Rate Training and Minimum Bayes-Risk Decoding for Translation Hypergraphs and Lattices , 2009, ACL/IJCNLP.
[31] Jason Weston,et al. Embedded Methods , 2006, Feature Extraction.
[32] Adam Lopez,et al. Hierarchical Phrase-Based Translation with Suffix Arrays , 2007, EMNLP.
[33] S. Sathiya Keerthi,et al. Efficient algorithms for ranking with SVMs , 2010, Information Retrieval.
[34] David Chiang,et al. Hierarchical Phrase-Based Translation , 2007, CL.
[35] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..
[36] I. Dan Melamed,et al. Toward Purely Discriminative Training for Tree-Structured Translation Models , 2008 .
[37] S. T. Buckland,et al. Computer-Intensive Methods for Testing Hypotheses. , 1990 .
[38] Masoud Nikravesh,et al. Feature Extraction - Foundations and Applications , 2006, Feature Extraction.
[39] Kevin Knight,et al. 11,001 New Features for Statistical Machine Translation , 2009, NAACL.
[40] David A. McAllester,et al. Generalization bounds and consistency for latent-structural probit and ramp loss , 2011, MLSLP.
[41] Taro Watanabe,et al. Online Large-Margin Training for Statistical Machine Translation , 2007, EMNLP.
[42] Stefan Riezler,et al. Structural and Topical Dimensions in Multi-Task Patent Translation , 2012, EACL.
[43] Aravind K. Joshi,et al. Ranking and Reranking with Perceptron , 2005, Machine Learning.
[44] David Chiang,et al. A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.
[45] Gunnar Rätsch,et al. Advanced lectures on machine learning : ML Summer Schools 2003, Canberra, Australia, February 2-14, 2003, Tübingen, Germany, August 4-16, 2003 : revised lectures , 2004 .
[46] Philip Resnik,et al. Online Large-Margin Training of Syntactic and Structural Translation Features , 2008, EMNLP.
[47] Samy Bengio,et al. Links between perceptrons, MLPs and SVMs , 2004, ICML.
[48] Yoav Freund,et al. Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.
[49] Michael Collins,et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.