暂无分享,去创建一个
[1] Donald R. Smith. Variational methods in optimization , 1974 .
[2] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[3] Jorge Nocedal,et al. A Numerical Study of the Limited Memory BFGS Method and the Truncated-Newton Method for Large Scale Optimization , 1991, SIAM J. Optim..
[4] Rich Caruana,et al. Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.
[5] Robert A. Jacobs,et al. Hierarchical Mixtures of Experts and the EM Algorithm , 1993, Neural Computation.
[6] Jorge J. Moré,et al. Evaluation of Large-scale Optimization Problems on Vector and Parallel Architectures , 1994, SIAM J. Optim..
[7] Sebastian Thrun,et al. Is Learning The n-th Thing Any Easier Than Learning The First? , 1995, NIPS.
[8] M. Ostendorf,et al. Using out-of-domain data to improve in-domain language models , 1997, IEEE Signal Processing Letters.
[9] John D. Lafferty,et al. Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..
[10] Alex Pentland,et al. Maximum Conditional Likelihood via Bound Maximization and the CEM Algorithm , 1998, NIPS.
[11] Stanley F. Chen,et al. A Gaussian Prior for Smoothing Maximum Entropy Models , 1999 .
[12] Adam Tauman Kalai,et al. On-line algorithms for combining language models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[13] Rebecca Hwa. Supervised Grammar Induction using Training Data with Limited Constituent Information , 1999, ACL.
[14] Jonathan Baxter,et al. A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..
[15] Andrew McCallum,et al. Maximum Entropy Markov Models for Information Extraction and Segmentation , 2000, ICML.
[16] Geoffrey Zweig,et al. Information Extraction from Voicemail , 2001, ACL.
[17] Daniel Gildea,et al. Corpus Variation and Parser Performance , 2001, EMNLP.
[18] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[19] Jerry R. Hobbs. Information extraction from biomedical text , 2002, J. Biomed. Informatics.
[20] Rob Malouf,et al. A Comparison of Algorithms for Maximum Entropy Parameter Estimation , 2002, CoNLL.
[21] Brian Roark,et al. Unsupervised language model adaptation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[22] Brian Roark,et al. Supervised and unsupervised PCFG adaptation to novel domains , 2003, NAACL.
[23] George Forman,et al. An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..
[24] Rich Caruana,et al. Multitask Learning , 1997, Machine Learning.
[25] Alex Acero,et al. Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo , 2006, Comput. Speech Lang..
[26] Owen Rambow,et al. Summarizing Email Threads , 2004, NAACL.
[27] T. Minka. A comparison of numerical optimizers for logistic regression , 2004 .
[28] Dragos Stefan Munteanu,et al. Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005, CL.
[29] Min-Yen Kan,et al. Customization in a unified framework for summarizing medical literature , 2005, Artif. Intell. Medicine.