A Fast Accurate Two-stage Training Algorithm for L1-regularized CRFs with Heuristic Line Search Strategy
[1] Manuela M. Veloso, et al. Feature selection in conditional random fields for activity recognition, 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[2] John E. Moody, et al. Note on Learning Rate Schedules for Stochastic Optimization, 1990, NIPS.
[3] Peter L. Bartlett, et al. Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks, 2008, J. Mach. Learn. Res.
[4] Jun'ichi Tsujii, et al. Evaluation and Extension of Maximum Entropy Models with Inequality Constraints, 2003, EMNLP.
[5] Christopher D. Manning, et al. Joint Learning Improves Semantic Role Labeling, 2005, ACL.
[6] Andrew McCallum, et al. Maximum Entropy Markov Models for Information Extraction and Segmentation, 2000, ICML.
[7] Mark W. Schmidt, et al. Accelerated training of conditional random fields with stochastic gradient methods, 2006, ICML.
[8] François Yvon, et al. Practical Very Large Scale CRFs, 2010, ACL.
[9] Xiao Chen, et al. The Fourth International Chinese Language Processing Bakeoff: Chinese Word Segmentation, Named Entity Recognition and Chinese POS Tagging, 2008, IJCNLP.
[10] Chun-Nan Hsu, et al. Training Conditional Random Fields by Periodic Step Size Adaptation for Large-Scale Text Mining, 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[11] Andrew McCallum, et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001, ICML.
[12] Thomas Emerson, et al. The Second International Chinese Word Segmentation Bakeoff, 2005, IJCNLP.
[13] Patrick Gallinari, et al. SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent, 2009, J. Mach. Learn. Res.
[14] Andrew McCallum, et al. Accurate Information Extraction from Research Papers using Conditional Random Fields, 2004, NAACL.
[15] Christopher D. Manning, et al. Efficient, Feature-based, Conditional Random Field Parsing, 2008, ACL.
[16] Sophia Ananiadou, et al. Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty, 2009, ACL.
[17] Jianfeng Gao, et al. A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing, 2007, ACL.
[18] Lawrence R. Rabiner, et al. A tutorial on hidden Markov models and selected applications in speech recognition, 1989, Proc. IEEE.
[19] Zhao Hai, et al. Chinese Word Segmentation: A Decade Review, 2007.
[20] Andrew McCallum, et al. An Introduction to Conditional Random Fields for Relational Learning, 2007.
[21] Jianfeng Gao, et al. Scalable training of L1-regularized log-linear models, 2007, ICML '07.
[22] Jorge Nocedal, et al. On the limited memory BFGS method for large scale optimization, 1989, Math. Program.