Feature-Frequency–Adaptive On-line Training for Fast and Accurate Natural Language Processing
暂无分享,去创建一个
Xu Sun | Houfeng Wang | Qin Lu | Wenjie Li | Xu Sun | Houfeng Wang | Wenjie Li | Q. Lu
[1] Koby Crammer,et al. Confidence-weighted linear classification , 2008, ICML '08.
[2] Tom Schaul,et al. No more pesky learning rates , 2012, ICML.
[3] Malvina Nissim,et al. Exploiting Context for Biomedical Entity Recognition: From Syntax to the Web , 2004, NLPBA/BioNLP.
[4] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..
[5] D K Smith,et al. Numerical Optimization , 2001, J. Oper. Res. Soc..
[6] Daniel Jurafsky,et al. A Conditional Random Field Word Segmenter for Sighan Bakeoff 2005 , 2005, IJCNLP.
[7] Alexander J. Smola,et al. Parallelized Stochastic Gradient Descent , 2010, NIPS.
[8] Xu Sun,et al. Latent Structured Perceptrons for Large-Scale Learning with Hidden Information , 2013, IEEE Transactions on Knowledge and Data Engineering.
[9] Xu Sun,et al. A Discriminative Latent Variable Chinese Segmenter with Hybrid Word/Character Information , 2009, HLT-NAACL.
[10] Hai Zhao,et al. Integrating unsupervised and supervised word segmentation: The role of goodness measures , 2011, Inf. Sci..
[11] Mark W. Schmidt,et al. Accelerated training of conditional random fields with stochastic gradient methods , 2006, ICML.
[12] Xu Sun,et al. Probabilistic Chinese word segmentation with non-local information and stochastic training , 2013, Inf. Process. Manag..
[13] Xu Sun,et al. Latent Variable Perceptron Algorithm for Structured Classification , 2009, IJCAI.
[14] Noboru Murata,et al. A Statistical Study on On-line Learning , 1999 .
[15] Sophia Ananiadou,et al. Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.
[16] Yuh-Jye Lee,et al. Periodic step-size adaptation in second-order gradient descent for single-pass on-line structured learning , 2009, Machine Learning.
[17] John Blitzer,et al. Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.
[18] Stanley F. Chen,et al. A Gaussian Prior for Smoothing Maximum Entropy Models , 1999 .
[19] Alessandro Sperduti,et al. Speed up learning and network optimization with extended back propagation , 1993, Neural Networks.
[20] Yoav Freund,et al. Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.
[21] Nigel Collier,et al. Introduction to the Bio-entity Recognition Task at JNLPBA , 2004, NLPBA/BioNLP.
[22] Koby Crammer,et al. Flexible Text Segmentation with Structured Multilabel Classification , 2005, HLT.
[23] Xu Sun,et al. Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference , 2008, COLING.
[24] Weiwei Sun. Word-based and Character-based Word Segmentation Models: Comparison and Combination , 2010, COLING.
[25] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.
[26] Robert A. Jacobs,et al. Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.
[27] Jun'ichi Tsujii,et al. Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition , 2006, ACL.
[28] Adwait Ratnaparkhi,et al. A Maximum Entropy Model for Part-Of-Speech Tagging , 1996, EMNLP.
[29] Burr Settles,et al. Biomedical Named Entity Recognition using Conditional Random Fields and Rich Feature Sets , 2004, NLPBA/BioNLP.
[30] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[31] Matthew J. Streeter,et al. Adaptive Bound Optimization for Online Convex Optimization , 2010, COLT 2010.
[32] Koby Crammer,et al. Adaptive regularization of weight vectors , 2009, Machine Learning.
[33] Yuji Matsumoto,et al. Chunking with Support Vector Machines , 2001, NAACL.
[34] Sabine Buchholz,et al. Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.
[35] Stephen Clark,et al. Chinese Segmentation with a Word-Based Perceptron Algorithm , 2007, ACL.
[36] Xu Sun,et al. Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection , 2012, ACL.
[37] Eiichiro Sumita,et al. Subword-based Tagging by Conditional Random Fields for Chinese Word Segmentation , 2006, NAACL.
[38] Michael Collins,et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.
[39] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[40] Hai Zhao,et al. A Unified Character-Based Tagging Framework for Chinese Word Segmentation , 2010, TALIP.
[41] Jianfeng Gao,et al. A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing , 2007, ACL.
[42] Pierre Priouret,et al. Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.