Active Learning for Multi-Class Logistic Regression

Which of the many proposed methods for active learning can we expect to perform well when learning logistic regression classifiers? In this article, we evaluate different approaches in order to identify suitable practices. Among our contributions, we test several explicit objective functions for active learning: an empirical evaluation that has been lacking in the literature until now. We develop a theoretical framework, motivated by work in optimal experimental design, for applying different loss functions. Empirical investigations demonstrate the benefits of our variance reduction method, which achieves attractive classification accuracy and matches or beats random sampling in all evaluations. Among the alternative heuristic approaches, we identify margin sampling as a method with promising performance and little computational overhead.
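As a concrete illustration of the margin sampling heuristic mentioned above, the sketch below selects the unlabeled pool example whose two most probable classes under a multi-class logistic regression model are closest in probability. The weight matrix `W`, the helper names, and the absence of a bias term are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the class axis.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def margin_sample(W, X_pool):
    """Return the index of the pool point with the smallest margin
    between its two most probable classes.

    W      : (k, d) weight matrix of a k-class logistic regression model
    X_pool : (n, d) unlabeled candidate pool
    """
    P = softmax(X_pool @ W.T)            # (n, k) class probabilities
    top2 = np.sort(P, axis=1)[:, -2:]    # two largest probabilities per row
    margins = top2[:, 1] - top2[:, 0]    # small margin = uncertain prediction
    return int(np.argmin(margins))
```

The queried point is then labeled by the oracle and added to the training set before the model is refit, which is what keeps the per-query cost low compared with objective-based methods such as variance reduction.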
