Upper and Lower Error Bounds for Active Learning

This paper analyzes the potential advantages and theoretical challenges of “active learning” algorithms. Active learning involves sequential, adaptive sampling procedures that use information gleaned from previous samples in order to focus the sampling and accelerate the learning process relative to “passive learning” algorithms, which are based on non-adaptive (usually random) samples. There are a number of empirical and theoretical results suggesting that in certain situations active learning can be significantly more effective than passive learning. However, the fact that active learning algorithms are feedback systems makes their theoretical analysis very challenging. It is known that active learning can provably improve on passive learning if the error or noise rate of the sampling process is bounded. However, if the noise rate is unbounded, perhaps the situation most common in practice, then no previously existing theory establishes whether or not active learning offers an advantage. To study this issue, we investigate the basic problem of learning a threshold function from noisy observations. We present an algorithm that provably improves on passive learning, even when the noise is unbounded. Moreover, we derive a minimax lower bound for this learning problem, demonstrating that our proposed active learning algorithm converges at a near-optimal rate.
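To make the active/passive contrast concrete, the following is a minimal sketch of the general idea in the one-dimensional threshold setting: an adaptive sampler that bisects the interval, requesting several noisy labels at each query point and keeping the majority vote, versus a non-adaptive baseline that samples uniformly at random and fits the best-separating threshold. This is an illustrative assumption, not the algorithm analyzed in the paper, and the majority-vote step implicitly assumes a noise rate bounded away from 1/2; the paper's contribution concerns the harder unbounded-noise regime. All function names and parameter values are hypothetical.

```python
import random


def noisy_label(x, threshold, flip_prob):
    """Return the true label 1{x >= threshold}, flipped with probability flip_prob."""
    label = 1 if x >= threshold else 0
    return 1 - label if random.random() < flip_prob else label


def active_threshold_estimate(query, lo=0.0, hi=1.0, rounds=20, repeats=15):
    """Adaptive (active) sampling: bisect the interval, requesting several
    noisy labels at each midpoint and keeping the majority vote."""
    for _ in range(rounds):
        mid = 0.5 * (lo + hi)
        votes = sum(query(mid) for _ in range(repeats))
        if 2 * votes >= repeats:   # majority label 1: threshold lies left of mid
            hi = mid
        else:                      # majority label 0: threshold lies right of mid
            lo = mid
    return 0.5 * (lo + hi)


def passive_threshold_estimate(query, n=300):
    """Non-adaptive (passive) baseline: sample uniformly at random and pick
    the candidate threshold with the fewest misclassified labels."""
    xs = sorted(random.random() for _ in range(n))
    data = [(x, query(x)) for x in xs]
    best_t, best_err = 0.5, float("inf")
    for t, _ in data:
        err = sum((x >= t) != y for x, y in data)
        if err < best_err:
            best_t, best_err = t, err
    return best_t


if __name__ == "__main__":
    true_threshold, flip_prob = 0.37, 0.2   # illustrative values
    q = lambda x: noisy_label(x, true_threshold, flip_prob)
    print("active estimate: ", active_threshold_estimate(q))
    print("passive estimate:", passive_threshold_estimate(q))
```

With the same total number of label requests, the adaptive sampler concentrates its queries near the suspected threshold, which is the source of the faster convergence rates studied in the paper; the passive baseline spreads its samples uniformly regardless of what it has already observed.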