Toward a General Theory of Online Selective Sampling: Trading Off Mistakes and Queries

While the literature on the theory of poolbased active learning has seen much progress in the past 15 years, and is now fairly mature, much less is known about its cousin problem: online selective sampling. In the stochastic online learning setting, there is a stream of iid data, and the learner is required to predict a label for each instance, and we are interested in the rate of growth of the number of mistakes the learner makes. In the selective sampling variant of this problem, after each prediction, the learner can optionally request to observe the true classification of the point. This introduces a trade-off between the number of these queries and the number of mistakes as a function of the number T of samples in the sequence. This work explores various properties of the optimal trade-off curve, both abstractly (for general VC classes), and more-concretely for several constructed examples that expose important properties of the trade-off.

[1]  Liu Yang,et al.  Minimax Analysis of Active Learning , 2014, J. Mach. Learn. Res..

[2]  Gábor Lugosi,et al.  Strong minimax lower bounds for learning , 1996, COLT '96.

[3]  Steve Hanneke,et al.  The Optimal Sample Complexity of PAC Learning , 2015, J. Mach. Learn. Res..

[4]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[5]  Maria-Florina Balcan,et al.  Margin Based Active Learning , 2007, COLT.

[6]  Sanjoy Dasgupta,et al.  Coarse sample complexity bounds for active learning , 2005, NIPS.

[7]  Varun Kanade,et al.  Learning with a Drifting Target Concept , 2015, ALT.

[8]  Steve Hanneke,et al.  Refined Error Bounds for Several Learning Algorithms , 2015, J. Mach. Learn. Res..

[9]  Steve Hanneke Rates of convergence in active learning , 2011, 1103.1790.

[10]  Ran El-Yaniv,et al.  A compression technique for analyzing disagreement-based active learning , 2014, J. Mach. Learn. Res..

[11]  Liu Yang,et al.  Active Learning with a Drifting Distribution , 2011, NIPS.

[12]  David Haussler,et al.  Learnability and the Vapnik-Chervonenkis dimension , 1989, JACM.

[13]  Steve Hanneke,et al.  Theory of Disagreement-Based Active Learning , 2014, Found. Trends Mach. Learn..

[14]  David Haussler,et al.  Predicting {0,1}-functions on randomly drawn points , 1988, COLT '88.

[15]  Claudio Gentile,et al.  Selective sampling and active learning from single and multiple teachers , 2012, J. Mach. Learn. Res..

[16]  HannekeSteve,et al.  The true sample complexity of active learning , 2010 .

[17]  Maria-Florina Balcan,et al.  The true sample complexity of active learning , 2010, Machine Learning.

[18]  Kamalika Chaudhuri,et al.  Beyond Disagreement-Based Agnostic Active Learning , 2014, NIPS.

[19]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[20]  HausslerDavid,et al.  A general lower bound on the number of examples needed for learning , 1989 .

[21]  Steve Hanneke,et al.  A bound on the label complexity of agnostic active learning , 2007, ICML '07.

[22]  Sivan Sabato,et al.  Interactive Algorithms: Pool, Stream and Precognitive Stream , 2017, J. Mach. Learn. Res..

[23]  Dale Schuurmans,et al.  Characterizing Rational Versus Exponential learning Curves , 1995, J. Comput. Syst. Sci..

[24]  Steve Hanneke,et al.  Activized Learning: Transforming Passive to Active with Improved Label Complexity , 2011, J. Mach. Learn. Res..

[25]  Steve Hanneke,et al.  Theoretical foundations of active learning , 2009 .

[26]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[27]  Adam Tauman Kalai,et al.  Analysis of Perceptron-Based Active Learning , 2009, COLT.

[28]  Leslie G. Valiant,et al.  A general lower bound on the number of examples needed for learning , 1988, COLT '88.

[29]  Claudio Gentile,et al.  Worst-Case Analysis of Selective Sampling for Linear Classification , 2006, J. Mach. Learn. Res..