Active Learning for Regression Based on Query by Committee

We investigate a committee-based approach for active learning of real-valued functions. This is a variance-only strategy for selection of informative training data. As such it is shown to suffer when the model class is misspecified since the learner's bias is high. Conversely, the strategy outperforms passive selection when the model class is very expressive since active minimization of the variance avoids overfitting.

[1]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[2]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[3]  Leonard G. C. Hamey,et al.  Minimisation of data collection by active learning , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[4]  David A. Cohn,et al.  Neural Network Exploration Using Optimal Experiment Design , 1993, NIPS.

[5]  Michael Lindenbaum,et al.  Selective Sampling for Nearest Neighbor Classifiers , 1999, Machine Learning.

[6]  D. Rubinfeld,et al.  Hedonic housing prices and the demand for clean air , 1978 .

[7]  Tsuhan Chen,et al.  An active learning framework for content-based information retrieval , 2002, IEEE Trans. Multim..

[8]  Naoki Abe,et al.  Query Learning Strategies Using Boosting and Bagging , 1998, ICML.

[9]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[10]  M. H. Quenouille Approximate Tests of Correlation in Time‐Series , 1949 .

[11]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[12]  Leonard G. C. Hamey,et al.  Active Learning for Nonlinear System Identification and Control , 1996 .

[13]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[14]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[15]  Leonard G. C. Hamey,et al.  Accurate modelling with minimised data collection - an active learning algorithm , 1996 .

[16]  Masashi Sugiyama,et al.  Active Learning in Approximately Linear Regression Based on Conditional Expectation of Generalization Error , 2006, J. Mach. Learn. Res..

[17]  Tirthankar Raychaudhuri,et al.  Cost-effective Querying Leading to Dual Control Cost-eeective Querying Leading to Dual Control , 1996 .

[18]  D. Wiens Robust weights and designs for biased regression models: Least squares and generalized M-estimation , 2000 .