Exploration-based Active Machine Learning Exploration-based Active Machine Learning

Active machine learning algorithms are used when large numbers of unlabeled examples are available and getting labels for them is costly (e.g. requiring consulting a human expert). Many conventional active learning algorithms focus on refining the decision boundary, at the expense of exploring new regions that the current hypothesis misclassifies. We propose a new active learning algorithm that balances such exploration with refining of the decision boundary by dynamically adjusting the probability to explore at each step. Our experimental results demonstrate improved performance on data sets that require extensive exploration while remaining competitive on data sets that do not. Our algorithm also shows significant tolerance of noise.

[1]  Klaus Brinker,et al.  Incorporating Diversity in Active Learning with Support Vector Machines , 2003, ICML.

[2]  Nello Cristianini,et al.  Support vector and kernel machines , 2001 .

[3]  Ran El-Yaniv,et al.  Online Choice of Active Learning Algorithms , 2003, J. Mach. Learn. Res..

[4]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[5]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[6]  Michael Lindenbaum,et al.  Selective Sampling for Nearest Neighbor Classifiers , 1999, Machine Learning.

[7]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[8]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[9]  Arnold W. M. Smeulders,et al.  Active learning using pre-clustering , 2004, ICML.

[10]  C. A. Murthy,et al.  A probabilistic active support vector learning algorithm , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Jong-Min Park,et al.  Convergence and application of online active sampling using orthogonal pillar vectors , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[13]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[14]  Tong Zhang,et al.  Active learning using adaptive resampling , 2000, KDD '00.

[15]  Dana Angluin,et al.  Queries and concept learning , 1988, Machine Learning.

[16]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[17]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[18]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[19]  Gökhan Tür,et al.  Unsupervised and active learning in automatic speech recognition for call classification , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[20]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[21]  Dilek Z. Hakkani-Tür,et al.  Active learning for automatic speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  Lawrence O. Hall,et al.  Active learning to recognize multiple types of plankton , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[23]  James M. Rehg,et al.  Active learning for automatic classification of software behavior , 2004, ISSTA '04.