Active Learning Using Hint Information

The abundance of real-world data and limited labeling budget calls for active learning, an important learning paradigm for reducing human labeling efforts. Many recently developed active learning algorithms consider both uncertainty and representativeness when making querying decisions. However, exploiting representativeness with uncertainty concurrently usually requires tackling sophisticated and challenging learning tasks, such as clustering. In this letter, we propose a new active learning framework, called hinted sampling, which takes both uncertainty and representativeness into account in a simpler way. We design a novel active learning algorithm within the hinted sampling framework with an extended support vector machine. Experimental results validate that the novel active learning algorithm can result in a better and more stable performance than that achieved by state-of-the-art algorithms. We also show that the hinted sampling framework allows improving another active learning algorithm designed from the transductive support vector machine.

[1]  J. Langford,et al.  The Epoch-Greedy algorithm for contextual multi-armed bandits , 2007, NIPS 2007.

[2]  Jiawei Han,et al.  Batch-Mode Active Learning via Error Bound Minimization , 2014, UAI.

[3]  Yi Yang,et al.  Interactive Video Indexing With Statistical Active Learning , 2012, IEEE Transactions on Multimedia.

[4]  Gunnar Rätsch,et al.  Soft Margins for AdaBoost , 2001, Machine Learning.

[5]  Sanjoy Dasgupta,et al.  Hierarchical sampling for active learning , 2008, ICML '08.

[6]  Jieping Ye,et al.  Querying discriminative and representative samples for batch mode active learning , 2013, KDD.

[7]  John Langford,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[8]  Shuicheng Yan,et al.  Active learning with adaptive regularization , 2011, Pattern Recognit..

[9]  Ayhan Demiriz,et al.  Semi-Supervised Support Vector Machines , 1998, NIPS.

[10]  Sanjoy Dasgupta,et al.  Two faces of active learning , 2011, Theor. Comput. Sci..

[11]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[12]  Daphne Koller,et al.  Support Vector Machine Active Learning with Application sto Text Classification , 2000, ICML.

[13]  Xiaoli Li,et al.  Class Augmented Active Learning , 2014, SDM.

[14]  Paul N. Bennett,et al.  Dual Strategy Active Learning , 2007, ECML.

[15]  Xiaowei Xu,et al.  Representative Sampling for Text Classification Using Support Vector Machines , 2003, ECIR.

[16]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[17]  Rong Jin,et al.  Semi-supervised SVM batch mode active learning for image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[19]  Chun-Liang Li,et al.  Active Learning with Hinted Support Vector Machine , 2012, ACML.

[20]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[21]  Raymond J. Mooney,et al.  Diverse ensembles for active learning , 2004, ICML.

[22]  Yaser S. Abu-Mostafa,et al.  Hints , 2018, Neural Computation.

[23]  Russell Greiner,et al.  Optimistic Active-Learning Using Mutual Information , 2007, IJCAI.

[24]  John Langford,et al.  Agnostic Active Learning Without Constraints , 2010, NIPS.

[25]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[26]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[27]  Mark Craven,et al.  An Analysis of Active Learning Strategies for Sequence Labeling Tasks , 2008, EMNLP.

[28]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[29]  Rong Jin,et al.  Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Arnold W. M. Smeulders,et al.  Active learning using pre-clustering , 2004, ICML.

[31]  Sanjoy Dasgupta,et al.  A General Agnostic Active Learning Algorithm , 2007, ISAIM.

[32]  Maria-Florina Balcan,et al.  Agnostic active learning , 2006, J. Comput. Syst. Sci..

[33]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[34]  Steve Hanneke,et al.  Teaching Dimension and the Complexity of Active Learning , 2007, COLT.

[35]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .