Active Learning by Querying Informative and Representative Examples

Active learning reduces the labeling cost by iteratively selecting the most valuable data to query their labels. It has attracted a lot of interests given the abundance of unlabeled data and the high cost of labeling. Most active learning approaches select either informative or representative unlabeled instances to query their labels, which could significantly limit their performance. Although several active learning algorithms were proposed to combine the two query selection criteria, they are usually ad hoc in finding unlabeled instances that are both informative and representative. We address this limitation by developing a principled approach, termed QUIRE, based on the min-max view of active learning. The proposed approach provides a systematic way for measuring and combining the informativeness and representativeness of an unlabeled instance. Further, by incorporating the correlation among labels, we extend the QUIRE approach to multi-label learning by actively querying instance-label pairs. Extensive experimental results show that the proposed QUIRE approach outperforms several state-of-the-art active learning approaches in both single-label and multi-label learning.

[1]  Rong Jin,et al.  Semi-supervised SVM batch mode active learning for image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Zhi-Hua Zhou,et al.  Multi-instance multi-label learning , 2008, Artif. Intell..

[3]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[4]  Arnold W. M. Smeulders,et al.  Active learning using pre-clustering , 2004, ICML.

[5]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[6]  Lei Wang,et al.  Multilabel SVM active learning for image classification , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[7]  Grigorios Tsoumakas,et al.  Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[8]  Chun-Liang Li,et al.  Active Learning with Hinted Support Vector Machine , 2012, ACML.

[9]  Klaus Brinker,et al.  On Active Learning in Multi-label Classification , 2005, GfKl.

[10]  Yuhong Guo,et al.  Active Instance Sampling via Matrix Partition , 2010, NIPS.

[11]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[12]  Paul N. Bennett,et al.  Dual Strategy Active Learning , 2007, ECML.

[13]  Naonori Ueda,et al.  Parametric Mixture Models for Multi-Labeled Text , 2002, NIPS.

[14]  Hsuan-Tien Lin,et al.  Multi-label Active Learning with Auxiliary Learner , 2011, ACML.

[15]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[16]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[17]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[18]  Jing Peng,et al.  SVM vs regularized least squares classification , 2004, ICPR 2004.

[19]  Xin Li,et al.  Active Learning with Multi-Label SVM Classification , 2013, IJCAI.

[20]  Sanjoy Dasgupta,et al.  Hierarchical sampling for active learning , 2008, ICML '08.

[21]  Jieping Ye,et al.  Querying discriminative and representative samples for batch mode active learning , 2013, KDD.

[22]  Andrea Esuli,et al.  Active Learning Strategies for Multi-Label Text Classification , 2009, ECIR.

[23]  Mohan Singh,et al.  Active Learning for Multi-Label Image Annotation , 2009 .

[24]  Sethuraman Panchanathan,et al.  Batch mode active sampling based on marginal probability distribution matching , 2012, TKDD.

[25]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[26]  Steven C. H. Hoi,et al.  PAMR: Passive aggressive mean reversion strategy for portfolio selection , 2012, Machine Learning.

[27]  Jieping Ye,et al.  Hypergraph spectral learning for multi-label classification , 2008, KDD.

[28]  Kun Zhang,et al.  Multi-label learning by exploiting label dependency , 2010, KDD.

[29]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[30]  Tibério S. Caetano,et al.  Submodular Multi-Label Learning , 2011, NIPS.

[31]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[32]  Yang Yu,et al.  Multi-label hypothesis reuse , 2012, KDD.

[33]  Yiming Yang,et al.  Introducing the Enron Corpus , 2004, CEAS.

[34]  Zheng Chen,et al.  Effective multi-label active learning for text classification , 2009, KDD.

[35]  Rong Yan,et al.  Model-shared subspace boosting for multi-label classification , 2007, KDD '07.

[36]  Dale Schuurmans,et al.  Discriminative Batch Mode Active Learning , 2007, NIPS.

[37]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[38]  Xiaowei Xu,et al.  Representative Sampling for Text Classification Using Support Vector Machines , 2003, ECIR.

[39]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[40]  Shlomo Argamon,et al.  Committee-Based Sampling For Training Probabilistic Classi(cid:12)ers , 1995 .

[41]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[42]  Daphne Koller,et al.  Support Vector Machine Active Learning with Application sto Text Classification , 2000, ICML.

[43]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[44]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[45]  Maria-Florina Balcan,et al.  Margin Based Active Learning , 2007, COLT.

[46]  Jinbo Bi,et al.  Active learning via transductive experimental design , 2006, ICML.

[47]  Grigorios Tsoumakas,et al.  Correlation-Based Pruning of Stacked Binary Relevance Models for Multi-Label Learning , 2009 .

[48]  Zhi-Hua Zhou,et al.  Active Query Driven by Uncertainty and Diversity for Incremental Multi-label Learning , 2013, 2013 IEEE 13th International Conference on Data Mining.

[49]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.

[50]  Michael I. Jordan,et al.  Robust design of biological experiments , 2005, NIPS.

[51]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[52]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[53]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[54]  Xian-Sheng Hua,et al.  Two-Dimensional Active Learning for image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.