Active Batch Selection via Convex Relaxations with Guaranteed Solution Bounds

Active learning techniques have gained popularity to reduce human effort in labeling data instances for inducing a classifier. When faced with large amounts of unlabeled data, such algorithms automatically identify the exemplar instances for manual annotation. More recently, there have been attempts towards a batch mode form of active learning, where a batch of data points is simultaneously selected from an unlabeled set. In this paper, we propose two novel batch mode active learning (BMAL) algorithms: BatchRank and BatchRand. We first formulate the batch selection task as an NP-hard optimization problem; we then propose two convex relaxations, one based on linear programming and the other based on semi-definite programming to solve the batch selection problem. Finally, a deterministic bound is derived on the solution quality for the first relaxation and a probabilistic bound for the second. To the best of our knowledge, this is the first research effort to derive mathematical guarantees on the solution quality of the BMAL problem. Our extensive empirical studies on 15 binary, multi-class and multi-label challenging datasets corroborate that the proposed algorithms perform at par with the state-of-the-art techniques, deliver high quality solutions and are robust to real-world issues like label noise and class imbalance.

[1]  Gwen Littlewort,et al.  Dynamics of Facial Expression Extracted Automatically from Video , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[2]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[3]  Peter Robinson,et al.  Mind reading machines: automated inference of cognitive mental states from video , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[4]  Greg Schohn,et al.  Less is More: Active Learning with Support Vector Machines , 2000, ICML.

[5]  Qin Jin,et al.  Multi-modal Person Identification in a Smart Environment , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Lise Getoor,et al.  Active Learning for Networked Data , 2010, ICML.

[7]  Xiao-Tong Yuan,et al.  Truncated power method for sparse eigenvalue problems , 2011, J. Mach. Learn. Res..

[8]  Mohan Singh,et al.  Active Learning for Multi-Label Image Annotation , 2009 .

[9]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[10]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[11]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[12]  Klaus Brinker,et al.  Incorporating Diversity in Active Learning with Support Vector Machines , 2003, ICML.

[13]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[14]  Russell Greiner,et al.  Optimistic Active-Learning Using Mutual Information , 2007, IJCAI.

[15]  Dale Schuurmans,et al.  Discriminative Batch Mode Active Learning , 2007, NIPS.

[16]  Maja Pantic,et al.  Web-based database for facial expression analysis , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[17]  Xiaowei Xu,et al.  Representative Sampling for Text Classification Using Support Vector Machines , 2003, ECIR.

[18]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[19]  Sethuraman Panchanathan,et al.  Generalized Query by Transduction for online active learning , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[20]  Maria-Florina Balcan,et al.  The true sample complexity of active learning , 2010, Machine Learning.

[21]  Rong Jin,et al.  Batch Mode Active Learning with Applications to Text Categorization and Image Retrieval , 2009, IEEE Transactions on Knowledge and Data Engineering.

[22]  Rong Jin,et al.  Semi-supervised SVM batch mode active learning for image retrieval , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Sethuraman Panchanathan,et al.  Optimal batch selection for active learning in multi-label classification , 2011, ACM Multimedia.

[24]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[25]  Ran El-Yaniv,et al.  Online Choice of Active Learning Algorithms , 2003, J. Mach. Learn. Res..

[26]  Yiming Yang,et al.  Active Learning for Multi-Task Adaptive Filtering , 2010, ICML.

[27]  Steve Hanneke,et al.  A bound on the label complexity of agnostic active learning , 2007, ICML '07.

[28]  Rong Jin,et al.  Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Rong Jin,et al.  Large-scale text categorization by batch mode active learning , 2006, WWW '06.

[30]  Kun Zhang,et al.  Multi-label learning by exploiting label dependency , 2010, KDD.

[31]  Matjaz Kukar,et al.  Transductive reliability estimation for medical diagnosis , 2003, Artif. Intell. Medicine.

[32]  Klaus Brinker,et al.  On Active Learning in Multi-label Classification , 2005, GfKl.

[33]  Rong Jin,et al.  Batch mode active learning and its application to medical image classification , 2006, ICML.

[34]  Yuhong Guo,et al.  Active Instance Sampling via Matrix Partition , 2010, NIPS.

[35]  Chi-Ho Chan,et al.  On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation , 2010, ICPR Contests.

[36]  John Langford,et al.  Importance weighted active learning , 2008, ICML '09.

[37]  Prasad Tadepalli,et al.  Active Learning with Committees for Text Categorization , 1997, AAAI/IAAI.

[38]  Sethuraman Panchanathan,et al.  Generalized batch mode active learning for face-based biometric recognition , 2013, Pattern Recognit..

[39]  Claire Monteleoni,et al.  Practical Online Active Learning for Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Conrad Sanderson,et al.  Biometric Person Recognition: Face, Speech and Fusion , 2008 .

[41]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[42]  Harry Wechsler,et al.  Query by Transduction , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.