BatchRank: A Novel Batch Mode Active Learning Framework for Hierarchical Classification

Active learning algorithms automatically identify the salient and exemplar instances from large amounts of unlabeled data and thus reduce the human annotation effort needed to induce a classification model. More recently, Batch Mode Active Learning (BMAL) techniques have been proposed, where a batch of data samples is selected simultaneously from an unlabeled set. Most active learning algorithms assume a flat label space, that is, they consider the class labels to be independent. However, in many applications, the class labels are organized in a hierarchical tree structure, with the leaf nodes as outputs and the internal nodes as clusters of outputs at multiple levels of granularity. In this paper, we propose a novel BMAL algorithm (BatchRank) for hierarchical classification. The sample selection is posed as an NP-hard integer quadratic programming problem and a convex relaxation (based on linear programming) is derived, whose solution is further improved by an iterative truncated power method. Finally, a deterministic bound is established on the quality of the solution. Our empirical results on several challenging real-world datasets from multiple domains corroborate the potential of the proposed framework for real-world hierarchical classification applications.
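
To make the refinement step concrete, the following is a minimal sketch of a truncated power iteration for choosing a batch of k samples, assuming the selection can be written as maximizing x^T Q x over binary vectors x with exactly k ones. The abstract does not define the objective matrix, so the symmetric matrix `Q`, the function name `truncated_power_select`, and the random starting point are all assumptions made for illustration; in the paper the starting point comes from the linear programming relaxation, which is omitted here.

```python
import numpy as np

def truncated_power_select(Q, k, iters=100, seed=0):
    """Approximately maximize x^T Q x over binary x with exactly k ones.

    Q is a hypothetical symmetric score matrix (an assumption; the
    abstract does not specify how the objective is built).
    """
    n = Q.shape[0]
    rng = np.random.default_rng(seed)

    # Shift Q to be positive semidefinite. With exactly k ones in x, this
    # adds the constant shift * k to the objective, so the maximizer is
    # unchanged, and each iteration below cannot decrease the objective.
    shift = max(0.0, -np.linalg.eigvalsh(Q).min())
    A = Q + shift * np.eye(n)

    # Random feasible start (the paper would start from the LP solution).
    x = np.zeros(n)
    x[rng.choice(n, size=k, replace=False)] = 1.0

    for _ in range(iters):
        y = A @ x                                # power step
        top = np.argsort(-y, kind="stable")[:k]  # truncate to k largest
        x_new = np.zeros(n)
        x_new[top] = 1.0
        if np.array_equal(x_new, x):             # fixed point reached
            break
        x = x_new

    return np.flatnonzero(x)                     # selected sample indices
```

The iteration only reaches a local optimum of the integer program, which is why the quality of the starting point matters.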
