Active Multi-Instance Multi-Label Learning

Multi-instance multi-label learning (MIML), introduced by Zhou and Zhang, is a comparatively new framework in machine learning with two special characteristics: first, each example is represented by a set of feature vectors (a bag of instances), and second, a bag of instances may belong to several classes at once (a multi-label). Thus, an MIML classifier receives a bag of instances and produces a multi-label, and the training set has the same MIML structure. Labeling a data set is always cost-intensive, especially in an MIML framework. To reduce labeling costs, it is important to organize the annotation process so that the most informative examples are labeled first and less informative or non-informative examples are left to the end of the annotation phase. Active learning is one approach to this kind of problem. In this work we focus on the MIMLSVM algorithm in combination with the k-Medoids clustering algorithm, which transforms the multi-instance representation into a single-instance representation. For the clustering distance measure we consider variants of the Hausdorff distance, namely the median-based and average-based Hausdorff distances. Finally, active learning strategies derived from the single-instance scenario are investigated in the MIML setting and evaluated on a benchmark data set.
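
The transformation from bags to single vectors can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation evaluated in this work: the function names are hypothetical, instances are compared with the Euclidean distance, the average-based variant follows the commonly used definition (sum of nearest-neighbor distances in both directions divided by the total number of instances), and the median-based variant is assumed to replace the inner maximum of the classical Hausdorff distance with a median. The medoid bags are taken as given, for example from a k-Medoids run over the training bags.

```python
import numpy as np

def _nearest(a, b):
    """Distance from each instance in bag a to its nearest instance in bag b."""
    diff = a[:, None, :] - b[None, :, :]          # pairwise differences
    return np.sqrt((diff ** 2).sum(axis=2)).min(axis=1)

def average_hausdorff(a, b):
    """Average-based Hausdorff distance (assumed definition, see lead-in)."""
    return (_nearest(a, b).sum() + _nearest(b, a).sum()) / (len(a) + len(b))

def median_hausdorff(a, b):
    """Median-based Hausdorff distance (assumed definition, see lead-in)."""
    return max(np.median(_nearest(a, b)), np.median(_nearest(b, a)))

def bags_to_vectors(bags, medoids, dist=average_hausdorff):
    """Represent each bag by its distances to the k medoid bags."""
    return np.array([[dist(bag, m) for m in medoids] for bag in bags])

# Two toy bags of 2-D instances (illustrative data only).
bag_a = np.array([[0.0, 0.0], [1.0, 0.0]])
bag_b = np.array([[0.0, 0.0], [3.0, 0.0]])
features = bags_to_vectors([bag_a, bag_b], medoids=[bag_a])
```

Each row of `features` is an ordinary fixed-length vector, so any single-instance multi-label learner, and any active learning strategy built on it, can be applied to the transformed data.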
