ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification

Multi-label classification deals with the problem where each example is associated with multiple class labels. Since the labels are often dependent to other labels, exploiting label dependencies can significantly improve the multi-label classification performance. The label dependency in existing studies is often given as prior knowledge or learned from the labels only. However, in many real applications, such prior knowledge may not be available, or labeled information might be very limited. In this paper, we propose a new algorithm, called Ml-Forest , to learn an ensemble of hierarchical multi-label classifier trees to reveal the intrinsic label dependencies. In Ml-Forest, we construct a set of hierarchical trees, and develop a label transfer mechanism to identify the multiple relevant labels in a hierarchical way. In general, the relevant labels at higher levels of the trees capture more discriminable label concepts, and they will be transferred into lower level children nodes that are harder to discriminate. The relevant labels in the hierarchy are then aggregated to compute label dependency and make the final prediction. Our empirical study shows encouraging results of the proposed algorithm in comparison with the state-of-the-art multi-label classification algorithms under Friedman test and post-hoc Nemenyi test.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Yong Luo,et al.  Low-Rank Multi-View Learning in Matrix Completion for Multi-Label Image Classification , 2015, AAAI.

[3]  许超,et al.  Low-Rank Multi-View Learning in Matrix Completion for Multi-Label Image Classification , 2015 .

[4]  K. Dembczynski,et al.  On Label Dependence in Multi-Label Classification , 2010 .

[5]  Remo Guidieri Res , 1995, RES: Anthropology and Aesthetics.

[6]  Jason Weston,et al.  Label Embedding Trees for Large Multi-Class Tasks , 2010, NIPS.

[7]  Dejan Gjorgjevikj,et al.  Hybrid Decision Tree Architecture Utilizing Local SVMs for Multi-Label Classification , 2012, HAIS.

[8]  Michael I. Jordan,et al.  Learning with Mixtures of Trees , 2001, J. Mach. Learn. Res..

[9]  Alexander C. Berg,et al.  Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition , 2011, NIPS.

[10]  Manik Varma,et al.  Multi-label learning with millions of labels: recommending advertiser bid phrases for web pages , 2013, WWW.

[11]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[12]  Guandong Xu,et al.  Learning Tree Structure of Label Dependency for Multi-label Learning , 2012, PAKDD.

[13]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[14]  Johannes Fürnkranz,et al.  Large-Scale Multi-label Text Classification - Revisiting Neural Networks , 2013, ECML/PKDD.

[15]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[16]  Jieping Ye,et al.  Extracting shared subspace for multi-label classification , 2008, KDD.

[17]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[18]  Zhi-Hua Zhou,et al.  Multi-instance multi-label learning , 2008, Artif. Intell..

[19]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[20]  Grigorios Tsoumakas,et al.  Effective and Efficient Multilabel Classification in Domains with Large Number of Labels , 2008 .

[21]  Saso Dzeroski,et al.  Tree ensembles for predicting structured outputs , 2013, Pattern Recognit..

[22]  Tao Mei,et al.  Correlative multi-label video annotation , 2007, ACM Multimedia.

[23]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[24]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[25]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[26]  Timothy N. Rubin,et al.  Statistical topic models for multi-label document classification , 2011, Machine Learning.

[27]  Yang Yu,et al.  Multi-label hypothesis reuse , 2012, KDD.

[28]  Luc De Raedt,et al.  Top-Down Induction of Clustering Trees , 1998, ICML.

[29]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[30]  K. Chou,et al.  iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types. , 2013, Analytical biochemistry.

[31]  Yihong Gong,et al.  Multi-labelled classification using maximum entropy method , 2005, SIGIR '05.

[32]  Eyke Hüllermeier,et al.  On label dependence in multilabel classification , 2010, ICML 2010.

[33]  Jesse Read,et al.  Multi-label Classification with Meta-Labels , 2014, 2014 IEEE International Conference on Data Mining.

[34]  Dirk Van,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[35]  James T. Kwok,et al.  MultiLabel Classification on Tree- and DAG-Structured Hierarchies , 2011, ICML.

[36]  Rémi Gilleron,et al.  Learning Multi-label Alternating Decision Trees from Texts and Data , 2003, MLDM.

[37]  Ivor W. Tsang,et al.  Core Vector Machines: Fast SVM Training on Very Large Data Sets , 2005, J. Mach. Learn. Res..

[38]  Eyke Hüllermeier,et al.  Combining instance-based learning and logistic regression for multilabel classification , 2009, Machine Learning.

[39]  Claudio Gentile,et al.  Hierarchical classification: combining Bayes with SVM , 2006, ICML.

[40]  Kuo-Chen Chou,et al.  Some remarks on predicting multi-label attributes in molecular biosystems. , 2013, Molecular bioSystems.

[41]  Saso Dzeroski,et al.  Two stage architecture for multi-label learning , 2012, Pattern Recognit..

[42]  Saso Dzeroski,et al.  Decision trees for hierarchical multi-label classification , 2008, Machine Learning.

[43]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[44]  Jieping Ye,et al.  Hypergraph spectral learning for multi-label classification , 2008, KDD.

[45]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[46]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[47]  Yang Zhang,et al.  An image-based multi-label human protein subcellular localization predictor (iLocator) reveals protein mislocalizations in cancer tissues , 2013, Bioinform..

[48]  Eyke Hüllermeier,et al.  Bayes Optimal Multilabel Classification via Probabilistic Classifier Chains , 2010, ICML.

[49]  Kun Zhang,et al.  Multi-label learning by exploiting label dependency , 2010, KDD.

[50]  Alexandre Bernardino,et al.  Matrix Completion for Weakly-Supervised Multi-Label Image Classification , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[52]  Christophe Moulin,et al.  MCut: A Thresholding Strategy for Multi-label Classification , 2012, IDA.

[53]  Yunming Ye,et al.  Stratified sampling for feature subspace selection in random forests for high dimensional data , 2013, Pattern Recognit..

[54]  Andreas Ziegler,et al.  On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data , 2010, Bioinform..

[55]  Haojie Li,et al.  Multi-Label Image Categorization With Sparse Factor Representation , 2014, IEEE Transactions on Image Processing.