Efficient Monte Carlo Optimization for Multi-dimensional Classifier Chains

Multi-dimensional classification (MDC) is the supervised learning problem where an instance is associated with multiple class variables, rather than with a single class as in traditional binary or multi-class single-dimensional classification (SDC) problems. MDC is closely related to multi-task learning and to multi-target learning (in the literature, multi-target generally refers to the regression case). Modeling dependencies between labels allows MDC methods to improve their performance, at the expense of an increased computational cost. In this paper we focus on the classifier chains (CC) approach for modeling dependencies. On the one hand, the original CC algorithm makes a greedy approximation: it is fast but tends to propagate errors down the chain. On the other hand, a recent Bayes-optimal method improves performance but is computationally intractable in practice. Here we present novel Monte Carlo schemes, both for finding a good chain sequence and for performing efficient inference. Our algorithms remain tractable for high-dimensional data sets and obtain the best overall accuracy, as shown on several real data sets.
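As an illustrative sketch only (not the paper's exact algorithm), Monte Carlo inference over a trained classifier chain can be pictured as sampling label vectors dimension by dimension and keeping the most probable draw. The sketch below assumes per-dimension probabilistic classifiers with a scikit-learn-style `predict_proba` API, integer-coded class values, and hypothetical helper names (`sample_chain`, `mc_inference`).

```python
# Sketch: Monte Carlo inference for a classifier chain (assumptions noted above).
import numpy as np

def sample_chain(classifiers, x, rng):
    """Draw one label vector y ~ p(y_1|x) p(y_2|x,y_1) ... and its log-probability."""
    y, log_p = [], 0.0
    for clf in classifiers:
        # Features for dimension j: x augmented with the labels sampled so far.
        xj = np.concatenate([x, y]).reshape(1, -1)
        probs = clf.predict_proba(xj)[0]
        k = rng.choice(len(probs), p=probs)   # sample a class for this dimension
        y.append(clf.classes_[k])
        log_p += np.log(probs[k] + 1e-12)
    return np.array(y), log_p

def mc_inference(classifiers, x, n_samples=100, seed=0):
    """Approximate the MAP label vector by keeping the highest-probability sample."""
    rng = np.random.default_rng(seed)
    best_y, best_lp = None, -np.inf
    for _ in range(n_samples):
        y, lp = sample_chain(classifiers, x, rng)
        if lp > best_lp:
            best_y, best_lp = y, lp
    return best_y
```

Such sampling avoids both the greedy single pass of the original CC algorithm and the exhaustive enumeration required by Bayes-optimal inference, at a cost that grows only linearly with the number of samples.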