A label compression coding approach through maximizing dependence between features and labels for multi-label classification

Label compression coding strategy aims at multi-label classification problems with high-dimensional and/or sparse label vectors. Without deteriorating classification performance significantly, its efficiency depends on two key aspects: coding raw binary label vectors into real or binary codewords shortly, and decoding binary label vectors from predicted codewords speedily, which reduce the computational costs in training and testing procedures respectively. In this paper, we propose a novel label compression coding method for multi-label classification, which maximizes dependence between features and labels using Hilbert-Schmidt independence criterion and thus considers both feature and label information simultaneously. Via solving an eigenvalue problem, our method results in a small-scale coding matrix and a fast decoding operation. The experiments on ten various bench-mark data sets illustrate that our proposed technique is superior to three existing approaches, including compressive sensing based method, principal label space transformation technique and its conditional version, according to five ranking-based and instance-based performance evaluation measures.

[1]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[2]  Jianmin Wang,et al.  Multi-label Classification via Feature-aware Implicit Label Space Encoding , 2014, ICML.

[3]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[4]  Ann Q. Gates,et al.  TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING , 2005 .

[5]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[6]  Jeff G. Schneider,et al.  Maximum Margin Output Coding , 2012, ICML.

[7]  Stefan Kramer,et al.  Multi-label classification using boolean matrix decomposition , 2012, SAC '12.

[8]  Hsuan-Tien Lin,et al.  Feature-aware Label Space Dimension Reduction for Multi-label Classification , 2012, NIPS.

[9]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[10]  ZhouZhi-Hua,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006 .

[11]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[12]  Grigorios Tsoumakas,et al.  Multi-label classification of music by emotion , 2011, EURASIP J. Audio Speech Music. Process..

[13]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[14]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[15]  Bernhard Schölkopf,et al.  Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.

[16]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[17]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[18]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[19]  Jeff G. Schneider,et al.  Multi-Label Output Codes using Canonical Correlation Analysis , 2011, AISTATS.

[20]  Jianhua Xu,et al.  An extended one-versus-rest support vector machine for multi-label classification , 2011, Neurocomputing.

[21]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[22]  Xindong Wu,et al.  Compressed labeling on distilled labelsets for multi-label learning , 2012, Machine Learning.

[23]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[24]  Jianhua Xu,et al.  Fast multi-label core vector machine , 2013, Pattern Recognit..

[25]  Sungyoung Lee,et al.  Compressive sensing: From theory to applications, a survey , 2013, Journal of Communications and Networks.

[26]  Hsuan-Tien Lin,et al.  Multilabel Classification with Principal Label Space Transformation , 2012, Neural Computation.

[27]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[28]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[29]  John Langford,et al.  Multi-Label Prediction via Compressed Sensing , 2009, NIPS.

[30]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.