A Label Embedding Method for Multi-label Classification via Exploiting Local Label Correlations

Multi-label learning has attracted more attention recently due to many real-world applications (e.g., text categorization and scene annotation). As the dimensionality of label space increases, it becomes more difficult to deal with this kind of applications. Therefore, dimensionality reduction techniques originally for feature space is also applied to label space, one of which is label embedding strategy which converts the high-dimensional label space into a low-dimensional reduced one. So far, existing label embedding methods mainly investigate the global recoverability between original labels and reduced labels, dependency between original features and reduced labels, or both. It is widely recognized that local label correlations could improve multi-label classification performance effectively. In this paper, we construct a trace ratio minimization problem as a novel label embedding criterion, which not only includes the global label recoverability and dependency, but also exploits the local label correlations as a local recoverability factor. Experiments on four benchmark data sets with more than 100 labels demonstrate that our proposed method is superior to four state-of-the-art techniques, according to two performance metrics for high-dimensional label space.

[1]  Hsuan-Tien Lin,et al.  Feature-aware Label Space Dimension Reduction for Multi-label Classification , 2012, NIPS.

[2]  John Langford,et al.  Multi-Label Prediction via Compressed Sensing , 2009, NIPS.

[3]  Kun Zhang,et al.  Multi-label learning by exploiting label dependency , 2010, KDD.

[4]  Zhi-Hong Mao,et al.  Multilabel Feature Extraction Algorithm via Maximizing Approximated and Symmetrized Normalized Cross-Covariance Operator , 2019, IEEE Transactions on Cybernetics.

[5]  Francisco Charte,et al.  LI-MLC: A Label Inference Methodology for Addressing High Dimensionality in the Label Space for Multilabel Classification , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Feiping Nie,et al.  Trace Ratio Problem Revisited , 2009, IEEE Transactions on Neural Networks.

[7]  Yang Yu,et al.  Binary Linear Compression for Multi-label Classification , 2017, IJCAI.

[8]  Stefan Kramer,et al.  Multi-label classification using boolean matrix decomposition , 2012, SAC '12.

[9]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[10]  Feiping Nie,et al.  Globally and Locally Consistent Unsupervised Projection , 2014, AAAI.

[11]  Bernhard Schölkopf,et al.  Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.

[12]  Krishnakumar Balasubramanian,et al.  The Landmark Selection Method for Multiple Output Prediction , 2012, ICML.

[13]  Lei Cao,et al.  A label compression coding approach through maximizing dependence between features and labels for multi-label classification , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[14]  Xindong Wu,et al.  Compressed labeling on distilled labelsets for multi-label learning , 2012, Machine Learning.

[15]  Lei Cao,et al.  A Non-linear Label Compression Coding Method Based on Five-Layer Auto-Encoder for Multi-label Classification , 2016, ICONIP.

[16]  Francisco Charte,et al.  Multilabel Classification: Problem Analysis, Metrics and Techniques , 2016 .

[17]  Manik Varma,et al.  Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications , 2016, KDD.

[18]  Jianmin Wang,et al.  Multi-label Classification via Feature-aware Implicit Label Space Encoding , 2014, ICML.

[19]  Ling Shao,et al.  End-to-End Feature-Aware Label Space Encoding for Multilabel Classification With Many Classes , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[21]  Hsuan-Tien Lin,et al.  Multilabel Classification with Principal Label Space Transformation , 2012, Neural Computation.

[22]  Xiao Li,et al.  Dependence maximization based label space dimension reduction for multi-label classification , 2015, Eng. Appl. Artif. Intell..