Multi-label classification via incremental clustering on an evolving data stream

With the advancement of storage and processing technology, an enormous amount of data is collected on a daily basis in many applications. Nowadays, advanced data analytics have been used to mine the collected data for useful information and make predictions, contributing to the competitive advantages of companies. The increasing data volume, however, has posed many problems to classical batch learning systems, such as the need to retrain the model completely with the newly arrived samples or the impracticality of storing and accessing a large volume of data. This has prompted interest on incremental learning that operates on data streams. In this study, we develop an incremental online multi-label classification (OMLC) method based on a weighted clustering model. The model is made to adapt to the change of data via the decay mechanism in which each sample's weight dwindles away over time. The clustering model therefore always focuses more on newly arrived samples. In the classification process, only clusters whose weights are greater than a threshold (called mature clusters) are employed to assign labels for the samples. In our method, not only is the clustering model incrementally maintained with the revealed ground truth labels of the arrived samples, the number of predicted labels in a sample are also adjusted based on the Hoeffding inequality and the label cardinality. The experimental results show that our method is competitive compared to several well-known benchmark algorithms on six performance measures in both the stationary and the concept drift settings.

[1]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[2]  Xin Li,et al.  Active Learning with Multi-Label SVM Classification , 2013, IJCAI.

[3]  Grigorios Tsoumakas,et al.  Dealing with Concept Drift and Class Imbalance in Multi-Label Stream Classification , 2011, IJCAI.

[4]  Alan Wee-Chung Liew,et al.  Heterogeneous classifier ensemble with fuzzy rule-based meta learner , 2018, Inf. Sci..

[5]  Hong Shen,et al.  Weighted Ensemble Classification of Multi-label Data Streams , 2017, PAKDD.

[6]  Jianhua Xu,et al.  Multi-Label Weighted k-Nearest Neighbor Classifier with Adaptive Weight Estimation , 2011, ICONIP.

[7]  Saso Dzeroski,et al.  Decision trees for hierarchical multi-label classification , 2008, Machine Learning.

[8]  Hsuan-Tien Lin,et al.  Feature-aware Label Space Dimension Reduction for Multi-label Classification , 2012, NIPS.

[9]  Hai Zhao,et al.  Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[10]  Grigorios Tsoumakas,et al.  An Empirical Comparison of Methods for Multi-label Data Stream Classification , 2016, INNS Conference on Big Data.

[11]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[12]  Alan Wee-Chung Liew,et al.  A Novel Bayesian Framework for Online Imbalanced Learning , 2017, 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[13]  Andrew McCallum,et al.  Collective multi-label classification , 2005, CIKM '05.

[14]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[15]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[16]  Geoff Holmes,et al.  Scalable and efficient multi-label classification for evolving data streams , 2012, Machine Learning.

[17]  Alan Wee-Chung Liew,et al.  Learning from Data Stream Based on Random Projection and Hoeffding Tree Classifier , 2017, 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[18]  Qinghua Hu,et al.  Multi-label feature selection with streaming labels , 2016, Inf. Sci..

[19]  Eyke Hüllermeier,et al.  Bayes Optimal Multilabel Classification via Probabilistic Classifier Chains , 2010, ICML.

[20]  Arya Mazumdar,et al.  Multilabel Classification with Group Testing and Codes , 2017, ICML.

[21]  Johannes Fürnkranz,et al.  Maximizing Subset Accuracy with Recurrent Neural Networks in Multi-label Classification , 2017, NIPS.

[22]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[23]  Jesse Read,et al.  Scalable Multi-label Classification , 2010 .

[24]  Alan Wee-Chung Liew,et al.  Variational inference based bayes online classifiers with concept drift adaptation , 2018, Pattern Recognit..

[25]  Luca Martino,et al.  Scalable multi-output label prediction: From classifier chains to classifier trellises , 2015, Pattern Recognit..

[26]  Luca Martino,et al.  Efficient monte carlo methods for multi-dimensional learning with classifier chains , 2012, Pattern Recognit..

[27]  Charles Elkan,et al.  Beam search algorithms for multilabel learning , 2013, Machine Learning.

[28]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[29]  Yang Zhang,et al.  Mining Multi-label Concept-Drifting Data Streams Using Dynamic Classifier Ensemble , 2009, ACML.

[30]  Geoff Holmes,et al.  Multi-label Classification Using Ensembles of Pruned Sets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[31]  Yuhong Guo,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Multi-Label Classification Using Conditional Dependency Networks , 2022 .

[32]  Aoying Zhou,et al.  Density-Based Clustering over an Evolving Data Stream with Noise , 2006, SDM.

[33]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[34]  Saso Dzeroski,et al.  Multi-label classification via multi-target regression on data streams , 2016, Machine Learning.

[35]  Dae-Won Kim,et al.  SCLS: Multi-label feature selection based on scalable criterion for large label set , 2017, Pattern Recognit..

[36]  Guoyong Cai,et al.  Efficient class incremental learning for multi-label classification of evolving data streams , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[37]  Ricard Gavaldà,et al.  Adaptive Learning from Evolving Data Streams , 2009, IDA.

[38]  Ricard Gavaldà,et al.  Learning from Time-Changing Data with Adaptive Windowing , 2007, SDM.

[39]  Ashish Kapoor,et al.  Multilabel Classification using Bayesian Compressed Sensing , 2012, NIPS.