Swarm Intelligence-Based Feature Selection for Multi-Label Classification: A Review

Multi-label classification is the process of specifying more than one class label for each instance. The high-dimensional data in various multi-label classification tasks have a direct impact on reducing the efficiency of traditional multi-label classifiers. To tackle this problem, feature selection is used as an effective approach to retain relevant features and eliminating redundant ones to reduce dimensionality. Multi-label classification has a wide range of real-world applications such as image classification, emotion analysis, text mining and bioinformatics. Moreover, in recent years researchers have focused on applying swarm intelligence methods in selecting prominent features of multi-label data. After reviewing various researches, it seems there are no researches that provide a review of swarm intelligence-based methods for multi-label feature selection. Thus, in this study, a comprehensive review of different swarm intelligence and evolutionary computing methods of feature selection presented for the tasks of multi-label classification. To this end, in this review, we have investigated most of the well-known and state-of-the-art methods and categorize them based on different perspectives. We then provided the main characteristics of the existing multi-label feature selection techniques and compared them analytically. We also introduce benchmarks, evaluation measures and standard datasets to facilitate research in this field. Moreover, we performed some experiments to compare existing works and at the end of this survey, some challenges, issues and open problems of this field are introduced to be considered by researchers in future.

[1]  Omar Ahmed,et al.  Gene Expression Classification Based on Deep Learning , 2019, 2019 4th Scientific International Conference Najaf (SICN).

[2]  Hossein Nezamabadi-pour,et al.  Multilabel feature selection: A comprehensive review and guiding experiments , 2018, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[3]  Parham Moradi,et al.  Relevance-redundancy feature selection based on ant colony optimization , 2015, Pattern Recognit..

[4]  Philip S. Yu,et al.  Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation , 2020, IEEE Transactions on Multimedia.

[5]  Diyar Qader Zeebaree,et al.  Significant features for steganography techniques using deoxyribonucleic acid: a review , 2021 .

[6]  Bahzad Charbuty,et al.  Classification Based on Decision Tree Algorithm for Machine Learning , 2021, Journal of Applied Science and Technology Trends.

[7]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[8]  Haza Nuzly Abdul Hamed,et al.  Improved Threshold Based and Trainable Fully Automated Segmentation for Breast Cancer Boundary and Pectoral Muscle in Mammogram Images , 2020, IEEE Access.

[9]  Hojat Ghimatgar,et al.  An improved feature selection algorithm based on graph clustering and ant colony optimization , 2018, Knowl. Based Syst..

[10]  Nawzat Sadiq Ahmed,et al.  Swarm Intelligence Algorithms in Gene Selection Profile Based on Classification of Microarray Data: A Review , 2021 .

[11]  Grigorios Tsoumakas,et al.  A systematic review of multi-label feature selection and a new method based on label construction , 2016, Neurocomputing.

[12]  Parham Moradi,et al.  A hybrid particle swarm optimization for feature subset selection by integrating a novel local search strategy , 2016, Appl. Soft Comput..

[13]  Giancarlo Fortino,et al.  Supervised Feature Selection Techniques in Network Intrusion Detection: a Critical Review , 2021, Eng. Appl. Artif. Intell..

[14]  Jianmin Zhao,et al.  Multi-label Feature Selection via Information Gain , 2014, ADMA.

[15]  Xiaoyan Sun,et al.  Multi-objective feature selection based on artificial bee colony: An acceleration approach with variable sample size , 2020, Appl. Soft Comput..

[16]  Vibha Vyas,et al.  Component-based face recognition under transfer learning for forensic applications , 2019, Inf. Sci..

[17]  Zhixin Sun,et al.  An Improved Feature Selection Algorithm Based on Ant Colony Optimization , 2018, IEEE Access.

[18]  Diyar Qader Zeebaree,et al.  A Comprehensive Review of Dimensionality Reduction Techniques for Feature Selection and Feature Extraction , 2020, Journal of Applied Science and Technology Trends.

[19]  A Fusion Scheme of Texture Features for COVID-19 Detection of CT Scan Images , 2020, 2020 International Conference on Advanced Science and Engineering (ICOASE).

[20]  Nada Almugren,et al.  A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification , 2019, IEEE Access.

[21]  Adnan Mohsin Abdulazeez,et al.  Data Mining Classification Techniques for Diabetes Prediction , 2021 .

[22]  Parham Moradi,et al.  Gene selection for microarray data classification using a novel ant colony optimization , 2015, Neurocomputing.

[23]  Vinod Kumar Jain,et al.  Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification , 2018, Appl. Soft Comput..

[24]  Verónica Bolón-Canedo,et al.  A review of feature selection methods in medical applications , 2019, Comput. Biol. Medicine.

[25]  Diyar Qader Zeebaree,et al.  Skin Lesions Classification Using Deep Learning Techniques: Review , 2021, Asian Journal of Research in Computer Science.

[26]  Mayyadah Ramiz Mahmood,et al.  Different Model for Hand Gesture Recognition with a Novel Line Feature Extraction , 2019, 2019 International Conference on Advanced Science and Engineering (ICOASE).

[27]  Dae-Won Kim,et al.  Feature selection for multi-label classification using multivariate mutual information , 2013, Pattern Recognit. Lett..

[28]  Adnan Mohsin Abdulazeez,et al.  The Role of Machine Learning Algorithms for Diagnosing Diseases , 2021 .

[29]  Mohamed Elhoseny,et al.  Feature selection based on artificial bee colony and gradient boosting decision tree , 2019, Appl. Soft Comput..

[30]  Dae-Won Kim,et al.  Approximating mutual information for multi-label feature selection , 2012 .

[31]  Ying Yu,et al.  Feature Selection for Multi-label Learning Using Mutual Information and GA , 2014, RSKT.

[32]  Newton Spolaôr,et al.  A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach , 2013, CLEI Selected Papers.

[33]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[34]  Qinghua Hu,et al.  Multi-label feature selection with missing labels , 2018, Pattern Recognit..

[35]  Dinggang Shen,et al.  Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation, and Diagnosis for COVID-19 , 2020, IEEE Reviews in Biomedical Engineering.

[36]  Adnan Mohsin Abdulazeez,et al.  A Review of Principal Component Analysis Algorithm for Dimensionality Reduction , 2021 .

[37]  Yong Zhang,et al.  Cost-sensitive feature selection using two-archive multi-objective artificial bee colony algorithm , 2019, Expert Syst. Appl..

[38]  Ping Zhang,et al.  Distinguishing two types of labels for multi-label feature selection , 2019, Pattern Recognit..

[39]  Ram Sarkar,et al.  A wrapper-filter feature selection technique based on ant colony optimization , 2019, Neural Computing and Applications.

[40]  Diyar Qader Zeebaree,et al.  Robust watermarking scheme based LWT and SVD using artificial bee colony optimization , 2021 .

[41]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[42]  Hossein Nezamabadi-pour,et al.  MLACO: A multi-label feature selection algorithm based on ant colony optimization , 2020, Knowl. Based Syst..

[43]  Habibollah Haron,et al.  Gene Selection and Classification of Microarray Data Using Convolutional Neural Network , 2018, 2018 International Conference on Advanced Science and Engineering (ICOASE).

[44]  A Graph-based Multi-Label Feature Selection using ant Colony Optimization , 2020, 2020 10th International Symposium onTelecommunications (IST).

[45]  Adnan Mohsin Abdulazeez,et al.  Machine Learning Classifiers Based Classification For IRIS Recognition , 2021 .

[46]  Qinghua Hu,et al.  Multi-label feature selection based on max-dependency and min-redundancy , 2015, Neurocomputing.

[47]  Zhiwei Ye,et al.  A feature selection method based on modified binary coded ant colony optimization algorithm , 2016, Appl. Soft Comput..

[48]  Parham Moradi,et al.  An unsupervised feature selection algorithm based on ant colony optimization , 2014, Eng. Appl. Artif. Intell..

[49]  Utkarsh Singh,et al.  A new optimal feature selection scheme for classification of power quality disturbances based on ant colony framework , 2019, Appl. Soft Comput..

[50]  Jia Zhang,et al.  Mutual information based multi-label feature selection via constrained convex optimization , 2019, Neurocomputing.

[51]  Alex Alves Freitas,et al.  A new genetic algorithm for multi-label correlation-based feature selection , 2015, ESANN.

[52]  Roberto Navigli,et al.  Knowledge-enhanced document embeddings for text classification , 2019, Knowl. Based Syst..

[53]  Adnan Mohsin Abdulazeez,et al.  COVID-19 World Vaccination Progress Using Machine Learning Classification Algorithms , 2021 .

[54]  Hossein Nezamabadi-pour,et al.  MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality , 2020, Expert Syst. Appl..

[55]  Diyar Qader Zeebaree,et al.  A Review on Region of Interest Segmentation Based on Clustering Techniques for Breast Cancer Ultrasound Images , 2020, Journal of Applied Science and Technology Trends.

[56]  Habibollah Haron,et al.  Trainable Model Based on New Uniform LBP Feature to Identify the Risk of the Breast Cancer , 2019, 2019 International Conference on Advanced Science and Engineering (ICOASE).

[57]  Adel Sabry Eesa,et al.  A New DIDS Design Based on a Combination Feature Selection Approach , 2015 .

[58]  Adnan Mohsin Abdulazeez,et al.  Comparative Study for Classification Algorithms Performance in Crop Yields Prediction Systems , 2021 .

[59]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[60]  Yong Zhang,et al.  A PSO-based multi-objective multi-label feature selection method in classification , 2017, Scientific Reports.

[61]  Parham Moradi,et al.  Integration of graph clustering with ant colony optimization for feature selection , 2015, Knowl. Based Syst..

[62]  José Ramón Quevedo,et al.  Graphical Feature Selection for Multilabel Classification Tasks , 2011, IDA.

[63]  Dae-Won Kim,et al.  SCLS: Multi-label feature selection based on scalable criterion for large label set , 2017, Pattern Recognit..

[64]  Sheng Wang,et al.  Low-rank graph preserving discriminative dictionary learning for image recognition , 2020, Knowl. Based Syst..

[65]  Mehrdad Rostami,et al.  Review of Swarm Intelligence-based Feature Selection Methods , 2020, Eng. Appl. Artif. Intell..

[66]  Swanand Kadhe,et al.  Private Information Retrieval With Side Information , 2017, IEEE Transactions on Information Theory.

[67]  Kok-Swee Sim,et al.  Convolutional neural network improvement for breast cancer classification , 2019, Expert Syst. Appl..

[68]  Hossein Nezamabadi-pour,et al.  An advanced ACO algorithm for feature subset selection , 2015, Neurocomputing.

[69]  Alex Alves Freitas,et al.  Two Extensions to Multi-label Correlation-Based Feature Selection: A Case Study in Bioinformatics , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.