When is resampling beneficial for feature selection with imbalanced wide data?

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[3]  Aytaç Altan,et al.  Recognition Model for Solar Radiation Time Series based on Random Forest with Feature Selection Approach , 2019, 2019 11th International Conference on Electrical and Electronics Engineering (ELECO).

[4]  Richard Weber,et al.  Feature selection for high-dimensional class-imbalanced data sets using Support Vector Machines , 2014, Inf. Sci..

[5]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[6]  Kevin J. Johnson,et al.  Pattern recognition of jet fuels: comprehensive GC×GC with ANOVA-based feature selection and principal component analysis , 2002 .

[7]  Lining Xing,et al.  Diagnosis of Rolling Bearing Based on Classification for High Dimensional Unbalanced Data , 2019, IEEE Access.

[8]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[9]  Bernd Bischl,et al.  Benchmark for filter methods for feature selection in high-dimensional classification data , 2020, Comput. Stat. Data Anal..

[10]  Daniel S. Yeung,et al.  Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems , 2015, IEEE Transactions on Cybernetics.

[11]  Juan José Rodríguez Diez,et al.  Diversity techniques improve the performance of the best imbalance learning ensembles , 2015, Inf. Sci..

[12]  Kewei Cheng,et al.  Feature Selection , 2016, ACM Comput. Surv..

[13]  Randal S. Olson,et al.  Relief-Based Feature Selection: Introduction and Review , 2017, J. Biomed. Informatics.

[14]  Sattar Hashemi,et al.  To Combat Multi-Class Imbalanced Problems by Means of Over-Sampling Techniques , 2016, IEEE Transactions on Knowledge and Data Engineering.

[15]  Nordin Saad,et al.  A Review of Artificial Intelligence Methods for Condition Monitoring and Fault Diagnosis of Rolling Element Bearings for Induction Motor , 2020, Shock and Vibration.

[16]  Yang Liu,et al.  Classification of EEG Signals for Epileptic Seizures Using Feature Dimension Reduction Algorithm based on LPP , 2020, Multimedia Tools and Applications.

[17]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[18]  Juan José Rodríguez Diez,et al.  Experimental evaluation of ensemble classifiers for imbalance in Big Data , 2021, Applied Soft Computing.

[19]  Sergio Ramírez-Gallego,et al.  Evolutionary Feature Selection for Big Data Classification: A MapReduce Approach , 2015 .

[20]  Francisco Herrera,et al.  Learning from Imbalanced Data Sets , 2018, Springer International Publishing.

[21]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[22]  Sonia Migliorati,et al.  A generalization of the Dirichlet distribution , 2013, J. Multivar. Anal..

[23]  Dirk P. Kroese,et al.  Why the Monte Carlo method is so important today , 2014 .

[24]  Yue Zhao,et al.  A Simple Recurrent Unit Model Based Intrusion Detection System With DCGAN , 2019, IEEE Access.

[25]  Barbara Pes,et al.  Learning From High-Dimensional Biomedical Datasets: The Issue of Class Imbalance , 2020, IEEE Access.

[26]  Marco Zaffalon,et al.  Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis , 2016, J. Mach. Learn. Res..

[27]  Satchidananda Dehuri,et al.  A Study on the Relevance of Feature Selection Methods in Microarray Data , 2018, The Open Bioinformatics Journal.

[28]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[29]  Marco Zaffalon,et al.  A Bayesian Wilcoxon signed-rank test based on the Dirichlet process , 2014, ICML.

[30]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[31]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[32]  Chongsheng Zhang,et al.  Feature selection and resampling in class imbalance learning: Which comes first? An empirical study in the biological domain , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[33]  Amalia Luque,et al.  The impact of class imbalance in classification performance metrics based on the binary confusion matrix , 2019, Pattern Recognit..

[34]  Randy Kerber,et al.  ChiMerge: Discretization of Numeric Attributes , 1992, AAAI.

[35]  Álvar Arnaiz-González,et al.  Early and extremely early multi-label fault diagnosis in induction motors. , 2020, ISA transactions.

[36]  Emanuele Frontoni,et al.  Discovering the Type 2 Diabetes in Electronic Health Records Using the Sparse Balanced Support Vector Machine , 2020, IEEE Journal of Biomedical and Health Informatics.

[37]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[38]  Zexuan Zhu,et al.  Markov blanket-embedded genetic algorithm for gene selection , 2007, Pattern Recognit..

[39]  Stefan C. Kremer,et al.  An Accurate, Fast Embedded Feature Selection for SVMs , 2014, 2014 13th International Conference on Machine Learning and Applications.