An Insight on the Class Imbalance Problem and Its Solutions in Big Data

[1]  Yidan Wang,et al.  A robust loss function for classification with imbalanced datasets , 2019, Neurocomputing.

[2]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[3]  Francesco Marcelloni,et al.  Spreading fuzzy random forests with MapReduce , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[4]  Enrico Zio,et al.  Integration of feature vector selection and support vector machine for classification of imbalanced data , 2019, Appl. Soft Comput..

[5]  Francisco Herrera,et al.  ROSEFW-RF: The winner algorithm for the ECBDL'14 big data competition: An extremely imbalanced big data bioinformatics problem , 2015, Knowl. Based Syst..

[6]  Amit Prakash Singh,et al.  Empirical Evaluation of Map Reduce Based Hybrid Approach for Problem of Imbalanced Classification in Big Data , 2019, Int. J. Grid High Perform. Comput..

[7]  George K. Karagiannidis,et al.  Efficient Machine Learning for Big Data: A Review , 2015, Big Data Res..

[8]  Athanasios V. Vasilakos,et al.  Big data analytics: a survey , 2015, Journal of Big Data.

[9]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[10]  Fatemeh Afsari,et al.  Hesitant fuzzy decision tree approach for highly imbalanced data classification , 2017, Appl. Soft Comput..

[11]  Y Al-JarrahOmar,et al.  Efficient Machine Learning for Big Data , 2015 .

[12]  Ling Tang,et al.  A DBN-based resampling SVM ensemble learning paradigm for credit classification with imbalanced data , 2018, Appl. Soft Comput..

[13]  Francisco Herrera,et al.  A MapReduce Approach to Address Big Data Classification Problems Based on the Fusion of Linguistic Fuzzy Rules , 2015, Int. J. Comput. Intell. Syst..

[14]  Bartosz Krawczyk,et al.  Learning from imbalanced data: open challenges and future directions , 2016, Progress in Artificial Intelligence.

[15]  Jian Pei,et al.  Classification: Basic Concepts , 2012 .

[16]  Stan Matwin,et al.  A distributed instance-weighted SVM algorithm on large-scale imbalanced datasets , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[17]  Seong-hun Park,et al.  Large Imbalance Data Classification Based on MapReduce for Traffic Accident Prediction , 2014, 2014 Eighth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing.

[18]  Amit Prakash Singh,et al.  Benchmarking framework for class imbalance problem using novel sampling approach for big data , 2019, Int. J. Syst. Assur. Eng. Manag..

[19]  Francisco Herrera,et al.  Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data , 2015, Fuzzy Sets Syst..