Approx-SMOTE: Fast SMOTE for Big Data on Apache Spark
Álvar Arnaiz-González | César García-Osorio | Mario Juez-Gil | Juan José Rodríguez Diez | Carlos López Nozal
[1] Bartosz Krawczyk, et al. Multi-class imbalanced big data classification on Spark, 2021, Knowl. Based Syst.
[2] Chih-Jen Lin, et al. LIBSVM: A library for support vector machines, 2011, TIST.
[3] Yonggang Wen, et al. Toward Scalable Systems for Big Data Analytics: A Technology Tutorial, 2014, IEEE Access.
[4] Juan José Rodríguez Diez, et al. Random Balance: Ensembles of variable priors classifiers for imbalanced data, 2015, Knowl. Based Syst.
[5] Francisco Herrera, et al. SMOTE-BD: An Exact and Scalable Oversampling Method for Imbalanced Classification in Big Data, 2018, J. Comput. Sci. Technol.
[6] Nitesh V. Chawla, et al. Editorial: special issue on learning from imbalanced data sets, 2004, SKDD.
[7] Gaël Varoquaux, et al. Scikit-learn: Machine Learning in Python, 2011, J. Mach. Learn. Res.
[8] Andrew K. C. Wong, et al. Classification of Imbalanced Data: a Review, 2009, Int. J. Pattern Recognit. Artif. Intell.
[9] Haibo He, et al. Learning from Imbalanced Data, 2009, IEEE Transactions on Knowledge and Data Engineering.
[10] Nitesh V. Chawla, et al. SMOTE: Synthetic Minority Over-sampling Technique, 2002, J. Artif. Intell. Res.
[11] Ameet Talwalkar, et al. MLlib: Machine Learning in Apache Spark, 2015, J. Mach. Learn. Res.
[12] Reynold Xin, et al. Apache Spark, 2016.
[13] Andrew W. Moore, et al. An Investigation of Practical Approximate Nearest Neighbor Algorithms, 2004, NIPS.
[14] Piotr Indyk, et al. Similarity Search in High Dimensions via Hashing, 1999, VLDB.
[15] Marco Zaffalon, et al. Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis, 2016, J. Mach. Learn. Res.
[16] Juan José Rodríguez Diez, et al. Diversity techniques improve the performance of the best imbalance learning ensembles, 2015, Inf. Sci.
[17] Francisco Herrera, et al. kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data, 2017, Knowl. Based Syst.
[18] Ting Liu, et al. Clustering Billions of Images with Large Scale Nearest Neighbor Search, 2007, 2007 IEEE Workshop on Applications of Computer Vision (WACV '07).