Imperfect Big Data
暂无分享,去创建一个
Sergio Ramírez-Gallego | Julián Luengo | Francisco Herrera | Diego García-Gil | Salvador García | S. García | F. Herrera | S. Ramírez-Gallego | J. Luengo | Diego García-Gil
[1] Taghi M. Khoshgoftaar,et al. Improving Software Quality Prediction by Noise Filtering Techniques , 2007, Journal of Computer Science and Technology.
[2] Xingquan Zhu,et al. Class Noise vs. Attribute Noise: A Quantitative Study , 2003, Artificial Intelligence Review.
[3] Francisco Herrera,et al. INFFC: An iterative class noise filter based on the fusion of classifiers with noise sensitivity control , 2016, Inf. Fusion.
[4] Francisco Herrera,et al. Enabling Smart Data: Noise filtering in Big Data classification , 2017, Inf. Sci..
[5] André Carlos Ponce de Leon Ferreira de Carvalho,et al. Effect of label noise in the complexity of classification problems , 2015, Neurocomputing.
[6] M. Verleysen,et al. Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.
[7] Carla E. Brodley,et al. Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..
[8] T. Schneider. Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values. , 2001 .
[9] Charles Bouveyron,et al. Robust supervised classification with mixture models: Learning from data with uncertain labels , 2009, Pattern Recognit..
[10] D. Rubin,et al. Statistical Analysis with Missing Data. , 1989 .
[11] R. Little. A Test of Missing Completely at Random for Multivariate Data with Missing Values , 1988 .
[12] Francisco Herrera,et al. kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data , 2017, Knowl. Based Syst..
[13] Gustavo E. A. P. A. Batista,et al. An analysis of four missing data treatment methods for supervised learning , 2003, Appl. Artif. Intell..
[14] Dennis L. Wilson,et al. Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..
[15] Francisco Herrera,et al. Transforming big data into smart data: An insight on the use of the k‐nearest neighbors algorithm to obtain quality data , 2018, WIREs Data Mining Knowl. Discov..
[16] Maoguo Gong,et al. RBoost: Label Noise-Robust Boosting Algorithm Based on a Nonconvex Loss Function and the Numerically Stable Base Learners , 2016, IEEE Transactions on Neural Networks and Learning Systems.
[17] Francisco Herrera,et al. MRPR: A MapReduce solution for prototype reduction in big data classification , 2015, Neurocomputing.
[18] Francisco Herrera,et al. ROSEFW-RF: The winner algorithm for the ECBDL'14 big data competition: An extremely imbalanced big data bioinformatics problem , 2015, Knowl. Based Syst..
[19] Anneleen Van Assche,et al. Ensemble Methods for Noise Elimination in Classification Problems , 2003, Multiple Classifier Systems.
[20] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[21] Han Liu,et al. Challenges of Big Data Analysis. , 2013, National science review.
[22] Taghi M. Khoshgoftaar,et al. Analyzing software measurement data with clustering techniques , 2004, IEEE Intelligent Systems.
[23] P. Baldi,et al. Searching for exotic particles in high-energy physics with deep learning , 2014, Nature Communications.
[24] Filiberto Pla,et al. Prototype selection for the nearest neighbour rule through proximity graphs , 1997, Pattern Recognit. Lett..
[25] Francisco Herrera,et al. Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[26] Roberto Alejo,et al. Analysis of new techniques to obtain quality training sets , 2003, Pattern Recognit. Lett..
[27] Aníbal R. Figueiras-Vidal,et al. Pattern classification with missing data: a review , 2010, Neural Computing and Applications.
[28] Francisco Herrera,et al. A Taxonomy and Experimental Study on Prototype Generation for Nearest Neighbor Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[29] Gene H. Golub,et al. Missing value estimation for DNA microarray gene expression data: local least squares imputation , 2005, Bioinform..
[30] Francisco Herrera,et al. Data Preprocessing in Data Mining , 2014, Intelligent Systems Reference Library.
[31] Salvatore J. Stolfo,et al. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.
[32] Marcel J. T. Reinders,et al. Classification in the presence of class noise using a probabilistic Kernel Fisher method , 2007, Pattern Recognit..
[33] Francisco Herrera,et al. On the choice of the best imputation methods for missing values considering three groups of classification methods , 2012, Knowledge and Information Systems.
[34] Xindong Wu. Knowledge Acquisition from Databases , 1995 .