Multigranulation consensus fuzzy-rough based attribute reduction

Abstract As big data often contains a significant amount of unstructured, imprecise, and uncertain data, the fuzzy-rough-set-based attribute reduction is a valuable technique for uncertainty reasoning and data mining. However, tradition fuzzy-rough-set algorithms take all data correlations into account, which may lead to challenging requirements for computing and memory space resources. In this paper, we propose a novel Multigranulation Consensus Fuzzy-Rough Attribute Reduction (MCFR) algorithm for big data analysis, which assigns tasks to multiple nodes for parallel computing with different multigranulation weights and creates reasonable granular subpopulations for attribute reduction. A multigranulation margin model with a self-evolving consensus scheme is constructed wherein super elitists are classified into different multigranulation populations using the weighted margins’ matrix to calculate the parallel positive regions. This model allows penalizing super elitists with cooperative behaviors from coarsening to refining. A cascade cross-coevolutionary multigranulation learning model utilizes a more efficient coevolutionary attribute classification with multigranulation flexible thresholds. Here, the super elitist crossover learning strategy is employed to search for elitists’ chromosomes in granular subpopulations to yield better objective function values for handling dynamic big data classification tasks. Our experimental results demonstrate that MCFR can achieve a high performance in addressing uncertainty and fuzzy attribute reduction problems for big datasets with increasing noise. We also demonstrate a practical application of MCFR by using it to automatically segment fMRIs of the neonatal brain tissues at both the global and tissue levels. The results were in the good agreement with expert manual classifications, thus introducing a new method for the medical decision support problem of predicting disorders from neonatal brain fMRIs.

[1]  Dinggang Shen,et al.  Neonatal brain image segmentation in longitudinal MRI studies , 2010, NeuroImage.

[2]  Hui Wang,et al.  Fuzzy rough set based incremental attribute reduction from dynamic data with sample arriving , 2017, Fuzzy Sets Syst..

[3]  Santiago Aja-Fernández,et al.  Non-Stationary Rician Noise Estimation in Parallel MRI Using a Single Image: A Variance-Stabilizing Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Zeshui Xu,et al.  Towards felicitous decision making: An overview on challenges and trends of Big Data , 2016, Inf. Sci..

[5]  Qinghua Hu,et al.  A Fitting Model for Feature Selection With Fuzzy Rough Sets , 2017, IEEE Transactions on Fuzzy Systems.

[6]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[7]  Witold Pedrycz,et al.  Kernelized Fuzzy Rough Sets and Their Applications , 2011, IEEE Transactions on Knowledge and Data Engineering.

[8]  Jiye Liang,et al.  Local multigranulation decision-theoretic rough sets , 2017, Int. J. Approx. Reason..

[9]  Maria J. Martín-Bautista,et al.  Finding tendencies in streaming data using Big Data frequent itemset mining , 2019, Knowl. Based Syst..

[10]  Alicja Mieszkowicz-Rolka,et al.  Variable Precision Fuzzy Rough Sets , 2004, Trans. Rough Sets.

[11]  Chris Cornelis,et al.  Applications of Fuzzy Rough Set Theory in Machine Learning: a Survey , 2015, Fundam. Informaticae.

[12]  Sourabh Bhattacharya,et al.  A Bayesian approach to modeling dynamic effective connectivity with fMRI data , 2006, NeuroImage.

[13]  Mukesh Prasad,et al.  Attribute Equilibrium Dominance Reduction Accelerator (DCCAEDR) Based on Distributed Coevolutionary Cloud and Its Application in Medical Records , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Witold Pedrycz,et al.  Data-Distribution-Aware Fuzzy Rough Set Model and its Application to Robust Classification , 2016, IEEE Transactions on Cybernetics.

[16]  David Zhang,et al.  Two-stage image denoising by principal component analysis with local pixel grouping , 2010, Pattern Recognit..

[17]  Jianzhong Li,et al.  Data Source Selection for Information Integration in Big Data Era , 2016, Inf. Sci..

[18]  Giancarlo Fortino,et al.  Intelligent temporal classification and fuzzy rough set-based feature selection algorithm for intrusion detection system in WSNs , 2019, Inf. Sci..

[19]  Stephen M Smith,et al.  Fast robust automated brain extraction , 2002, Human brain mapping.

[20]  Lotfi A. Zadeh,et al.  Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic , 1997, Fuzzy Sets Syst..

[21]  Yuhua Qian,et al.  Accelerator for multi-granularity attribute reduction , 2019, Knowl. Based Syst..

[22]  Samuel Madden,et al.  From Databases to Big Data , 2012, IEEE Internet Comput..

[23]  Bao Qing Hu,et al.  Granular fuzzy rough sets based on fuzzy implicators and coimplicators , 2019, Fuzzy Sets Syst..

[24]  Yiyu Yao,et al.  Ensemble selector for attribute reduction , 2018, Appl. Soft Comput..

[25]  Wei-Zhi Wu,et al.  Intuitionistic Fuzzy Rough Set-Based Granular Structures and Attribute Subset Selection , 2019, IEEE Transactions on Fuzzy Systems.

[26]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[27]  Zehong Cao,et al.  Shared Nearest-Neighbor Quantum Game-Based Attribute Reduction With Hierarchical Coevolutionary Spark and Its Application in Consistent Segmentation of Neonatal Cerebral Cortical Surfaces , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Richard Jensen,et al.  Unsupervised fuzzy-rough set-based dimensionality reduction , 2013, Inf. Sci..

[29]  Ge Yu,et al.  Rim: A reusable iterative model for big data , 2018, Knowl. Based Syst..

[30]  Weiping Ding,et al.  A Layered-Coevolution-Based Attribute-Boosted Reduction Using Adaptive Quantum-Behavior PSO and Its Consistent Segmentation for Neonates Brain Tissue , 2018, IEEE Transactions on Fuzzy Systems.

[31]  Yuwen Li,et al.  Attribute reduction for multi-label learning with fuzzy rough set , 2018, Knowl. Based Syst..

[32]  Witold Pedrycz,et al.  Large-Scale Multimodality Attribute Reduction With Multi-Kernel Fuzzy Rough Sets , 2018, IEEE Transactions on Fuzzy Systems.

[33]  D. Louis Collins,et al.  BEaST: Brain extraction based on nonlocal segmentation technique , 2012, NeuroImage.

[34]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[35]  Ming-Wen Shao,et al.  Fuzzy rough set-based attribute reduction using distance measures , 2019, Knowl. Based Syst..

[36]  Irena Koprinska,et al.  Multi-step forecasting for big data time series based on ensemble learning , 2019, Knowl. Based Syst..

[37]  Witold Pedrycz,et al.  Granular Computing: Perspectives and Challenges , 2013, IEEE Transactions on Cybernetics.

[38]  Chin-Teng Lin,et al.  Multiagent-consensus-MapReduce-based attribute reduction using co-evolutionary quantum PSO for big data applications , 2018, Neurocomputing.

[39]  Hamido Fujita,et al.  An efficient selector for multi-granularity attribute reduction , 2019, Inf. Sci..

[40]  Richard M. Leahy,et al.  Automated graph-based analysis and correction of cortical volume topology , 2001, IEEE Transactions on Medical Imaging.