Imbalanced TSK Fuzzy Classifier by Cross-Class Bayesian Fuzzy Clustering and Imbalance Learning

In this paper, a novel construction algorithm called imbalanced Takagi–Sugeno–Kang fuzzy classifier (IB-TSK-FC) for the TSK fuzzy classifier is presented to improve the classification performance and rule-based interpretability for imbalanced datasets. IB-TSK-FC consists of two components: 1) a cross-class Bayesian fuzzy clustering algorithm (BF3C) and 2) an imbalance learning algorithm. In order to achieve high interpretability, BF3C is developed to determine an appropriate number of fuzzy rules and identify antecedent parameters of fuzzy rules from the perspective of the probabilistic model. In addition to inheriting the distinctive advantage of Bayesian fuzzy clustering that the number of clusters can be estimated in the framework of Bayesian inference, BF3C considers repulsion forces between cluster centers belonging to different classes, and uses an alternating iterative strategy to obtain more interpretable antecedent parameters for imbalanced datasets. In order to improve the classification performance for imbalanced datasets, an imbalance learning algorithm is derived to estimate consequent parameters of fuzzy rules on the basis of the weighted average misclassification error. Comprehensive experiments on synthetic and UCI datasets demonstrate the effectiveness of the proposed IB-TSK-FC algorithm.

[1]  Edwin Lughofer,et al.  Reliable All-Pairs Evolving Fuzzy Classifiers , 2013, IEEE Transactions on Fuzzy Systems.

[2]  Zhaohong Deng,et al.  Scalable TSK Fuzzy Modeling for Very Large Datasets Using Minimal-Enclosing-Ball Approximation , 2011, IEEE Transactions on Fuzzy Systems.

[3]  Paul D. Gader,et al.  Bayesian Fuzzy Clustering , 2015, IEEE Transactions on Fuzzy Systems.

[4]  Jianbin Qiu,et al.  Adaptive Fuzzy Backstepping Control for A Class of Nonlinear Systems With Sampled and Delayed Measurements , 2015, IEEE Transactions on Fuzzy Systems.

[5]  Shaocheng Tong,et al.  Neural Controller Design-Based Adaptive Control for Nonlinear MIMO Systems With Unknown Hysteresis Inputs , 2016, IEEE Transactions on Cybernetics.

[6]  Hwa Jen Yap,et al.  On Equivalence of FIS and ELM for Interpretable Rule-Based Knowledge Representation , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Korris Fu-Lai Chung,et al.  Support vector machine with manifold regularization and partially labeling privacy protection , 2015, Inf. Sci..

[8]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[9]  Francisco Herrera,et al.  An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics , 2013, Inf. Sci..

[10]  Nathalie Japkowicz,et al.  The Class Imbalance Problem: Significance and Strategies , 2000 .

[11]  Shuzhi Sam Ge,et al.  Top Tension Control of a Flexible Marine Riser by Using Integral-Barrier Lyapunov Function , 2015, IEEE/ASME Transactions on Mechatronics.

[12]  Shaocheng Tong,et al.  Adaptive fuzzy control for a class of unknown nonlinear dynamical systems , 2015, Fuzzy Sets Syst..

[13]  Francisco Herrera,et al.  ROSEFW-RF: The winner algorithm for the ECBDL'14 big data competition: An extremely imbalanced big data bioinformatics problem , 2015, Knowl. Based Syst..

[14]  Janusz Korniak,et al.  Nearest-Neighbor Spline Approximation (NNSA) Improvement to TSK Fuzzy Systems , 2016, IEEE Transactions on Industrial Informatics.

[15]  J.M. Leski,et al.  An /spl epsiv/-margin nonlinear classifier based on fuzzy if-then rules , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[17]  Chia-Feng Juang,et al.  A Fuzzy Model With Online Incremental SVM and Margin-Selective Gradient Descent Learning for Classification Problems , 2014, IEEE Transactions on Fuzzy Systems.

[18]  Antonio J. Rivera,et al.  GP-COACH: Genetic Programming-based learning of COmpact and ACcurate fuzzy rule-based classification systems for High-dimensional problems , 2010, Inf. Sci..

[19]  Zhaohong Deng,et al.  Knowledge-leverage-based TSK Fuzzy System modeling. , 2013, IEEE transactions on neural networks and learning systems.

[20]  Yuming Zhou,et al.  A novel ensemble method for classifying imbalanced data , 2015, Pattern Recognit..

[21]  Francisco Herrera,et al.  EUSBoost: Enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling , 2013, Pattern Recognit..

[22]  Longbing Cao,et al.  Minimax Probability TSK Fuzzy System Classifier: A More Transparent and Highly Interpretable Classification Model , 2015, IEEE Transactions on Fuzzy Systems.

[23]  Witold Pedrycz,et al.  Cluster-Centric Fuzzy Modeling , 2014, IEEE Transactions on Fuzzy Systems.

[24]  Vasile Palade,et al.  FSVM-CIL: Fuzzy Support Vector Machines for Class Imbalance Learning , 2010, IEEE Transactions on Fuzzy Systems.

[25]  J. Łęski,et al.  Iteratively reweighted least squares classifier and its l2- and l1-regularized Kernel versions , 2010 .

[26]  Zhaohong Deng,et al.  Knowledge-Leverage-Based Fuzzy System and Its Modeling , 2013, IEEE Transactions on Fuzzy Systems.

[27]  Francisco Herrera,et al.  A New Multiobjective Evolutionary Algorithm for Mining a Reduced Set of Interesting Positive and Negative Quantitative Association Rules , 2014, IEEE Transactions on Evolutionary Computation.

[28]  Yang Wang,et al.  Cost-sensitive boosting for classification of imbalanced data , 2007, Pattern Recognit..

[29]  Jesús Alcalá-Fdez,et al.  Local identification of prototypes for genetic learning of accurate TSK fuzzy rule‐based systems , 2007, Int. J. Intell. Syst..

[30]  Lala Septem Riza,et al.  frbs: Fuzzy Rule-Based Systems for Classification and Regression in R , 2015 .

[31]  Chia-Feng Juang,et al.  A TS Fuzzy System Learned Through a Support Vector Machine in Principal Component Space for Real-Time Object Detection , 2012, IEEE Transactions on Industrial Electronics.

[32]  P. Moral Feynman-Kac Formulae: Genealogical and Interacting Particle Systems with Applications , 2004 .

[33]  Hisao Ishibuchi,et al.  Fuzzy rule selection by multi-objective genetic local search algorithms and rule evaluation measures in data mining , 2004, Fuzzy Sets Syst..

[34]  Zhi-Hua Zhou,et al.  Exploratory Undersampling for Class-Imbalance Learning , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[35]  Jacek M Leski An epsilon-margin nonlinear classifier based on fuzzy if-then rules. , 2004, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society.

[36]  Shaocheng Tong,et al.  Fuzzy Approximation-Based Adaptive Backstepping Optimal Control for a Class of Nonlinear Discrete-Time Systems With Dead-Zone , 2016, IEEE Transactions on Fuzzy Systems.

[37]  María José del Jesús,et al.  Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets , 2009, Int. J. Approx. Reason..

[38]  María José del Jesús,et al.  A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets , 2013, Knowl. Based Syst..

[39]  Francisco Herrera,et al.  A Review of the Application of Multiobjective Evolutionary Fuzzy Systems: Current Status and Further Directions , 2013, IEEE Transactions on Fuzzy Systems.

[40]  Chia-Feng Juang,et al.  Fuzzy System Learned Through Fuzzy Clustering and Support Vector Machine for Human Skin Color Segmentation , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[41]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[42]  Francisco Herrera,et al.  Analysis of preprocessing vs. cost-sensitive learning for imbalanced classification. Open problems on intrinsic data characteristics , 2012, Expert Syst. Appl..

[43]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[44]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[45]  Jacek M. Leski,et al.  Fuzzy $(c+p)$-Means Clustering and Its Application to a Fuzzy Rule-Based Classifier: Toward Good Generalization and Good Interpretability , 2015, IEEE Transactions on Fuzzy Systems.

[46]  Zhaohong Deng,et al.  Multitask TSK Fuzzy System Modeling by Mining Intertask Common Hidden Structure , 2015, IEEE Transactions on Cybernetics.

[47]  Yanqing Zhang,et al.  SVMs Modeling for Highly Imbalanced Classification , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).