On the Functional Equivalence of TSK Fuzzy Systems to Neural Networks, Mixture of Experts, CART, and Stacking Ensemble Regression

Fuzzy systems have achieved great success in numerous applications. However, there are still many challenges in designing an optimal fuzzy system, e.g., how to efficiently optimize its parameters, how to balance the trade-off between cooperations and competitions among the rules, how to overcome the curse of dimensionality, how to increase its generalization ability, etc. Literature has shown that by making appropriate connections between fuzzy systems and other machine learning approaches, good practices from other domains may be used to improve the fuzzy systems, and vice versa. This article gives an overview on the functional equivalence between Takagi–Sugeno–Kang fuzzy systems and four classic machine learning approaches—neural networks, mixture of experts, classification and regression trees, and stacking ensemble regression—for regression problems. We also point out some promising new research directions, inspired by the functional equivalence, that could lead to solutions to the aforementioned problems. To our knowledge, this is so far the most comprehensive overview on the connections between fuzzy systems and other popular machine learning approaches, and hopefully will stimulate more hybridization between different machine learning algorithms.

[1]  Hong Jia,et al.  Subspace Clustering of Categorical and Numerical Data With an Unknown Number of Clusters , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Jerry M. Mendel,et al.  Linguistic Summarization Using IF–THEN Rules and Interval Type-2 Fuzzy Sets , 2011, IEEE Transactions on Fuzzy Systems.

[3]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[4]  Dirk Van,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[5]  John Moody,et al.  Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[6]  G. Stewart,et al.  Rank degeneracy and least squares problems , 1976 .

[7]  Cezary Z. Janikow,et al.  Fuzzy decision trees: issues and methods , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[8]  Michael W. Berry,et al.  Lecture Notes in Data Mining , 2006 .

[9]  Yu-Jun Zheng,et al.  Airline Passenger Profiling Based on Fuzzy Deep Machine Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Kasun Amarasinghe,et al.  Explaining What a Neural Network has Learned: Toward Transparent Classification , 2019, 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[11]  Tor Arne Johansen,et al.  Local learning in local model networks , 1995 .

[12]  Geoffrey J. McLachlan,et al.  A Universal Approximation Theorem for Mixture-of-Experts Models , 2016, Neural Computation.

[13]  J.-S.R. Jang,et al.  Structure determination in fuzzy modeling: a fuzzy CART approach , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[14]  L X Wang,et al.  Fuzzy basis functions, universal approximation, and orthogonal least-squares learning , 1992, IEEE Trans. Neural Networks.

[15]  Roderick Murray-Smith,et al.  Extending the functional equivalence of radial basis function networks and fuzzy inference systems , 1996, IEEE Trans. Neural Networks.

[16]  C. S. George Lee,et al.  Neural fuzzy systems: a neuro-fuzzy synergism to intelligent systems , 1996 .

[17]  Robert Ivor John,et al.  Type-1 and interval type-2 ANFIS: A comparison , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[18]  Antonio F. Gómez-Skarmeta,et al.  A fuzzy clustering-based rapid prototyping for fuzzy rule-based modeling , 1997, IEEE Trans. Fuzzy Syst..

[19]  Malcolm I. Heywood,et al.  Input partitioning to mixture of experts , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[20]  Joseph N. Wilson,et al.  Twenty Years of Mixture of Experts , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Yuichi Motai,et al.  Multicolumn RBF Network , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Babak Nadjar Araabi,et al.  Introducing evolving Takagi–Sugeno method based on local least squares support vector machine models , 2011, Evolving Systems.

[23]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[24]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[25]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[26]  Dongrui Wu,et al.  Approaches for Reducing the Computational Cost of Interval Type-2 Fuzzy Logic Systems: Overview and Comparisons , 2013, IEEE Transactions on Fuzzy Systems.

[27]  Hugues Bersini,et al.  Now comes the time to defuzzify neuro-fuzzy models , 1997, Fuzzy Sets Syst..

[28]  Reza Ebrahimpour,et al.  Mixture of MLP-experts for trend forecasting of time series: A case study of the Tehran stock exchange , 2011 .

[29]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[30]  Yann LeCun,et al.  Regularization of Neural Networks using DropConnect , 2013, ICML.

[31]  J. Buckley Sugeno type controllers are universal controllers , 1993 .

[32]  Lotfi A. Zadeh,et al.  On Fuzzy Mapping and Control , 1996, IEEE Trans. Syst. Man Cybern..

[33]  Ahmad Lotfi,et al.  Comments on "Functional equivalence between radial basis function networks and fuzzy inference systems" [and reply] , 1998, IEEE Trans. Neural Networks.

[34]  Dimitar Filev,et al.  Generation of Fuzzy Rules by Mountain Clustering , 1994, J. Intell. Fuzzy Syst..

[35]  Michael K. Ng,et al.  An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data , 2007, IEEE Transactions on Knowledge and Data Engineering.

[36]  Jacek Łęski,et al.  A fuzzy if-then rule-based nonlinear classifier , 2003 .

[37]  Reza Ebrahimpour,et al.  Mixture of experts: a literature survey , 2014, Artificial Intelligence Review.

[38]  Youyong Kong,et al.  A Hierarchical Fused Fuzzy Deep Neural Network for Data Classification , 2017, IEEE Transactions on Fuzzy Systems.

[39]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[40]  Dongrui Wu,et al.  Optimize TSK Fuzzy Systems for Regression Problems: Minibatch Gradient Descent With Regularization, DropRule, and AdaBound (MBGD-RDA) , 2019, IEEE Transactions on Fuzzy Systems.

[41]  Chuen-Tsai Sun,et al.  Functional equivalence between radial basis function networks and fuzzy inference systems , 1993, IEEE Trans. Neural Networks.

[42]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[43]  Dongrui Wu,et al.  Recommendations on Designing Practical Interval Type-2 Fuzzy Systems , 2019, Eng. Appl. Artif. Intell..

[44]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[45]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[46]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[47]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[48]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[49]  Jooyoung Park,et al.  Universal Approximation Using Radial-Basis-Function Networks , 1991, Neural Computation.

[50]  Hisao Ishibuchi,et al.  A simple but powerful heuristic method for generating fuzzy rules from numerical data , 1997, Fuzzy Sets Syst..

[51]  Dongrui Wu,et al.  GENETIC LEARNING AND PERFORMANCE EVALUATION OF TYPE-2 FUZZY LOGIC CONTROLLERS , 2006 .

[52]  Johannes Gehrke,et al.  SECRET: a scalable linear regression tree algorithm , 2002, KDD.

[53]  Harry Wechsler,et al.  Mixture of experts for classification of gender, ethnic origin, and pose of human faces , 2000, IEEE Trans. Neural Networks Learn. Syst..

[54]  Dongrui Wu,et al.  Computationally Efficient Type-Reduction Strategies for a Type-2 Fuzzy Logic Controller , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[55]  Jerry M. Mendel,et al.  Back-propagation fuzzy system as nonlinear dynamic system identifiers , 1992, [1992 Proceedings] IEEE International Conference on Fuzzy Systems.

[56]  Xin Yao,et al.  Ensemble learning via negative correlation , 1999, Neural Networks.

[57]  Theodosios Pavlidis,et al.  Fuzzy Decision Tree Algorithms , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[58]  J. Buckley,et al.  Fuzzy neural networks: a survey , 1994 .

[59]  Chin-Teng Lin,et al.  An online self-constructing neural fuzzy inference network and its applications , 1998, IEEE Trans. Fuzzy Syst..

[60]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[61]  Jerry M. Mendel New book “Uncertain Rule-Based Fuzzy Systems Introduction and New Directios, 2nd Edition” , 2017 .

[62]  Robert Ivor John,et al.  An extended ANFIS architecture and its learning properties for type-1 and interval type-2 models , 2016, 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[63]  Wei-Yin Loh,et al.  Fifty Years of Classification and Regression Trees , 2014 .

[64]  Dongrui Wu,et al.  Active Stacking for Heart Rate Estimation , 2019, 2020 International Joint Conference on Neural Networks (IJCNN).

[65]  John Yen,et al.  Improving the interpretability of TSK fuzzy models by combining global learning and local learning , 1998, IEEE Trans. Fuzzy Syst..

[66]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[67]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[68]  Alberto Suárez,et al.  Globally Optimal Fuzzy Decision Trees for Classification and Regression , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[69]  Saman K. Halgamuge,et al.  Neural networks in designing fuzzy systems for real world applications , 1994 .

[70]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[71]  Hamid R. Berenji,et al.  Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.

[72]  Chia-Feng Juang,et al.  An Interval Type-2 Fuzzy-Neural Network With Support-Vector Regression for Noisy Regression Problems , 2010, IEEE Transactions on Fuzzy Systems.