SparseFIS: Data-Driven Learning of Fuzzy Systems With Sparsity Constraints

In this paper, we deal with a novel data-driven learning method [sparse fuzzy inference systems (SparseFIS)] for Takagi-Sugeno (T-S) fuzzy systems, extended by including rule weights. Our learning method consists of three phases: The first phase conducts a clustering process in the input/output feature space with iterative vector quantization and projects the obtained clusters onto 1-D axes to form the fuzzy sets (centers and widths) in the antecedent parts of the rules. Hereby, the number of clusters = rules is predefined and denotes a kind of upper bound on a reasonable granularity. The second phase optimizes the rule weights in the fuzzy systems with respect to least-squares error measure by applying a sparsity-constrained steepest descent-optimization procedure. Depending on the sparsity threshold, weights of many or a few rules can be forced toward 0, thereby, switching off (eliminating) some rules (rule selection). The third phase estimates the linear consequent parameters by a regularized sparsity-constrained-optimization procedure for each rule separately (local learning approach). Sparsity constraints are applied in order to force linear parameters to be 0, triggering a feature-selection mechanism per rule. Global feature selection is achieved whenever the linear parameters of some features in each rule are (near) 0. The method is evaluated, which is based on high-dimensional data from industrial processes and based on benchmark datasets from the Internet and compared with well-known batch-training methods, in terms of accuracy and complexity of the fuzzy systems.

[1]  L. Wang,et al.  Fuzzy systems are universal approximators , 1992, [1992 Proceedings] IEEE International Conference on Fuzzy Systems.

[2]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[3]  Edwin Lughofer,et al.  A Comparison of Variable Selection Methods with the Main Focus on Orthogonalization , 2004 .

[4]  Edwin Lughofer,et al.  An On-Line Interactive Self-adaptive Image Classification Framework , 2008, ICVS.

[5]  Otmar Scherzer,et al.  A Convergence Rate Result for a Steepest Descent Method and a Minimal Error Method for the Solution of Nonlinear Ill-Posed Problems , 1995 .

[6]  R. Ramlau,et al.  Surrogate functionals and thresholding for inverse interface problems , 2007 .

[7]  Ronald R. Yager,et al.  Learning of Fuzzy Rules by Mountain Clustering , 1992 .

[8]  Plamen P. Angelov,et al.  Automatic generation of fuzzy rule-based models from data by genetic algorithms , 2003, Inf. Sci..

[9]  O. Nelles Nonlinear System Identification , 2001 .

[10]  Ferenc Szeifert,et al.  Modified Gath-Geva fuzzy clustering for identification of Takagi-Sugeno fuzzy models , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[11]  Edwin Lughofer,et al.  Improving the robustness of data-driven fuzzy systems with regularization , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[12]  Magne Setnes Simplification and reduction of fuzzy rules , 2003 .

[13]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[14]  Robert Babuska,et al.  Fuzzy Modeling for Control , 1998 .

[15]  Edwin Lughofer,et al.  An approach to model-based fault detection in industrial measurement systems with application to engine test benches , 2006 .

[16]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[17]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[18]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[19]  Edwin Lughofer,et al.  Extensions of vector quantization for incremental clustering , 2008, Pattern Recognit..

[20]  Plamen Angelov,et al.  Toward Robust Evolving Fuzzy Systems , 2010 .

[21]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[22]  I. Daubechies,et al.  An iterative thresholding algorithm for linear inverse problems with a sparsity constraint , 2003, math/0307152.

[23]  Dirk A. Lorenz,et al.  A generalized conditional gradient method for nonlinear operator equations with sparsity constraints , 2007 .

[24]  Martin Burger,et al.  Regularized data-driven construction of fuzzy controllers , 2002 .

[25]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[26]  D. Nauck,et al.  NEFCLASS-X — a Soft Computing Tool to Build Readable Fuzzy Classifiers , 1998 .

[27]  János Abonyi,et al.  Fuzzy Model Identification for Control , 2003 .

[28]  Alan J. Miller Subset Selection in Regression , 1992 .

[29]  Kazufumi Ito,et al.  The Primal-Dual Active Set Strategy as a Semismooth Newton Method , 2002, SIAM J. Optim..

[30]  László T. Kóczy,et al.  Separated Antecedent and Consequent Learning for Takagi-Sugeno Fuzzy Systems , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[31]  Francisco Herrera,et al.  A Two-stage Evolutionary Process for Designing Tsk Fuzzy Rule-based Systems a Two-stage Evolutionary Process for Designing Tsk Fuzzy Rule-based Systems , 1996 .

[32]  L X Wang,et al.  Fuzzy basis functions, universal approximation, and orthogonal least-squares learning , 1992, IEEE Trans. Neural Networks.

[33]  Hubert Schwetlick,et al.  Least squares approximation by splines with free knots , 1995 .

[34]  D.P. Filev,et al.  Novelty Detection Based Machine Health Prognostics , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[35]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[36]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[37]  Eyke Hüllermeier,et al.  Improving the interpretability of data-driven evolving fuzzy systems , 2005, EUSFLAT Conf..

[38]  Jyh-Shing Roger Jang,et al.  Self-learning fuzzy controllers based on temporal backpropagation , 1992, IEEE Trans. Neural Networks.

[39]  D. Lorenz,et al.  A semismooth Newton method for Tikhonov functionals with sparsity constraints , 2007, 0709.3186.

[40]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[41]  Edwin Lughofer,et al.  FLEXFIS: A Robust Incremental Learning Approach for Evolving Takagi–Sugeno Fuzzy Models , 2008, IEEE Transactions on Fuzzy Systems.

[42]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[43]  Magne Setnes,et al.  Compact and transparent fuzzy models and classifiers through iterative complexity reduction , 2001, IEEE Trans. Fuzzy Syst..

[44]  Robert Babuska,et al.  Constructing fuzzy models by product space clustering , 1997 .

[45]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.

[46]  J. Casillas Interpretability issues in fuzzy modeling , 2003 .

[47]  Edwin Lughofer,et al.  On-Line Fault Detection with Data-Driven Evolving Fuzzy Models , 2008, Control. Intell. Syst..

[48]  H. Engl,et al.  Regularization of Inverse Problems , 1996 .

[49]  K. alik An efficient k'-means clustering algorithm , 2008 .

[50]  Ronny Ramlau,et al.  A Tikhonov-based projection iteration for nonlinear Ill-posed problems with sparsity constraints , 2006, Numerische Mathematik.

[51]  E. Lughofer,et al.  Evolving fuzzy classifiers using different model architectures , 2008, Fuzzy Sets Syst..

[52]  R. DeVore,et al.  Compressed sensing and best k-term approximation , 2008 .

[53]  John Yen,et al.  Improving the interpretability of TSK fuzzy models by combining global learning and local learning , 1998, IEEE Trans. Fuzzy Syst..

[54]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[55]  E. Candès,et al.  Sparsity and incoherence in compressive sampling , 2006, math/0611957.

[56]  Hazem Tawfik,et al.  Handoff algorithms based on fuzzy classifiers , 2000, IEEE Trans. Veh. Technol..

[57]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[58]  H. Ishibuchi Genetic fuzzy systems: evolutionary tuning and learning of fuzzy knowledge bases , 2004 .

[59]  K.M. Alexiev,et al.  Improved fuzzy clustering for identification of Takagi-Sugeno model , 2004, 2004 2nd International IEEE Conference on 'Intelligent Systems'. Proceedings (IEEE Cat. No.04EX791).

[60]  Francisco Herrera,et al.  Ten years of genetic fuzzy systems: current framework and new trends , 2004, Fuzzy Sets Syst..