Convergence analysis of the batch gradient-based neuro-fuzzy learning algorithm with smoothing L1/2 regularization for the first-order Takagi-Sugeno system

Abstract It has been proven that Takagi–Sugeno systems are universal approximators, and they are applied widely to classification and regression problems. The main challenges of these models are convergence analysis and their computational complexity due to the large number of connections and the pruning of unnecessary parameters. The neuro-fuzzy learning algorithm involves two tasks: generating comparable sparse networks and training the parameters. In addition, regularization methods have attracted increasing attention for network pruning, particularly the L q ( 0 q 1 ) regularizer after L 1 regularization, which can obtain better solutions to sparsity problems. The L 1 / 2 regularizer has a specific sparsity capacity and it is representative of L q ( 0 q 1 ) regularizations. However, the nonsmoothness of the L 1 / 2 regularizer may lead to oscillations in the learning process. In this study, we propose a gradient-based neuro-fuzzy learning algorithm with a smoothing L 1 / 2 regularization for the first-order Takagi–Sugeno fuzzy inference system. The proposed approach has the following three advantages: (i) it enhances the original L 1 / 2 regularizer by eliminating the oscillation of the gradient in the cost function during the training; (ii) it performs better by pruning inactive connections, where the number of the redundant connections for removal is higher than that generated by the original L 1 / 2 regularizer, while it is also implemented by simultaneous structure and parameter learning processes; and (iii) it is possible to demonstrate the theoretical convergence analysis of this learning method, which we focus on explicitly. We also provide a series of simulations to demonstrate that the smoothing L 1 / 2 regularization can often obtain more compressive representations than the current L 1 / 2 regularization.

[1]  Kerim Guney,et al.  Genetic tracker with adaptive neuro-fuzzy inference system for multiple target tracking , 2008, Expert Syst. Appl..

[2]  Xiang Li,et al.  An Online Self-Organizing Scheme for Parsimonious and Accurate Fuzzy Neural Networks , 2010, Int. J. Neural Syst..

[3]  Sami Ekici,et al.  An adaptive neuro-fuzzy inference system (ANFIS) model for wire-EDM , 2009, Expert Syst. Appl..

[4]  Stefka Stoeva,et al.  A fuzzy backpropagation algorithm , 2000, Fuzzy Sets Syst..

[5]  Junfei Qiao,et al.  Nonlinear Systems Modeling Based on Self-Organizing Fuzzy-Neural-Network With Adaptive Computation Algorithm , 2014, IEEE Transactions on Cybernetics.

[6]  Kazumi Saito,et al.  Second-Order Learning Algorithm with Squared Penalty Term , 1996, Neural Computation.

[7]  Rozaida Ghazali,et al.  THE EFFECT OF NETWORK PARAMETERS ON PI-SIGMA NEURAL NETWORK FOR TEMPERATURE FORECASTING , 2012 .

[8]  Chang-Hyun Kim,et al.  Evolving Compact and Interpretable Takagi–Sugeno Fuzzy Models With a New Encoding Scheme , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[10]  Hiok Chai Quek,et al.  FITSK: online local learning with generic fuzzy input Takagi-Sugeno-Kang fuzzy framework for nonlinear system estimation , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Xiaoou Li,et al.  Fuzzy identification using fuzzy neural networks with stable learning algorithms , 2004, IEEE Transactions on Fuzzy Systems.

[12]  Jiao Luo,et al.  Prediction of the mechanical properties of the post-forged Ti–6Al–4V alloy using fuzzy neural network , 2010 .

[13]  Zongben Xu Data Modeling: Visual Psychology Approach and L1/2 Regularization Theory , 2011 .

[14]  Özgür Kisi,et al.  Adaptive neuro-fuzzy computing technique for suspended sediment estimation , 2009, Adv. Eng. Softw..

[15]  Long Li,et al.  A modified gradient-based neuro-fuzzy learning algorithm and its convergence , 2010, Inf. Sci..

[16]  C. Xiang Robust regularization theory based on L_q (0, 2010 .

[17]  Javier Echanobe,et al.  Efficient Hardware/Software Implementation of an Adaptive Neuro-Fuzzy System , 2008, IEEE Transactions on Fuzzy Systems.

[18]  Zhan-Li Sun,et al.  A Neuro-Fuzzy Inference System Through Integration of Fuzzy Logic and Extreme Learning Machines , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[19]  M. Sugeno,et al.  Multi-dimensional fuzzy reasoning , 1983 .

[20]  Robert E. Hampson,et al.  Identification of sparse neural functional connectivity using penalized likelihood estimation and basis functions , 2013, Journal of Computational Neuroscience.

[21]  Abir Jaafar Hussain,et al.  Non-stationary and stationary prediction of financial time series using dynamic ridge polynomial neural network , 2009, Neurocomputing.

[22]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[23]  Yue Wang,et al.  Using the multi-living agent concept to investigate complex information systems , 2008, Science in China Series F: Information Sciences.

[24]  F. Sheikholeslam,et al.  Fuzzy Wavelet Neural Network With an Accelerated Hybrid Learning Algorithm , 2012, IEEE Transactions on Fuzzy Systems.

[25]  Yaochu Jin,et al.  Fuzzy modeling of high-dimensional systems: complexity reduction and interpretability improvement , 2000, IEEE Trans. Fuzzy Syst..

[26]  Joydeep Ghosh,et al.  The pi-sigma network: an efficient higher-order neural network for pattern classification and function approximation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[27]  Emil Levi,et al.  Identification of complex systems based on neural and Takagi-Sugeno fuzzy model , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28]  W. Pedrycz An identification algorithm in fuzzy relational systems , 1984 .

[29]  Nikhil R. Pal,et al.  A neuro-fuzzy scheme for simultaneous feature selection and fuzzy rule-based classification , 2004, IEEE Transactions on Neural Networks.

[30]  George W. Irwin,et al.  Improving neural network training solutions using regularisation , 2001, Neurocomputing.

[31]  D.P. Filev,et al.  An approach to online identification of Takagi-Sugeno fuzzy models , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  Nikhil R. Pal,et al.  Simultaneous Structure Identification and Fuzzy Rule Generation for Takagi–Sugeno Models , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).