A genetic fuzzy system for large-scale regression

In genetic fuzzy systems (GFS) the size of the problem has a huge influence in the performance of the obtained models, since i) the fuzzy rule bases learned suffer from exponential rule explosion when the number of variables increases, and ii) the convergence time increments with the number of examples. In this paper we present S-FRULER, a scalable distributed version of FRULER which is a GFS that learns simple and linguistic TSK-1 knowledge bases for regression problems. S-FRULER obtains models with high accuracy and low complexity, whilst reducing the algorithm runtime. S-FRULER focuses on splitting the problem into smaller partitions and incorporates a feature selection process for reducing the number of variables used in each partition. Each partition is then solved independently using the FRULER algorithm. The Reduce function obtains linguistic TSK fuzzy rule bases from the information generated in each partition. S-FRULER has been validated in terms of scalability, precision and complexity using 10 large-scale datasets and has been compared with three state of the art GFSs. Experimental results show that S-FRULER scales well while achieving simple models with a linguistic approach and a precision comparable with approximative models.

[1]  Sophia Ananiadou,et al.  Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty , 2009, ACL.

[2]  Jerry M. Mendel,et al.  Generating fuzzy rules by learning from examples , 1992, IEEE Trans. Syst. Man Cybern..

[3]  Francisco Herrera,et al.  A taxonomy for the crossover operator for real‐coded genetic algorithms: An experimental study , 2003, Int. J. Intell. Syst..

[4]  Rafael Alcalá,et al.  METSK-HDe: A multiobjective evolutionary algorithm to learn accurate TSK-fuzzy systems in high-dimensional and large-scale regression problems , 2014, Inf. Sci..

[5]  Francisco Herrera,et al.  Genetic fuzzy systems: taxonomy, current research trends and prospects , 2008, Evol. Intell..

[6]  Michela Antonelli,et al.  An efficient multi-objective evolutionary fuzzy system for regression problems , 2013, Int. J. Approx. Reason..

[7]  Manuel Mucientes,et al.  An instance selection algorithm for regression and its application in variance reduction , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[8]  Manuel Mucientes,et al.  Reducing the complexity in genetic learning of accurate regression TSK rule-based systems , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[9]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[10]  María José del Jesús,et al.  KEEL: a software tool to assess evolutionary algorithms for data mining problems , 2008, Soft Comput..

[11]  Francisco Herrera,et al.  A Fast and Scalable Multiobjective Genetic Fuzzy System for Linguistic Fuzzy Modeling in High-Dimensional Regression Problems , 2011, IEEE Transactions on Fuzzy Systems.

[12]  Nicolás García-Pedrajas,et al.  Scaling up data mining algorithms: review and taxonomy , 2012, Progress in Artificial Intelligence.

[13]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[14]  Hisao Ishibuchi,et al.  Performance evaluation of fuzzy partitions with different fuzzification grades , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[15]  Manuel Mucientes,et al.  FRULER: Fuzzy Rule Learning through Evolution for Regression , 2015, Inf. Sci..

[16]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[17]  Jesús Alcalá-Fdez,et al.  Genetic learning of accurate and compact fuzzy rule based systems based on the 2-tuples linguistic representation , 2007, Int. J. Approx. Reason..

[18]  María José del Jesús,et al.  Revisiting Evolutionary Fuzzy Systems: Taxonomy, applications, new trends and challenges , 2015, Knowl. Based Syst..