Handling drifts and shifts in on-line data streams with evolving fuzzy systems

In this paper, we present new approaches to handling drift and shift in on-line data streams with the help of evolving fuzzy systems (EFS), which are characterized by the fact that their structure (rule base and parameters) is not fixed and not pre-determined, but is extracted from data streams on-line and in an incremental manner. When dealing with so-called drifts and s hifts in data streams, one needs to take into account (1) automatic detection of drifts and shifts and (2) automatic reaction to the drifts and shifts. This is important to avoid interruptions in the learning process and downtrends in predictive accuracy. To address the first problem, we propose an approach based on the concept fuzzy rule age. The second problem is addressed by including gradual forgetting of (1) antecedent parts and (2) consequent parameters. The latter can be achieved by including a forgetting factor in the recursive local learning process of the parameters, whose value is automatically extracted based on the intensity of the shift/drift. For addressing the former problem, we introduce two alternative methods: one is based on the evolving density-based clustering (eClustering) used to form the antecedents in the eTS approach; the other is based on the automatic adaptation of the learning rate of the evolving vector quantization (eVQ) method used to form the antecedent in the FLEXFIS approach. The paper concludes with an empirical evaluation of the impact of the proposed approaches in (on-line) real-world data sets in which drifts and shifts occur.

[1]  Plamen P. Angelov,et al.  Evolving Fuzzy-Rule-Based Classifiers From Data Streams , 2008, IEEE Transactions on Fuzzy Systems.

[2]  Plamen Angelov,et al.  Evolving Fuzzy Modeling Using Participatory Learning , 2010 .

[3]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[4]  Raj K. Bhatnagar,et al.  Tracking recurrent concept drift in streaming data using ensemble classifiers , 2007, ICMLA 2007.

[5]  Edwin Lughofer,et al.  Human–Machine Interaction Issues in Quality Control Based on Online Image Classification , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[6]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[7]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[8]  Nikola K. Kasabov,et al.  DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction , 2002, IEEE Trans. Fuzzy Syst..

[9]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[10]  Guojun Gan,et al.  Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability) , 2007 .

[11]  L. Wang,et al.  Fuzzy systems are universal approximators , 1992, [1992 Proceedings] IEEE International Conference on Fuzzy Systems.

[12]  D. Nauck,et al.  NEFCLASS-X — a Soft Computing Tool to Build Readable Fuzzy Classifiers , 1998 .

[13]  Edwin Lughofer,et al.  An On-Line Interactive Self-adaptive Image Classification Framework , 2008, ICVS.

[14]  Plamen P. Angelov,et al.  Simpl_eTS: a simplified method for learning evolving Takagi-Sugeno fuzzy models , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[15]  Edwin Lughofer,et al.  Improving the robustness of data-driven fuzzy systems with regularization , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[16]  Nikola Kasabov,et al.  Evolving computational intelligence systems , 2005 .

[17]  Robert Babuska,et al.  Fuzzy Modeling for Control , 1998 .

[18]  J. Casillas Interpretability issues in fuzzy modeling , 2003 .

[19]  Edwin Lughofer,et al.  FLEXFIS: A Robust Incremental Learning Approach for Evolving Takagi–Sugeno Fuzzy Models , 2008, IEEE Transactions on Fuzzy Systems.

[20]  Jianhong Wu,et al.  Data clustering - theory, algorithms, and applications , 2007 .

[21]  Ronald R. Yager,et al.  Learning of Fuzzy Rules by Mountain Clustering , 1992 .

[22]  Padraig Cunningham,et al.  A case-based technique for tracking concept drift in spam filtering , 2004, Knowl. Based Syst..

[23]  T. Martin McGinnity,et al.  An approach for on-line extraction of fuzzy rules using a self-organising fuzzy neural network , 2005, Fuzzy Sets Syst..

[24]  Plamen P. Angelov,et al.  Evolving Single- And Multi-Model Fuzzy Classifiers with FLEXFIS-Class , 2007, 2007 IEEE International Fuzzy Systems Conference.

[25]  Sin Chun Ng,et al.  Gradient based variable forgetting factor RLS algorithm , 2003, Signal Process..

[26]  Edwin Lughofer,et al.  Extensions of vector quantization for incremental clustering , 2008, Pattern Recognit..

[27]  Paramasivan Saratchandran,et al.  Sequential Adaptive Fuzzy Inference System (SAFIS) for nonlinear system identification and prediction , 2006, Fuzzy Sets Syst..

[28]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[29]  P. Angelov,et al.  Evolving Fuzzy Systems from Data Streams in Real-Time , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[30]  P. Angelov,et al.  Evolving rule-based models: A tool for intelligent adaptation , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[31]  Ronald R. Yager,et al.  A model of participatory learning , 1990, IEEE Trans. Syst. Man Cybern..

[32]  Thorsten Joachims,et al.  Detecting Concept Drift with Support Vector Machines , 2000, ICML.

[33]  Ralf Klinkenberg,et al.  Learning drifting concepts: Example selection vs. example weighting , 2004, Intell. Data Anal..

[34]  Plamen P. Angelov,et al.  Adaptive Inferential Sensors Based on Evolving Fuzzy Models , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[35]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[36]  Frank Klawonn,et al.  Obtaining interpretable fuzzy models from fuzzy clustering and fuzzy regression , 2000, KES'2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedings (Cat. No.00TH8516).

[37]  E. Lughofer,et al.  Evolving fuzzy classifiers using different model architectures , 2008, Fuzzy Sets Syst..

[38]  Plamen Angelov,et al.  Evolving Rule-Based Models: A Tool For Design Of Flexible Adaptive Systems , 2002 .

[39]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[40]  Eyke Hüllermeier,et al.  Efficient instance-based learning on data streams , 2007, Intell. Data Anal..

[41]  Plamen P. Angelov,et al.  On line learning fuzzy rule-based system structure from data streams , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[42]  Meng Joo Er,et al.  A fast approach for automatic generation of fuzzy rules by generalized dynamic fuzzy neural networks , 2001, IEEE Trans. Fuzzy Syst..

[43]  Alexey Tsymbal,et al.  The problem of concept drift: definitions and related work , 2004 .

[44]  E. Lughofer Process Safety Enhancements for Data-Driven Evolving Fuzzy Models , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[45]  Plamen Angelov,et al.  Evolving Takagi‐Sugeno Fuzzy Systems from Streaming Data (eTS+) , 2010 .

[46]  D.P. Filev,et al.  An approach to online identification of Takagi-Sugeno fuzzy models , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).