A New Fuzzy Classifier for Data Streams

Along with technological developments we observe an increasing amount of stored and processed data. It is not possible to store all incoming data and analyze it on the fly. Therefore many researchers are working on new algorithms for data stream mining. New algorithm should be fast and should use a small amount of memory. We will consider the problem of data stream classification. To increase the accuracy we propose to use an ensemble of classifiers based on a modified FID3 algorithm. The experimental results show that this algorithm is fast and accurate. Therefore it is adequate tool for data stream classification.

[1]  Charu C. Aggarwal,et al.  Data Streams - Models and Algorithms , 2014, Advances in Database Systems.

[2]  Jacek M. Zurada,et al.  Artificial Intelligence and Soft Computing, 10th International Conference, ICAISC 2010, Zakopane, Poland, June 13-17, 2010, Part I , 2010, International Conference on Artificial Intelligence and Soft Computing.

[3]  R. Nowicki Nonlinear modelling and classification based on the MICOG defuzzification , 2009 .

[4]  Richard Brendon Kirkby,et al.  Improving Hoeffding Trees , 2007 .

[5]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[6]  Zhi-Hua Zhou,et al.  Emerging Technologies in Knowledge Discovery and Data Mining, PAKDD 2007, International Workshops, Nanjing, China, May 22-25, 2007, Revised Selected Papers , 2007, PAKDD Workshops.

[7]  L. Rutkowski Non-parametric learning algorithms in time-varying environments☆ , 1989 .

[8]  Carlo Zaniolo,et al.  An Adaptive Nearest Neighbor Classification Algorithm for Data Streams , 2005, PKDD.

[9]  Leszek Rutkowski,et al.  Neural Networks and Soft Computing , 2003 .

[10]  Rosa Maria Valdovinos,et al.  New Applications of Ensembles of Classifiers , 2003, Pattern Analysis & Applications.

[11]  Zhoujun Li,et al.  An Incremental Fuzzy Decision Tree Classification Method for Mining Data Streams , 2007, MLDM.

[12]  Mohamed Medhat Gaber,et al.  On-board Mining of Data Streams in Sensor Networks , 2005 .

[13]  Janusz T. Starczewski,et al.  Interval Type 2 Neuro-Fuzzy Systems Based on Interval Consequents , 2003 .

[14]  Leszek Rutkowski,et al.  A general approach to neuro-fuzzy systems , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[15]  Qin Ding,et al.  k-nearest Neighbor Classification on Spatial Data Streams Using P-trees , 2002, PAKDD.

[16]  Zhoujun Li,et al.  A New Decision Tree Classification Method for Mining High-Speed Data Streams Based on Threaded Binary Search Trees , 2007, PAKDD Workshops.

[17]  Piotr Duda,et al.  Decision Trees for Mining Data Streams Based on the McDiarmid's Bound , 2013, IEEE Transactions on Knowledge and Data Engineering.

[18]  Rafal Scherer,et al.  Neuro-fuzzy Systems with Relation Matrix , 2010, ICAISC.

[19]  L. Rutkowski,et al.  A neuro-fuzzy controller with a compromise fuzzy reasoning , 2002 .

[20]  Marcin Korytkowski,et al.  Modular Type-2 Neuro-fuzzy Systems , 2007, PPAM.

[21]  Sattar Hashemi,et al.  Flexible decision tree for data stream classification in the presence of concept change, noise and missing values , 2009, Data Mining and Knowledge Discovery.

[22]  Geoff Holmes,et al.  Accurate Ensembles for Data Streams: Combining Restricted Hoeffding Trees using Stacking , 2010, ACML.

[23]  Rafal Scherer Boosting Ensemble of Relational Neuro-fuzzy Systems , 2006, ICAISC.

[24]  Carlo Zaniolo,et al.  Fast and Light Boosting for Adaptive Mining of Data Streams , 2004, PAKDD.

[25]  William Nick Street,et al.  A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[26]  Ryszard Tadeusiewicz,et al.  Artificial Intelligence and Soft Computing - ICAISC 2006, 8th International Conference, Zakopane, Poland, June 25-29, 2006, Proceedings , 2006, International Conference on Artificial Intelligence and Soft Computing.

[27]  Janusz T. Starczewski,et al.  Connectionist Structures of Type 2 Fuzzy Inference Systems , 2001, PPAM.

[28]  L. Rutkowski Application of multiple Fourier series to identification of multivariable non-stationary systems , 1989 .

[29]  Charu C. Aggarwal,et al.  Data Streams: Models and Algorithms (Advances in Database Systems) , 2006 .

[30]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[31]  Luís Torgo,et al.  Knowledge Discovery in Databases: PKDD 2005, 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, Porto, Portugal, October 3-7, 2005, Proceedings , 2005, PKDD.

[32]  Leszek Rutlowski Sequential pattern recognition procedures derived from multiple Fourier series , 1988 .

[33]  Geoff Holmes,et al.  New ensemble methods for evolving data streams , 2009, KDD.

[34]  L. Rutkowski Real-time identification of time-varying systems by non-parametric algorithms based on Parzen kernels , 1985 .

[35]  R. Nedunchezhian,et al.  Minig rules of concept drift using genetic algorithm , 2011 .

[36]  I. Hatono,et al.  Fuzzy decision trees by fuzzy ID3 algorithm and its application to diagnosis systems , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.