A Family of the Online Distance-Based Classifiers

In this paper a family of algorithms for the online learning and classification is considered. These algorithms work in rounds, where at each round a new instance is given and the algorithm makes a prediction. After the true class of the instance is revealed, the learning algorithm updates its internal hypothesis. The proposed algorithms are based on fuzzy C-means clustering followed by calculation of distances between cluster centroids and the incoming instance for which the class label is to be predicted. Simple distance-based classifiers thus obtained serve as basic classifiers for the implemented rotation forest kernel. The proposed approach is validated experimentally. Experiment results show that proposed classifiers perform well against competitive approaches.

[1]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[2]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[3]  William Nick Street,et al.  A streaming ensemble algorithm (SEA) for large-scale classification , 2001, KDD '01.

[4]  K. Wisaeng A Comparison of Different Classification Techniques for Bank Direct Marketing , 2013 .

[5]  Daijin Kim,et al.  Adaptive active appearance model with incremental learning , 2009, Pattern Recognit. Lett..

[6]  LastMark Online classification of nonstationary data streams , 2002 .

[7]  Indre Zliobaite,et al.  Controlled Permutations for Testing Adaptive Classifiers , 2011, Discovery Science.

[8]  Mohamed Medhat Gaber,et al.  Data Stream Mining , 2010, Data Mining and Knowledge Discovery Handbook.

[9]  Lei Wang,et al.  Fuzzy Passive-Aggressive classification: A robust and efficient algorithm for online classification problems , 2013, Inf. Sci..

[10]  Alneu de Andrade Lopes,et al.  An incremental learning algorithm based on the K-associated graph for non-stationary data classification , 2013, Inf. Sci..

[11]  João Gama,et al.  Learning from Data Streams , 2009, Encyclopedia of Data Warehousing and Mining.

[12]  Tadeusz M. Szuba,et al.  Computational Collective Intelligence , 2001, Lecture Notes in Computer Science.

[13]  Joanna Jedrzejowicz,et al.  Cellular GEP-Induced Classifiers , 2010, ICCCI.

[14]  Gregory Ditzler,et al.  Incremental Learning of Concept Drift from Streaming Imbalanced Data , 2013, IEEE Transactions on Knowledge and Data Engineering.

[15]  Motoaki Kawanabe,et al.  On-line learning in changing environments with applications in supervised and unsupervised learning , 2002, Neural Networks.

[16]  Joanna Jedrzejowicz,et al.  Online Classifiers Based on Fuzzy C-means Clustering , 2013, ICCCI.

[17]  O. P. Vyas,et al.  Data Stream Mining: A Review on Windowing Approach , 2012 .

[18]  Joanna Jedrzejowicz,et al.  Rotation Forest with GEP-Induced Expression Trees , 2011, KES-AMSTA.

[19]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.