Evolving local means method for clustering of streaming data

A new on-line evolving clustering approach for streaming data is proposed in this paper. The approach is based on the concept that local mean of samples within a region has the highest density and the gradient of the density points towards the local mean. The algorithm merely requires recursive calculation of local mean and variance, due to which it easily meets the memory and time constraints for data stream processing. The experimental results using synthetic and benchmark datasets show that the proposed approach attains results at par with offline approach and is comparable to popular density-based mean-shift clustering yet it is significantly more efficient being one-pass and non-iterative.

[1]  Plamen P. Angelov,et al.  Fuzzily Connected Multimodel Systems Evolving Autonomously From Data Streams , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Jack-Gérard Postaire,et al.  Mode Detection by Relaxation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Bohyung Han,et al.  Sequential Kernel Density Approximation and Its Application to Real-Time Visual Tracking , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Plamen Angelov,et al.  Evolving Intelligent Systems: Methodology and Applications , 2010 .

[5]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[7]  Hongbin Wang,et al.  Highly efficient incremental estimation of Gaussian mixture models for online data stream clustering , 2005, SPIE Defense + Commercial Sensing.

[8]  Justus H. Piater,et al.  Online Learning of Gaussian Mixture Models - a Two-Level Approach , 2008, VISAPP.

[9]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[10]  Michel Herbin,et al.  A clustering method based on the estimation of the probability density function and on the skeleton by influence zones. Application to image processing , 1996, Pattern Recognit. Lett..

[11]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Plamen P. Angelov,et al.  Evolving fuzzy systems for data streams: a survey , 2011, WIREs Data Mining Knowl. Discov..