A novel approach to unsupervised robust clustering using genetic niching

We present a new unsupervised robust clustering algorithm that can successfully find dense areas (clusters) in feature space and determine their number. The clustering problem is converted to a multimodal function optimization problem within the context of genetic niching. The niche peaks, which constitute the final cluster centers, are identified based on deterministic crowding (DC). The problem of crossover interactions in DC is eliminated by restricting the mating to members of the same niche only. Finally, the correct number of niche peaks or cluster centers is extracted from the final population. Genetic optimization makes our approach much less prone to suboptimal solutions than other objective function based approaches, and frees it from the necessity of an analytical derivation of the prototypes. As a result, our approach can handle a vast array of general subjective, even non-metric dissimilarities, and is thus useful in many applications such as Web and data mining. Additionally, the use of robust weights makes it less sensitive to the presence of noise than most traditional unsupervised clustering techniques.

[1]  James C. Bezdek,et al.  Genetic algorithm guided clustering , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[2]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .

[3]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[4]  Raghu Krishnapuram,et al.  Fitting an unknown number of lines and planes to image data through compatible cluster merging , 1992, Pattern Recognit..

[5]  Kenneth Alan De Jong,et al.  An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[6]  Olfa Nasraoui,et al.  Clustering using a genetic fuzzy least median of squares algorithm , 1997, 1997 Annual Meeting of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.97TH8297).

[7]  Michael J. Shaw,et al.  Genetic algorithms with dynamic niche sharing for multimodal function optimization , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[8]  Hichem Frigui,et al.  Clustering by competitive agglomeration , 1997, Pattern Recognit..

[9]  Jean-Michel Jolion,et al.  Robust Clustering with Applications in Computer Vision , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[11]  Olfa Nasraoui,et al.  A robust estimator based on density and scale optimization and its application to clustering , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[12]  Samir W. Mahfoud Crowding and Preselection Revisited , 1992, PPSN.