Noise Clustering via Dynamic Data Assigning Assessment

A new clustering algorithm that identifies clusters step by step is introduced. It is based on the principles of noise clustering dividing the data set into a good cluster and the remaining data that might contain only noise or also other clusters. The algorithm can be applied to finding just a few substructures (clusters), but also as an iterative method to data partition including the identification of the number of clusters and noise data. The algorithm is applicable in terms of both hard and fuzzy clustering techniques. An extended variant of the algorithm is developed in order to solve the problem of determining clusters that are sub-clusters of a certain separated cluster. The algorithm is applied to a gene expression data set and finds groups of coregulated genes.

[1]  Rajesh N. Davé,et al.  Robust clustering methods: a unified view , 1997, IEEE Trans. Fuzzy Syst..

[2]  Kenneth G. Manton,et al.  Fuzzy Cluster Analysis , 2005 .

[3]  Isak Gath,et al.  Unsupervised Optimal Fuzzy Clustering , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Rajesh N. Davé,et al.  Characterization and detection of noise in clustering , 1991, Pattern Recognit. Lett..

[5]  Frank Klawonn,et al.  Adaptation of Cluster Sizes in Objective Function Based Fuzzy Clustering Technology , 2002 .

[6]  J. C. Peters,et al.  Fuzzy cluster analysis--a new method to predict future cardiac events in patients with positive stress tests. , 1998, Japanese circulation journal.

[7]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[8]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[9]  James C. Bezdek,et al.  A Convergence Theorem for the Fuzzy ISODATA Clustering Algorithms , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.