A new multimembership clustering method

Clustering method is one of the most important tools in statistics. In a graph theory model, clustering is the process of finding all dense subgraphs. In this paper, a new clustering method is introduced. One of the most significant differences between the new method and other existing methods is that this new method constructs a much smaller hierarchical tree, which clearly highlights meaningful clusters. Another important feature of the new method is the feature of overlapping clustering or multi-membership. The property of multi-membership is a concept that has recently received increased attention in the literature (Palla, Derenyi, Farkas and Vicsek, (Nature 2005); Pereira-Leal, Enright and Ouzounis, (Bioinformatics, 2004); Futschik and Carlisle, (J. Bioinformatics and Computational Biology 2005))

[1]  Ying Xu,et al.  Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees , 2002, Bioinform..

[2]  W. Härdle,et al.  Applied Multivariate Statistical Analysis , 2003 .

[3]  Anton J. Enright,et al.  Detection of functional modules from protein interaction networks , 2003, Proteins.

[4]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[5]  J. A. Bondy,et al.  Graph Theory with Applications , 1978 .

[6]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[7]  D. West Introduction to Graph Theory , 1995 .

[8]  Pierre Hansen,et al.  Cluster analysis and mathematical programming , 1997, Math. Program..

[9]  Cun-Quan Zhang,et al.  A NEW CLUSTERING METHOD AND ITS APPLICATION TO PROTEOMIC PROFILING FOR COLON CANCER. , 2006, IASTED International Conference on Computational and Systems Biology : November 13-14, 2006, Dallas, Texas, USA.

[10]  Rainer Fuchs,et al.  Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters , 2001, Bioinform..

[11]  Reinhard Diestel,et al.  Graph Theory , 1997 .

[12]  Brian Everitt,et al.  Cluster analysis , 1974 .

[13]  Matthias E. Futschik,et al.  Noise-robust Soft Clustering of Gene Expression Time-course Data , 2005, J. Bioinform. Comput. Biol..