Approximate and Spectral Clustering for Network and Affinity Data

This chapter is devoted to clustering similarity, graph and network data – these are represented by square matrices rather than rectangular ones. This chapter describes methods for finding a cluster or two-cluster split combining three types of approaches from both old and recent developments: (a)combinatorial approach that is oriented at clustering as optimization of some reasonable measure of cluster homogeneity, (b)additive clustering approach that is based on a data recovery model at which the data is decoded from a cluster structure to be found by minimizing the discrepancy between them and observed similarities, and (c)spectral clustering approach exploiting the machinery of matrix eigenvalues and eigenvectors as a relaxation of combinatorial problems for similarity clustering.

[1]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[2]  Boris Mirkin,et al.  Mathematical Classification and Clustering , 1996 .

[3]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Stephen Guattery,et al.  On the Quality of Spectral Separators , 1998, SIAM J. Matrix Anal. Appl..

[6]  Alex S. Taylor,et al.  Machine intelligence , 2009, CHI.

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  B. Mirkin Additive clustering and qualitative factor analysis methods for similarity matrices , 1989 .

[9]  George Loizou,et al.  Similarity clustering of proteins using substantive knowledge and reconstruction of evolutionary gene histories in herpesvirus , 2010 .

[10]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..