Block spectral clustering for multiple graphs with inter-relation

Clustering methods for multiple graphs explore and exploit multiple graphs simultaneously to obtain a more accurate and robust partition of the data than that using single graph clustering methods. In this paper, we study the clustering of multiple graphs with inter-relation among vertices in different graphs. The main contribution is to propose and develop a block spectral clustering method for multiple graphs with inter-relation. Our idea is to construct a block Laplacian matrix for multiple graphs and make use of its eigenvectors to perform clustering very efficiently. Global optimal solutions are obtained in the proposed method and they are solutions of relaxation of multiple graphs ratio cut and normalized cut problems. In contrast, existing clustering methods cannot guarantee optimal solutions and their solutions are dependent on initial guesses. Experimental results on both synthetic and real-world data sets are given to demonstrate that the clustering accuracy achieved and computational time required by the proposed block clustering method are better than those by the testing clustering methods in the literature.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Suvrit Sra,et al.  Approximation Algorithms for Tensor Clustering , 2009, ALT.

[3]  Bart De Moor,et al.  Multiview Partitioning via Tensor Methods , 2013, IEEE Transactions on Knowledge and Data Engineering.

[4]  R. Kreienberg,et al.  Expression Profiling of Mammary Carcinoma Cell Lines: Correlation of in vitro Invasiveness with Expression of CD24 , 2002, Tumor Biology.

[5]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[6]  Pauline M. Rudd,et al.  Glycomic and glycoproteomic analysis of serum from patients with stomach cancer reveals potential markers arising from host defense response mechanisms. , 2011, Journal of proteome research.

[7]  Steffen Bickel,et al.  Multi-view clustering , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[8]  Bharti Odhav,et al.  Immune responses in cancer. , 2003, Pharmacology & therapeutics.

[9]  Yunming Ye,et al.  MultiRank: co-ranking for objects and relations in multi-relational data , 2011, KDD.

[10]  Alberto Mantovani,et al.  Cancer-related inflammation, the seventh hallmark of cancer: links to genetic instability. , 2009, Carcinogenesis.

[11]  Chuanshu Huang,et al.  Inflammation, a Key Event in Cancer Development , 2006, Molecular Cancer Research.

[12]  Sham M. Kakade,et al.  Multi-view clustering via canonical correlation analysis , 2009, ICML '09.

[13]  Philip S. Yu,et al.  Spectral clustering for multi-type relational data , 2006, ICML.

[14]  Hal Daumé,et al.  Co-regularized Multi-view Spectral Clustering , 2011, NIPS.

[15]  Zheng Chen,et al.  Text representation: from vector to tensor , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[16]  T. Jacks,et al.  The cell cycle and cancer. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Pascal Frossard,et al.  Clustering With Multi-Layer Graphs: A Spectral Perspective , 2011, IEEE Transactions on Signal Processing.

[18]  Rosalind Ramsey-Goldman,et al.  Exploring the links between systemic lupus erythematosus and cancer. , 2005, Rheumatic diseases clinics of North America.

[19]  Lloyd J. Old,et al.  Cancer/testis antigens, gametogenesis and cancer , 2005, Nature Reviews Cancer.

[20]  Jiawei Han,et al.  Tensor space model for document analysis , 2006, SIGIR.

[21]  Yang Liu,et al.  Recent duplication and positive selection of the GAGE gene family , 2008, Genetica.

[22]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[23]  Richard H. White,et al.  Cancer risk in a cohort of patients with systemic lupus erythematosus (SLE) in California , 2008, Cancer Causes & Control.

[24]  Jiawei Han,et al.  Mining coherent dense subgraphs across massive biological networks for functional discovery , 2005, ISMB.

[25]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[26]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[27]  Wei Tang,et al.  Clustering with Multiple Graphs , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[28]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[29]  Wei Cheng,et al.  Flexible and robust co-regularized multi-domain graph clustering , 2013, KDD.

[30]  Hal Daumé,et al.  A Co-training Approach for Multi-view Spectral Clustering , 2011, ICML.