Integrating multiple biomedical resources for protein complex prediction

Prediction of protein complexes from protein-protein interaction (PPI) networks is crucial to unraveling the principles of cellular organization. Most existing approaches only exploit high-throughput experimental PPI data to predict protein complexes. In this paper, we integrate the multiple biomedical resources for protein complex prediction by constructing attributed PPI networks, which include high-throughput data, co-expression data, genomic data, text mining data and gene ontology data. Multiple biomedical resources are complementary in attributed PPI networks. We propose a novel approach called IMBP based on attributed PPI networks. IMBP can effectively learn the degree of contributions of different biomedical resource for complex prediction. The experimental results show that IMBP can make good use of multiple biomedical data and achieve state-of-the-art performance.

[1]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[2]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[3]  Min Wu,et al.  A core-attachment based method to detect protein complexes in PPI networks , 2009, BMC Bioinformatics.

[4]  Tao Jiang,et al.  A max-flow based approach to the identification of protein complexes using protein interaction and microarray data. , 2008, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[5]  Guimei Liu,et al.  Complex discovery from weighted PPI networks , 2009, Bioinform..

[6]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[7]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[8]  Akira Tanaka,et al.  The worst-case time complexity for generating all maximal cliques and computational experiments , 2006, Theor. Comput. Sci..

[9]  Mohammed J. Zaki,et al.  Mining Attribute-structure Correlated Patterns in Large Attributed Graphs , 2012, Proc. VLDB Endow..

[10]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[11]  S. Pu,et al.  Up-to-date catalogues of yeast protein complexes , 2008, Nucleic acids research.

[12]  Gang Chen,et al.  Modifying the DPClus algorithm for identifying protein complexes based on new topological structures , 2008, BMC Bioinformatics.