A core-attachment based method to detect protein complexes in PPI networks

BackgroundHow to detect protein complexes is an important and challenging task in post genomic era. As the increasing amount of protein-protein interaction (PPI) data are available, we are able to identify protein complexes from PPI networks. However, most of current studies detect protein complexes based solely on the observation that dense regions in PPI networks may correspond to protein complexes, but fail to consider the inherent organization within protein complexes.ResultsTo provide insights into the organization of protein complexes, this paper presents a novel core-attachment based method (COACH) which detects protein complexes in two stages. It first detects protein-complex cores as the "hearts" of protein complexes and then includes attachments into these cores to form biologically meaningful structures. We evaluate and analyze our predicted protein complexes from two aspects. First, we perform a comprehensive comparison between our proposed method and existing techniques by comparing the predicted complexes against benchmark complexes. Second, we also validate the core-attachment structures using various biological evidence and knowledge.ConclusionOur proposed COACH method has been applied on two different yeast PPI networks and the experimental results show that COACH performs significantly better than the state-of-the-art techniques. In addition, the identified complexes with core-attachment structures are demonstrated to match very well with existing biological knowledge and thus provide more insights for future biological study.

[1]  Limsoon Wong,et al.  Using Indirect protein-protein Interactions for protein Complex Prediction , 2008, J. Bioinform. Comput. Biol..

[2]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[3]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[4]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[5]  Stefan Hohmann,et al.  Composition and Functional Analysis of the Saccharomyces cerevisiae Trehalose Synthase Complex* , 1998, The Journal of Biological Chemistry.

[6]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[7]  Igor Jurisica,et al.  Protein complex prediction via cost-based clustering , 2004, Bioinform..

[8]  H. Y. Liu,et al.  Characterization of CAF4 and CAF16 Reveals a Functional Connection between the CCR4-NOT Complex and a Subset of SRB Proteins of the RNA Polymerase II Holoenzyme* , 2001, The Journal of Biological Chemistry.

[9]  P. Bork,et al.  Structure-Based Assembly of Protein Complexes in Yeast , 2004, Science.

[10]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[11]  Anastasios Bezerianos,et al.  Growing functional modules from a seed protein via integration of protein interaction and gene expression data , 2007, BMC Bioinformatics.

[12]  Dong-Soo Han,et al.  Protein complex prediction based on mutually exclusive interactions in protein interaction network. , 2008, Genome informatics. International Conference on Genome Informatics.

[13]  Mong-Li Lee,et al.  Increasing confidence of protein interactomes using network topological metrics , 2006, Bioinform..

[14]  Ozlem Keskin,et al.  Architectures and functional coverage of protein-protein interfaces. , 2008, Journal of molecular biology.

[15]  Kara Dolinski,et al.  Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO) , 2002, Nucleic Acids Res..

[16]  Yoshihide Hayashizaki,et al.  Construction of reliable protein-protein interaction networks with a new interaction generality measure , 2003, Bioinform..

[17]  W. Wickner,et al.  A Ypt/Rab effector complex containing the Sec1 homolog Vps33p is required for homotypic vacuole fusion. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[18]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[19]  L. Mirny,et al.  Protein complexes and functional modules in molecular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[20]  D. Balciunas,et al.  Three subunits of the RNA polymerase II mediator complex are involved in glucose repression. , 1995, Nucleic acids research.

[21]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[22]  See-Kiong Ng,et al.  Discovering protein complexes in dense reliable neighborhoods of protein interaction networks. , 2007, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[23]  Markus Schwarz,et al.  Yeast oligosaccharyltransferase consists of two functionally distinct sub‐complexes, specified by either the Ost3p or Ost6p subunit , 2005, FEBS letters.

[24]  A. Barabasi,et al.  Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae. , 2003, Genome research.

[25]  Shigehiko Kanaya,et al.  Development and implementation of an algorithm for detection of protein complexes in large interaction networks , 2006, BMC Bioinformatics.

[26]  Kelly Jm,et al.  Carbon catabolite repression. , 1994 .

[27]  Dongsoo Han,et al.  PROTEIN COMPLEX PREDICTION BASED ON MUTUALLY EXCLUSIVE INTERACTIONS IN PROTEIN INTERACTION NETWORK , 2008 .

[28]  Siu-Ming Yiu,et al.  Predicting Protein Complexes from PPI Data: A Core-Attachment Approach , 2009, J. Comput. Biol..

[29]  P. Sung,et al.  Nucleotide Excision Repair in Yeast Is Mediated by Sequential Assembly of Repair Factors and Not by a Pre-assembled Repairosome (*) , 1996, The Journal of Biological Chemistry.

[30]  D. Goldberg,et al.  Assessing experimentally derived interactions in a small world , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[31]  See-Kiong Ng,et al.  Biological Data Mining in Protein Interaction Networks , 2009 .

[32]  J. Gancedo Yeast Carbon Catabolite Repression , 1998, Microbiology and Molecular Biology Reviews.

[33]  Anton J. Enright,et al.  Detection of functional modules from protein interaction networks , 2003, Proteins.

[34]  A. Barabasi,et al.  Functional and topological characterization of protein interaction networks , 2004, Proteomics.

[35]  Gabriele Varani,et al.  The Cbf5–Nop10 complex is a molecular bracket that organizes box H/ACA RNPs , 2005, Nature Structural &Molecular Biology.

[36]  Limsoon Wong,et al.  Exploiting Indirect Neighbours and Topological Weight to Predict Protein Function from Protein-Protein Interactions , 2006, BioDM.

[37]  A. van Dorsselaer,et al.  The GPI transamidase complex of Saccharomyces cerevisiae contains Gaa1p, Gpi8p, and Gpi16p. , 2001, Molecular biology of the cell.

[38]  B. Séraphin,et al.  The tandem affinity purification (TAP) method: a general procedure of protein complex purification. , 2001, Methods.

[39]  Yoshihide Hayashizaki,et al.  Interaction generality, a measurement to assess the reliability of a protein-protein interaction. , 2002, Nucleic acids research.

[40]  Nagiza F. Samatova,et al.  From pull-down data to protein interaction networks and complexes with biological relevance. , 2008, Bioinformatics.

[41]  John R Yates,et al.  A Subset of TAFIIs Are Integral Components of the SAGA Complex Required for Nucleosome Acetylation and Transcriptional Stimulation , 1998, Cell.

[42]  Limsoon Wong,et al.  Using indirect protein-protein interactions for protein complex predication. , 2007, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[43]  P. Sorger,et al.  The yeast DASH complex forms closed rings on microtubules , 2005, Nature Structural &Molecular Biology.

[44]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[45]  Chee Keong Kwoh,et al.  Algorithms for Detecting Protein Complexes in PPI Networks: An Evaluation Study , 2008 .

[46]  Jacques van Helden,et al.  Evaluation of clustering algorithms for protein-protein interaction networks , 2006, BMC Bioinformatics.

[47]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[48]  D. Stillman,et al.  Spt16–Pob3 and the HMG protein Nhp6 combine to form the nucleosome‐binding factor SPN , 2001, The EMBO journal.

[49]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[50]  S. Emr,et al.  A novel RING finger protein complex essential for a late step in protein transport to the yeast vacuole. , 1997, Molecular biology of the cell.

[51]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Gary D Bader,et al.  A Combined Experimental and Computational Strategy to Define Protein Interaction Networks for Peptide Recognition Modules , 2001, Science.

[53]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[54]  Jiawei Han,et al.  Mining coherent dense subgraphs across massive biological networks for functional discovery , 2005, ISMB.

[55]  Igor Jurisica,et al.  Functional topology in a network of protein interactions , 2004, Bioinform..

[56]  Caroline C. Friedel,et al.  Bootstrapping the Interactome: Unsupervised Identification of Protein Complexes in Yeast , 2008, RECOMB.