Correlating interactions with gene expressions to detect protein complexes in protein interaction networks

In protein-protein interaction networks, proteins combine into macromolecular complexes to execute essential functions in the cells, such as replication, transcription, protein transport. Considering the certain rate of false positive and false negative interactions, we take a confidence probability on interactions and correlate interactions and gene expression data to assign weights to edges in PPI networks. Then we propose the CIGE algorithm to detecting protein complexes from protein interaction networks. Our algorithm takes a maximal full-connected sub-graph as core graph of a seed node, and decides whether a node belongs to a protein complex through judging in-module weight and out-module weight between core graph and nodes out of core graph. Experiment results show that our algorithm has an excellent performance in both accuracy and hit rate.

[1]  Xiaohua Hu,et al.  Topological Analysis and Sub-Network Mining of Protein-Protein Interactions , 2007 .

[2]  Merja Penttilä,et al.  Transcriptional responses of Saccharomyces cerevisiae to shift from respiratory and respirofermentative to fully fermentative metabolism. , 2011, Omics : a journal of integrative biology.

[3]  Matej Oresic,et al.  Systematic construction of gene coexpression networks with applications to human T helper cell differentiation process , 2007, Bioinform..

[4]  Jesús A. Izaguirre,et al.  Belief Propagation Estimation of Protein and Domain Interactions Using the Sum–Product Algorithm , 2010, IEEE Transactions on Information Theory.

[5]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[6]  T. Samuelsson,et al.  Mediator – a universal complex in transcriptional regulation , 2001, Molecular microbiology.

[7]  F. Winston,et al.  Analysis of Spt7 Function in the Saccharomyces cerevisiae SAGA Coactivator Complex , 2002, Molecular and Cellular Biology.

[8]  Tao Jiang,et al.  A max-flow based approach to the identification of protein complexes using protein interaction and microarray data. , 2008, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[9]  Roded Sharan,et al.  BMC Bioinformatics BioMed Central , 2006 .

[10]  Chung-Yen Lin,et al.  A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles , 2010, BMC Bioinformatics.

[11]  S. MacNeill,et al.  Genome Stability: A New Member of the RFC family , 2003, Current Biology.

[12]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[13]  E. Kovacs,et al.  The web and the rock: cell adhesion and the ARP2/3 complex. , 2002, Developmental cell.

[14]  Siu-Ming Yiu,et al.  Unsupervised binning of environmental genomic fragments based on an error robust selection of l-mers , 2009, BMC Bioinformatics.

[15]  J. Locker Transcription factors: essential data. , 1996 .

[16]  Kok-Leong Ong,et al.  Dynamical Systems for Discovering Protein Complexes and Functional Modules from Biological Networks , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  H. Y. Liu,et al.  Characterization of CAF4 and CAF16 Reveals a Functional Connection between the CCR4-NOT Complex and a Subset of SRB Proteins of the RNA Polymerase II Holoenzyme* , 2001, The Journal of Biological Chemistry.

[18]  J. Workman,et al.  The many HATs of transcription coactivators. , 2000, Trends in biochemical sciences.

[19]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[20]  Xiaohua Hu,et al.  Data Mining and Predictive Modeling of Biomolecular Network from Biomedical Literature Databases , 2007, TCBB.

[21]  Chung-Yen Lin,et al.  Hubba: hub objects analyzer—a framework of interactome hubs identification for network biology , 2008, Nucleic Acids Res..

[22]  Aidong Zhang,et al.  A “Seed-Refine” Algorithm for Detecting Protein Complexes From Protein Interaction Data , 2007, IEEE Transactions on NanoBioscience.

[23]  S. Pu,et al.  Up-to-date catalogues of yeast protein complexes , 2008, Nucleic acids research.

[24]  Francis D. Gibbons,et al.  Predicting protein complex membership using probabilistic network reliability. , 2004, Genome research.

[25]  Frederick P. Roth,et al.  Probabilistic Paths for Protein Complex Inference , 2006, Systems Biology and Computational Proteomics.