Functional Flow Simulation Based Analysis of Protein Interaction Network

Protein-protein interactions (PPIs) play fundamental roles in nearly all biological processes and differ based on the composition, affinity and lifetime of the association. A vast amount of PPI data for various organisms is avaiable from MIPS, DIP and other sources. The identification of functional modules in PPI network is of great interest because they often reveal unknown functional ties between proteins and hence predict functions for unknown proteins. In this paper, we propose using functional flow simulation and the topology of the network for the functional module detection and function prediction problem. Our approach is based on the functional influence model that quantifies the influence of a biological component on another. We introduce a flow simulation algorithm to generate a functional profile for each component. In addition, a new clustering method FMD (Functional Module Detection) is designed to associate with functional profiles to detect functional modules. We evaluate the proposed technique on three different yeast networks with MIPS functional categories and compare it with several other existing techniques in terms of precision and recall. Our experiments show that our approach achieves better accuracy than other existing methods.

[1]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[2]  R. Carter 11 – IT and society , 1991 .

[3]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[4]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[5]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[6]  Lei Shi,et al.  ANN Based Protein Function Prediction Using Integrated Protein-Protein Interaction Data , 2009, 2009 International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing.

[7]  D. Bu,et al.  Topological structure analysis of the protein-protein interaction network in budding yeast. , 2003, Nucleic acids research.

[8]  Mona Singh,et al.  Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps , 2005, ISMB.

[9]  Limsoon Wong,et al.  Exploiting Indirect Neighbours and Topological Weight to Predict Protein Function from Protein-Protein Interactions , 2006, BioDM.

[10]  Aidong Zhang,et al.  Protein Interaction Networks: Computational Analysis , 2009 .

[11]  Aidong Zhang,et al.  A novel functional module detection algorithm for protein-protein interaction networks , 2006, Algorithms for Molecular Biology.

[12]  Ignacio Marín,et al.  Iterative Cluster Analysis of Protein Interaction Data , 2005, Bioinform..

[13]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[14]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[15]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[16]  Frank Dudbridge,et al.  The Use of Edge-Betweenness Clustering to Investigate Biological Function in Protein Interaction Networks , 2005, BMC Bioinformatics.

[17]  Aidong Zhang,et al.  A “Seed-Refine” Algorithm for Detecting Protein Complexes From Protein Interaction Data , 2007, IEEE Transactions on NanoBioscience.

[18]  Aidong Zhang,et al.  A topological measurement for weighted protein interaction network , 2005, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).

[19]  Igor Jurisica,et al.  Protein complex prediction via cost-based clustering , 2004, Bioinform..

[20]  Shigehiko Kanaya,et al.  Development and implementation of an algorithm for detection of protein complexes in large interaction networks , 2006, BMC Bioinformatics.

[21]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[22]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[23]  Li Wang,et al.  An integrative approach for causal gene identification and gene regulatory pathway inference , 2006, ISMB.

[24]  ArnauVicente,et al.  Iterative Cluster Analysis of Protein Interaction Data , 2005 .

[25]  T. Takagi,et al.  Assessment of prediction accuracy of protein function from protein–protein interaction data , 2001, Yeast.

[26]  Yonina C. Eldar,et al.  eQED: an efficient method for interpreting eQTL associations using protein networks , 2008, Molecular systems biology.

[27]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[28]  Lei Shi,et al.  Functional module detection by functional flow pattern mining in protein interaction networks , 2008, BMC Bioinformatics.

[29]  Aidong Zhang,et al.  Semantic integration to identify overlapping functional modules in protein interaction networks , 2007, BMC Bioinformatics.

[30]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..