An ACO based functional module detection algorithm for protein interaction networks

Protein-protein interactions (PPIs) play fundamental roles in nearly all biological processes and differ in the composition, affinity and lifetime of the association. A vast amount of PPI data for various organisms is available from MIPS, DIP and other sources. Identifying functional modules in PPI networks is of great interest because such modules often reveal previously unknown functional ties between proteins and can therefore be used to predict the functions of uncharacterized proteins. However, the noise in PPI data and the complexity of the network structure make functional module detection difficult. In this paper, we propose a flexible framework that integrates the topological features of the network with the Ant Colony Optimization (ACO) algorithm to solve the problem. First, we define a reliability measure for protein-protein interactions and use it to rebuild the PPI network. Second, we reformulate the problem as an optimal path detection problem from the perspective of information flow. Finally, we propose an ACO-based functional module detection method that simulates the behavior of ants. We evaluate the proposed technique on the yeast protein-protein interaction network using MIPS functional categories and compare it with several existing techniques. In our experiments, our approach achieves better accuracy than the methods it is compared against.
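The abstract does not specify the reliability measure or the ACO update rules, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical reliability score (Jaccard overlap of closed neighborhoods) and a standard Ant System style transition and pheromone update; the names `reliability`, `ant_walk`, `run_aco` and all parameter values are illustrative assumptions.

```python
import random
from collections import defaultdict


def build_graph(edges):
    """Build adjacency sets for an undirected PPI network."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def reliability(adj, u, v):
    """Hypothetical edge reliability (an assumption, not the paper's measure):
    Jaccard overlap of the closed neighborhoods of u and v."""
    nu, nv = adj[u] | {u}, adj[v] | {v}
    return len(nu & nv) / len(nu | nv)


def ant_walk(adj, weight, tau, start, steps, alpha, beta, rng):
    """One ant walks from `start` without revisiting nodes. The next node is
    drawn with probability proportional to tau^alpha * weight^beta."""
    path, visited = [start], {start}
    for _ in range(steps):
        cur = path[-1]
        options = sorted(n for n in adj[cur] if n not in visited)
        if not options:
            break
        scores = [
            tau[frozenset((cur, n))] ** alpha * weight[frozenset((cur, n))] ** beta
            for n in options
        ]
        nxt = rng.choices(options, weights=scores, k=1)[0]
        path.append(nxt)
        visited.add(nxt)
    return path


def run_aco(edges, start, n_ants=20, n_iter=30, steps=3,
            alpha=1.0, beta=2.0, rho=0.1, seed=0):
    """Return (edge reliabilities, pheromone levels) after n_iter rounds."""
    rng = random.Random(seed)
    adj = build_graph(edges)
    weight = {frozenset(e): reliability(adj, *e) for e in edges}
    tau = {e: 1.0 for e in weight}  # uniform initial pheromone
    for _ in range(n_iter):
        paths = [ant_walk(adj, weight, tau, start, steps, alpha, beta, rng)
                 for _ in range(n_ants)]
        for e in tau:  # evaporation
            tau[e] *= 1.0 - rho
        for p in paths:  # deposit proportional to mean edge reliability
            hops = list(zip(p, p[1:]))
            if not hops:
                continue
            quality = sum(weight[frozenset(h)] for h in hops) / len(hops)
            for h in hops:
                tau[frozenset(h)] += quality
    return weight, tau


if __name__ == "__main__":
    # Toy network: two triangles joined by the bridge C-D.
    toy = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D"),
           ("D", "E"), ("E", "F"), ("D", "F")]
    weight, tau = run_aco(toy, start="A")
    for e in sorted(tau, key=tau.get, reverse=True):
        print(sorted(e), round(weight[e], 3), round(tau[e], 3))
```

In this sketch, edges inside a densely connected group receive a higher reliability score than a bridge between groups, so ants are biased toward staying within a candidate module. How pheromone trails are turned into final modules is left open here because the abstract does not describe that step.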
