Dynamically Searching for a Domain for Protein Function Prediction

The availability of large amounts of protein-protein interaction (PPI) data makes it feasible to predict protein functions computationally. Existing computational approaches exploit the known functions of annotated proteins in the PPI data to predict the functions of un-annotated proteins. However, these approaches treat the prediction domain (i.e., the set of proteins from which functions are predicted) as fixed throughout the prediction procedure. As a result, valuable information can be overwhelmed by the unavoidable noise in PPI data, which in turn distorts the prediction results. In this paper, we propose a novel method that predicts protein functions from PPI data dynamically. Our method treats function prediction as a dynamic process of finding a suitable prediction domain, from which representative functions are selected to predict the functions of un-annotated proteins. It exploits the topological structure of the PPI network and the semantic relationships between protein functions to measure the relationship between proteins, dynamically select a suitable prediction domain, and predict functions. Evaluation on real PPI datasets demonstrated the effectiveness of the proposed method and showed that it generates better prediction results.
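To make the idea of a dynamically selected prediction domain concrete, the following is a minimal illustrative sketch and not the method proposed in this paper, whose details are not given in the abstract. It assumes a toy setting: the PPI network is a dictionary of neighbour sets, topological similarity is the Jaccard index of closed neighbourhoods, and a candidate protein is admitted to the domain only if the most frequent function in the domain keeps a support of at least `min_support`. The names `jaccard`, `predict_functions`, and `min_support` are hypothetical, and the semantic relationship between functions is not modelled here.

```python
from collections import Counter


def jaccard(a, b):
    """Jaccard index of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def predict_functions(protein, graph, annotations, min_support=0.6):
    """Grow a prediction domain for `protein` and vote on its functions.

    graph:       dict mapping each protein to the set of its interaction partners
    annotations: dict mapping annotated proteins to sets of function labels
    min_support: assumed threshold; not a parameter taken from the paper
    """
    own = graph[protein] | {protein}

    # Candidates: annotated proteins within two hops of the target.
    candidates = set()
    for neighbour in graph[protein]:
        candidates.add(neighbour)
        candidates |= graph[neighbour]
    candidates.discard(protein)

    # Rank candidates by topological similarity (ties broken by name).
    ranked = sorted(
        (c for c in candidates if annotations.get(c)),
        key=lambda c: (-jaccard(own, graph[c] | {c}), c),
    )

    # Admit a candidate only if the domain stays functionally coherent;
    # otherwise treat it as noise and leave the domain unchanged.
    domain = []
    for candidate in ranked:
        trial = domain + [candidate]
        counts = Counter(f for p in trial for f in annotations[p])
        coherence = counts.most_common(1)[0][1] / len(trial)
        if coherence >= min_support:
            domain = trial

    if not domain:
        return domain, set()

    # Representative functions: those supported by enough domain members.
    counts = Counter(f for p in domain for f in annotations[p])
    predicted = {f for f, n in counts.items() if n / len(domain) >= min_support}
    return domain, predicted


# Toy network: protein "A" is un-annotated.
graph = {
    "A": {"B", "C", "D"},
    "B": {"A", "C"},
    "C": {"A", "B"},
    "D": {"A", "E"},
    "E": {"D"},
}
annotations = {"B": {"f1"}, "C": {"f1", "f2"}, "D": {"f3"}, "E": {"f3"}}
print(predict_functions("A", graph, annotations))
```

In this toy network the indirect neighbour `E` is rejected because admitting it would dilute the dominant function below the threshold, which is the kind of noise filtering a fixed prediction domain cannot perform.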
