Prediction of protein functions based on function-function correlation relations

A protein function pair approach, based on protein-protein interaction (PPI) data, is proposed to predict protein functions. Randomization tests are performed on the PPI dataset, which resulted in a protein function correlation scoring value which is used to rank the relative importance of a function pair. It has been found that certain classes of protein functions tend to be correlated together. Scoring values of these correlation pairs allow us to predict the functionality of a protein given that it interacts with proteins having well-defined function annotations. The jackknife test is used to validate the function pair method. The protein function pair approach achieves a prediction sensitivity comparable to an approach using more sophisticated method. The main advantages of this approach are as follows: (i) a set of function-function correlation relations are derived and intuitive biological interpretation can be achieved, and (ii) its simplicity, only two parameters are needed.

[1]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[3]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[4]  Volker Roth,et al.  Improved functional prediction of proteins by learning kernel combinations in multilabel settings , 2007, BMC Bioinformatics.

[5]  Tom M. W. Nye,et al.  Statistical analysis of domains in interacting protein pairs , 2005, Bioinform..

[6]  Jiong Yang,et al.  PathFinder: mining signal transduction pathway segments from protein-protein interaction networks , 2007, BMC Bioinformatics.

[7]  Dmitrij Frishman,et al.  The MIPS mammalian protein?Cprotein interaction database , 2005, Bioinform..

[8]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[9]  Jukka Corander,et al.  A Naive Bayes Classifier for Protein Function Prediction , 2009, Silico Biol..

[10]  Dong-Soo Han,et al.  PreSPI: a domain combination based prediction system for protein-protein interaction. , 2004, Nucleic acids research.

[11]  Limsoon Wong,et al.  An efficient strategy for extensive integration of diverse biological data for protein function prediction , 2007, Bioinform..

[12]  Anton J. Enright,et al.  Protein interaction maps for complete genomes based on gene fusion events , 1999, Nature.

[13]  Kara Dolinski,et al.  Genome Snapshot: a new resource at the Saccharomyces Genome Database (SGD) presenting an overview of the Saccharomyces cerevisiae genome , 2005, Nucleic Acids Res..

[14]  Simon Kasif,et al.  Probabilistic Protein Function Prediction from Heterogeneous Genome-Wide Data , 2007, PloS one.

[15]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[16]  Ting Chen,et al.  Diffusion kernel-based logistic regression models for protein function prediction. , 2006, Omics : a journal of integrative biology.

[17]  D. Eisenberg,et al.  A combined algorithm for genome-wide prediction of protein function , 1999, Nature.

[18]  Ting Chen,et al.  An Integrated Probabilistic Model for Functional Prediction of Proteins , 2004, J. Comput. Biol..

[19]  Zheng-Zhi Wang,et al.  Effect of the quality of the interaction data on predicting protein function from protein-protein interactions , 2008, Interdisciplinary Sciences: Computational Life Sciences.

[20]  Wan Kyu Kim,et al.  Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair. , 2002, Genome informatics. International Conference on Genome Informatics.

[21]  Jason Weston,et al.  Gene functional classification from heterogeneous data , 2001, RECOMB.

[22]  Amy E. Keating,et al.  AVID: An integrative framework for discovering functional relationships among proteins , 2005, BMC Bioinformatics.

[23]  Simon Kasif,et al.  Genomic functional annotation using co-evolution profiles of gene clusters , 2002, Genome Biology.