Open source tool for prediction of genome wide protein-protein interaction network based on ortholog information

BackgroundProtein-protein interactions are crucially important for cellular processes. Knowledge of these interactions improves the understanding of cell cycle, metabolism, signaling, transport, and secretion. Information about interactions can hint at molecular causes of diseases, and can provide clues for new therapeutic approaches. Several (usually expensive and time consuming) experimental methods can probe protein - protein interactions. Data sets, derived from such experiments make the development of prediction methods feasible, and make the creation of protein-protein interaction network predicting tools possible.MethodsHere we report the development of a simple open source program module (OpenPPI_predictor) that can generate a putative protein-protein interaction network for target genomes. This tool uses the orthologous interactome network data from a related, experimentally studied organism.ResultsResults from our predictions can be visualized using the Cytoscape visualization software, and can be piped to downstream processing algorithms. We have employed our program to predict protein-protein interaction network for the human parasite roundworm Brugia malayi, using interactome data from the free living nematode Caenorhabditis elegans.AvailabilityThe OpenPPI_predictor source code is available from http://tools.neb.com/~posfai/.

[1]  John P. Overington,et al.  Genomic-scale prioritization of drug targets: the TDR Targets database , 2008, Nature Reviews Drug Discovery.

[2]  Ozlem Keskin,et al.  Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces , 2005, Bioinform..

[3]  M. Moran,et al.  Mass spectrometry for the study of protein-protein interactions. , 2001, Methods.

[4]  Christian von Mering,et al.  STRING 8—a global view on proteins and their functional interactions in 630 organisms , 2008, Nucleic Acids Res..

[5]  Reza Salavati,et al.  Sequence-based prediction of protein-protein interactions by means of codon usage , 2008, Genome Biology.

[6]  N. Anderson,et al.  Proteome and proteomics: New technologies, new concepts, and new words , 1998, Electrophoresis.

[7]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Feng Chen,et al.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups , 2005, Nucleic Acids Res..

[9]  Cheng-Yan Kao,et al.  POINT: a database for the prediction of protein-protein interactions based on the orthologous interactome , 2004, Bioinform..

[10]  Ozlem Keskin,et al.  PRISM: protein interactions by structural matching , 2005, Nucleic Acids Res..

[11]  William A. Petri,et al.  Markell and Voge's medical parasitology , 1999 .

[12]  Kimmen Sjölander,et al.  Berkeley PHOG: PhyloFacts orthology group prediction web server , 2009, Nucleic Acids Res..

[13]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[14]  Gabriele Ausiello,et al.  MINT: the Molecular INTeraction database , 2006, Nucleic Acids Res..

[15]  Xiaohua Hu,et al.  Learning an enriched representation from unlabeled data for protein-protein interaction extraction , 2010, BMC Bioinformatics.

[16]  Stanley Letovsky,et al.  Predicting protein function from protein/protein interaction data: a probabilistic approach , 2003, ISMB.

[17]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[18]  Ashkan Golshani,et al.  Computational methods for predicting protein-protein interactions. , 2008, Advances in biochemical engineering/biotechnology.

[19]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[20]  Alex W. Wilkinson,et al.  Computational prediction of protein-protein interactions , 2012 .

[21]  Alessandro Vespignani,et al.  Global protein function prediction from protein-protein interaction networks , 2003, Nature Biotechnology.

[22]  Kara Dolinski,et al.  The Princeton Protein Orthology Database (P-POD): A Comparative Genomics Analysis Tool for Biologists , 2007, PloS one.

[23]  Maria Victoria Schneider,et al.  MINT: a Molecular INTeraction database. , 2002, FEBS letters.

[24]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[25]  R. Nussinov,et al.  Hot regions in protein--protein interactions: the organization and contribution of structurally conserved hot spot residues. , 2005, Journal of molecular biology.

[26]  Jonathan E. Allen,et al.  Draft Genome of the Filarial Nematode Parasite Brugia malayi , 2007, Science.

[27]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[28]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[29]  K. Young Yeast two-hybrid: so many interactions, (in) so little time... , 1998, Biology of reproduction.

[30]  W. Blackstock,et al.  Proteomics: quantitative and physical mapping of cellular proteins. , 1999, Trends in biotechnology.

[31]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[32]  Kimberly Van Auken,et al.  WormBase: a comprehensive data resource for Caenorhabditis biology and genomics , 2004, Nucleic Acids Res..

[33]  Sándor Pongor,et al.  ProGMap: an integrated annotation resource for protein orthology , 2009, Nucleic Acids Res..

[34]  Robert B. Russell,et al.  InterPreTS: protein Interaction Prediction through Tertiary Structure , 2003, Bioinform..

[35]  Yi Zhou,et al.  BLASTO: a tool for searching orthologous groups , 2007, Nucleic Acids Res..

[36]  Martin Vingron,et al.  IntAct: an open source molecular interaction database , 2004, Nucleic Acids Res..

[37]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.