Identification of Yeast’s Interactome Using Neural Networks

An important aspect for designing precise medical therapies is to have an accurate knowledge of protein protein interactions involved in the process. Next generation sequencing technologies for discovering novel genes are fuelling an information explosion that allows researchers to study these molecules in previously unimagined ways. A profound understanding of these biological components promises great leaps for the field of medical sciences, i.e., by designing personalized/preventive medicines, or by curing the life threatening diseases including many types of cancers etc. However, experimental techniques are slow and noisy; and usually report false interactions. These limitations prompted a surge of interest in computational techniques to infer the true interactions. In this paper, we propose and evaluate a Neural Network based approach for deciphering the interactions of Saccharomyces cerevisiae species proteins. The novelty of this approach lies in integrating the evidences (broadly classified as structural and non-structural) in a hybrid fashion. The structure-based evidences include, geometrical features extracted from individual homolog templates, e.g., interacting residues, interfacing residues, binding sites of proteins etc., while the non-structural evidences include, biological process, molecular function, cellular component and motif based similarities. These features are combined using Neural Network based classifier to predict true interactions. The algorithm showed encouraging results, when benchmarked for Saccharomyces cerevisiae’s interactome, retrieved from the STRING database; with an accuracy of 92% for functional association networks, while on protein interaction networks the accuracy remained 83%.

[1]  Julie M. Sahalie,et al.  An experimentally derived confidence score for binary protein-protein interactions , 2008, Nature Methods.

[2]  Alfonso Valencia,et al.  Prediction of protein-protein interactions from evolutionary information. , 2003, Methods of biochemical analysis.

[3]  William Stafford Noble,et al.  Large-scale prediction of protein-protein interactions from structures , 2010, BMC Bioinformatics.

[4]  C. Deane,et al.  Protein Interactions , 2002, Molecular & Cellular Proteomics.

[5]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[6]  Alfonso Valencia,et al.  Towards the prediction of protein interaction partners using physical docking , 2011, Molecular systems biology.

[7]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[8]  Benjamin A. Shoemaker,et al.  Deciphering Protein–Protein Interactions. Part II. Computational Methods to Predict Protein and Domain Interaction Partners , 2007, PLoS Comput. Biol..

[9]  Mei Liu,et al.  Prediction of protein-protein interactions using random decision forest framework , 2005, Bioinform..

[10]  Vladimir Pavlovic,et al.  Prediction of Protein Functions with Gene Ontology and Interspecies Protein Homology Data , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Nathan Linial,et al.  Generative probabilistic models for protein–protein interaction networks—the biclique perspective , 2011, Bioinform..

[12]  Jesús Giraldo,et al.  Artificial Intelligence: A Novel Approach for Drug Discovery. , 2019, Trends in pharmacological sciences.

[13]  C. Mungall,et al.  Gene Ontology Consortium : going forward The Gene Ontology , 2015 .

[14]  Alfredo Benso,et al.  A combined approach for genome wide protein function annotation/prediction , 2013, Proteome Science.

[15]  Geoffrey J. Barton,et al.  Probabilistic prediction and ranking of human protein-protein interactions , 2007, BMC Bioinformatics.

[16]  Benjamin A. Shoemaker,et al.  Deciphering Protein–Protein Interactions. Part I. Experimental Techniques and Databases , 2007, PLoS Comput. Biol..

[17]  D. Eisenberg,et al.  Computational methods of analysis of protein-protein interactions. , 2003, Current opinion in structural biology.

[18]  Juwen Shen,et al.  Predicting protein–protein interactions based only on sequences information , 2007, Proceedings of the National Academy of Sciences.

[19]  Raquel Norel,et al.  Protein interface conservation across structure space , 2010, Proceedings of the National Academy of Sciences.

[20]  Adel Golovin,et al.  MSDmotif: exploring protein sites and motifs , 2008, BMC Bioinformatics.

[21]  Katja M. Arndt,et al.  An in vivo library-versus-library selection of optimized protein–protein interactions , 1999, Nature Biotechnology.

[22]  Kyungsook Han,et al.  Sequence-based prediction of protein-protein interactions by means of rotation forest and autocorrelation descriptor. , 2010, Protein and peptide letters.

[23]  Ruth Nussinov,et al.  A method for simultaneous alignment of multiple protein structures , 2004, Proteins.

[24]  B. Séraphin,et al.  A generic protein purification method for protein complex characterization and proteome exploration , 1999, Nature Biotechnology.

[25]  B. Honig,et al.  Structure-based prediction of protein-protein interactions on a genome-wide scale , 2012, Nature.

[26]  Baldomero Oliva,et al.  Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships , 2005, Bioinform..

[27]  Sajjad Ahmad Madani,et al.  Diet-Right: A Smart Food Recommendation System , 2017, KSII Trans. Internet Inf. Syst..

[28]  Matthew Shardlow,et al.  Neural Text Simplification of Clinical Letters with a Domain Specific Phrase Table , 2019, ACL.

[29]  R. Nussinov,et al.  Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM , 2011, Nature Protocols.

[30]  Alfredo Benso,et al.  A Structure based Approach for Accurate Prediction of Protein Interactions Networks , 2016, BIOINFORMATICS.

[31]  T. Barrette,et al.  Probabilistic model of the human protein-protein interaction network , 2005, Nature Biotechnology.

[32]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[33]  Dachuan Zhang,et al.  MMDB and VAST+: tracking structural similarities between macromolecular complexes , 2013, Nucleic Acids Res..

[34]  Zhen Liu,et al.  Refined phylogenetic profiles method for predicting protein-protein interactions , 2005, Bioinform..

[35]  Sophia Ananiadou,et al.  Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature , 2011, BMC Bioinformatics.

[36]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[37]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[38]  E. van Nimwegen,et al.  Accurate Prediction of Protein–protein Interactions from Sequence Alignments Using a Bayesian Method , 2022 .

[39]  Ozlem Keskin,et al.  Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces , 2005, Bioinform..