Building protein-protein interaction networks for Leishmania species through protein structural information

BackgroundSystematic analysis of a parasite interactome is a key approach to understand different biological processes. It makes possible to elucidate disease mechanisms, to predict protein functions and to select promising targets for drug development. Currently, several approaches for protein interaction prediction for non-model species incorporate only small fractions of the entire proteomes and their interactions. Based on this perspective, this study presents an integration of computational methodologies, protein network predictions and comparative analysis of the protozoan species Leishmania braziliensis and Leishmania infantum. These parasites cause Leishmaniasis, a worldwide distributed and neglected disease, with limited treatment options using currently available drugs.ResultsThe predicted interactions were obtained from a meta-approach, applying rigid body docking tests and template-based docking on protein structures predicted by different comparative modeling techniques. In addition, we trained a machine-learning algorithm (Gradient Boosting) using docking information performed on a curated set of positive and negative protein interaction data. Our final model obtained an AUC = 0.88, with recall = 0.69, specificity = 0.88 and precision = 0.83. Using this approach, it was possible to confidently predict 681 protein structures and 6198 protein interactions for L. braziliensis, and 708 protein structures and 7391 protein interactions for L. infantum. The predicted networks were integrated to protein interaction data already available, analyzed using several topological features and used to classify proteins as essential for network stability.ConclusionsThe present study allowed to demonstrate the importance of integrating different methodologies of interaction prediction to increase the coverage of the protein interaction of the studied protocols, besides it made available protein structures and interactions not previously reported.

[1]  Andrej Sali,et al.  Virtual ligand screening against comparative protein structure models. , 2012, Methods in molecular biology.

[2]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[3]  M. Quail,et al.  Whole genome sequencing of multiple Leishmania donovani clinical isolates provides insights into population structure and mechanisms of drug resistance. , 2011, Genome research.

[4]  Reinhard Schneider,et al.  Using graph theory to analyze biological networks , 2011, BioData Mining.

[5]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[6]  Ruth Nussinov,et al.  Structure and dynamics of molecular networks: A novel paradigm of drug discovery. A comprehensive review , 2012, Pharmacology & therapeutics.

[7]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[8]  Yang Zhang,et al.  Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. , 2011, Biophysical journal.

[9]  Asher Mullard,et al.  Protein–protein interaction inhibitors get into the groove , 2012, Nature Reviews Drug Discovery.

[10]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[11]  B. Honig,et al.  Structure-based prediction of protein-protein interactions on a genome-wide scale , 2012, Nature.

[12]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[13]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[14]  Makedonka Mitreva,et al.  Targeting Protein-Protein Interactions for Parasite Control , 2011, PloS one.

[15]  A. Henney,et al.  A network solution , 2008, Nature.

[16]  J. Shaw,et al.  Evolution, classification and geographical distribution. , 1987 .

[17]  David A. Lee,et al.  Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space , 2006, Nucleic acids research.

[18]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[19]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[20]  M. Sternberg,et al.  Prediction of protein-protein interactions by docking methods. , 2002, Current opinion in structural biology.

[21]  Christina Kiel,et al.  Analyzing protein interaction networks using structural information. , 2008, Annual review of biochemistry.

[22]  Zhiping Weng,et al.  Protein–protein docking benchmark version 4.0 , 2010, Proteins.

[23]  A. Vinayagam,et al.  A Directed Protein Interaction Network for Investigating Intracellular Signal Transduction , 2011, Science Signaling.

[24]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[25]  Chung-Yen Lin,et al.  Hubba: hub objects analyzer—a framework of interactome hubs identification for network biology , 2008, Nucleic Acids Res..

[26]  L. Dardenne,et al.  Structural modelling and comparative analysis of homologous, analogous and specific proteins from Trypanosoma cruzi versus Homo sapiens: putative drug targets for chagas' disease treatment , 2010, BMC Genomics.

[27]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[28]  Kyungsook Han,et al.  Sequence-based prediction of protein-protein interactions by means of rotation forest and autocorrelation descriptor. , 2010, Protein and peptide letters.

[29]  Yoshihiro Yamanishi,et al.  Relating drug–protein interaction network with drug side effects , 2012, Bioinform..

[30]  Bonnie Berger,et al.  Struct2Net: a web service to predict protein–protein interactions using a structure-based approach , 2010, Nucleic Acids Res..

[31]  Andrej Sali,et al.  Comparative Protein Structure Modeling Using MODELLER , 2014, Current protocols in bioinformatics.

[32]  Michael J E Sternberg,et al.  The Phyre2 web portal for protein modeling, prediction and analysis , 2015, Nature Protocols.

[33]  Patrick Aloy,et al.  Interrogating protein interaction networks through structural biology , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[34]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[35]  Shuai Cheng Li,et al.  A tool for clustering large numbers of protein decoys , 2010 .

[36]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[37]  A. Valencia,et al.  Computational methods for the prediction of protein interactions. , 2002, Current opinion in structural biology.

[38]  Ozlem Keskin,et al.  Prediction of Protein Interactions by Structural Matching: Prediction of PPI Networks and the Effects of Mutations on PPIs that Combines Sequence and Structural Information. , 2017, Methods in molecular biology.

[39]  Mark Gerstein,et al.  The Importance of Bottlenecks in Protein Networks: Correlation with Gene Essentiality and Expression Dynamics , 2007, PLoS Comput. Biol..

[40]  Fausto Spoto,et al.  Creating, generating and comparing random network models with Network Randomizer , 2016, F1000Research.

[41]  N. Chandra,et al.  Mycobacterium tuberculosis interactome analysis unravels potential pathways to drug resistance , 2008, BMC Microbiology.

[42]  B. D. Saúde.,et al.  Protocolo de vigilância e resposta à ocorrência de microcefalia relacionada à infecção pelo vírus zika , 2015 .

[43]  J. Estaquier,et al.  Regulation of immunity during visceral Leishmania infection , 2016, Parasites & Vectors.

[44]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[45]  Kengo Kinoshita,et al.  Prediction of disordered regions in proteins based on the meta approach , 2008, Bioinform..

[46]  Richard N. Armstrong,et al.  Prediction of Substrates for Glutathione Transferases by Covalent Docking , 2014, J. Chem. Inf. Model..

[47]  M. Vidal,et al.  Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs". , 2001, Genome research.

[48]  Paul Horton,et al.  Nucleic Acids Research Advance Access published May 21, 2007 WoLF PSORT: protein localization predictor , 2007 .

[49]  Alpan Raval,et al.  Identifying Hubs in Protein Interaction Networks , 2009, PloS one.

[50]  Ozlem Keskin,et al.  Human Cancer Protein-Protein Interaction Network: A Structural Perspective , 2009, PLoS Comput. Biol..

[51]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[52]  W-C Hwang,et al.  Identification of Information Flow‐Modulating Drug Targets: A Novel Bridging Paradigm for Drug Discovery , 2008, Clinical pharmacology and therapeutics.

[53]  Ran Kafri,et al.  Preferential protection of protein interaction network hubs in yeast: Evolved functionality of genetic redundancy , 2008, Proceedings of the National Academy of Sciences.

[54]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[55]  Ruth Nussinov,et al.  A method for simultaneous alignment of multiple protein structures , 2004, Proteins.

[56]  Phoebe M. Roberts,et al.  Mining literature for systems biology , 2006, Briefings Bioinform..

[57]  Parantu K. Shah,et al.  Structural similarity to bridge sequence space: Finding new families on the bridges , 2005, Protein science : a publication of the Protein Society.

[58]  C. Nakamura,et al.  Recent advances in leishmaniasis treatment. , 2011, International journal of infectious diseases : IJID : official publication of the International Society for Infectious Diseases.

[59]  T. Blundell,et al.  Structural biology and drug discovery. , 2005, Drug discovery today.

[60]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using MODELLER , 2016, Current protocols in bioinformatics.

[61]  Hiroaki Kitano,et al.  Structure of Protein Interaction Networks and Their Implications on Drug Design , 2009, PLoS Comput. Biol..

[62]  L. Bonetta Protein–protein interactions: Interactome under construction , 2010, Nature.

[63]  H. Wolfson,et al.  FiberDock: Flexible induced‐fit backbone refinement in molecular docking , 2010, Proteins.

[64]  Giulio Superti-Furga,et al.  Protein interaction networks in innate immunity. , 2013, Trends in immunology.

[65]  Dmitrij Frishman,et al.  Negatome 2.0: a database of non-interacting proteins derived by literature mining, manual annotation and protein structure analysis , 2013, Nucleic Acids Res..

[66]  Paola Lecca,et al.  Detecting modules in biological networks by edge weight clustering and entropy significance , 2015, Front. Genet..

[67]  S. Castanys,et al.  Fitness of Leishmania donovani Parasites Resistant to Drug Combinations , 2015, PLoS neglected tropical diseases.

[68]  Alex W. Wilkinson,et al.  Computational prediction of protein-protein interactions , 2012 .

[69]  Edson L. Folador,et al.  Computational Prediction of Protein-Protein Interactions in Leishmania Predicted Proteomes , 2012, PloS one.

[70]  P. Stadler,et al.  Centers of complex networks. , 2003, Journal of theoretical biology.

[71]  Carsten Wiuf,et al.  Subnets of scale-free networks are not scale-free: sampling properties of networks. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Pawel Herzyk,et al.  Chromosome and gene copy number variation allow major structural change between species and strains of Leishmania. , 2011, Genome research.

[73]  Allan Kuchinsky,et al.  Protein network prediction and topological analysis in Leishmania major as a tool for drug target selection , 2010, BMC Bioinformatics.

[74]  Dong Xu,et al.  Selective refinement and selection of near‐native models in protein structure prediction , 2015, Proteins.

[75]  R. Nussinov,et al.  Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM , 2011, Nature Protocols.

[76]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[77]  Mate S. Szalay,et al.  How to design multi-target drugs , 2007, Expert opinion on drug discovery.

[78]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[79]  István A. Kovács,et al.  Network-Based Tools for the Identification of Novel Drug Targets , 2011, Science Signaling.

[80]  Eileen Kraemer,et al.  TriTrypDB: a functional genomic resource for the Trypanosomatidae , 2009, Nucleic Acids Res..

[81]  A. Valencia,et al.  High-confidence prediction of global interactomes based on genome-wide coevolutionary networks , 2008, Proceedings of the National Academy of Sciences.

[82]  Caroline C. Friedel,et al.  Inferring topology from clustering coefficients in protein-protein interaction networks , 2006, BMC Bioinformatics.

[83]  P. Bastien,et al.  Large-Scale Investigation of Leishmania Interaction Networks with Host Extracellular Matrix by Surface Plasmon Resonance Imaging , 2013, Infection and Immunity.

[84]  Bridget E. Begg,et al.  A Proteome-Scale Map of the Human Interactome Network , 2014, Cell.

[85]  Creating, generating and comparing random network models with NetworkRandomizer. , 2016, F1000Research.

[86]  R. Killick-Kendrick,et al.  The Leishmaniases in biology and medicine , 1987 .

[87]  Yuri Matsuzaki,et al.  Highly precise protein-protein interaction prediction based on consensus between template-based and de novo docking methods , 2013, BMC Proceedings.

[88]  S. Gygi,et al.  Network organization of the human autophagy system , 2010, Nature.

[89]  Artem Cherkasov,et al.  Structural characterization of genomes by large scale sequence-structure threading: application of reliability analysis in structural genomics , 2004, BMC Bioinformatics.

[90]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[91]  P. Bork,et al.  Predicting biological networks from genomic data , 2008, FEBS letters.

[92]  Margaret E. Johnson,et al.  Protein-protein binding selectivity and network topology constrain global and local properties of interface binding networks , 2017, Scientific Reports.

[93]  E. Levanon,et al.  Preferential attachment in the protein network evolution. , 2003, Physical review letters.

[94]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[95]  J. Cano,et al.  Leishmaniasis Worldwide and Global Estimates of Its Incidence , 2012, PloS one.

[96]  Tom L. Blundell,et al.  Keynote review: Structural biology and drug discovery , 2005 .

[97]  A. Barabasi,et al.  Interactome Networks and Human Disease , 2011, Cell.

[98]  T E Browder,et al.  Observation of the D(sJ)(2317) and D(sJ)(2457) in B decays. , 2003, Physical review letters.

[99]  Yuri Matsuzaki,et al.  MEGADOCK: An All-to-All Protein-Protein Interaction Prediction System Using Tertiary Structure Data , 2013, Protein and peptide letters.

[100]  Toshiyuki Sato,et al.  In silico Screening of protein-protein Interactions with All-to-All Rigid docking and Clustering: an Application to Pathway Analysis , 2009, J. Bioinform. Comput. Biol..

[101]  Shu-Lin Wang,et al.  Computational methods for the prediction of protein-protein interactions. , 2010, Protein and peptide letters.

[102]  Harpreet Kaur Saini,et al.  BIOINFORMATICS APPLICATIONS NOTE Structural bioinformatics Meta-DP: domain prediction meta-server , 2022 .

[103]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[104]  M. Vignali,et al.  A protein interaction network of the malaria parasite Plasmodium falciparum , 2005, Nature.

[105]  Marc A. Martí-Renom,et al.  MODBASE: a database of annotated comparative protein structure models and associated resources , 2005, Nucleic Acids Res..

[106]  Robert E. W. Hancock,et al.  NetworkAnalyst - integrative approaches for protein–protein interaction network analysis and visual exploration , 2014, Nucleic Acids Res..

[107]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[108]  Concettina Guerra,et al.  Computational Methods for the Prediction of Protein-Protein Interactions , 2011, IWCIA.