Galaxy InteractoMIX: An Integrated Computational Platform for the Study of Protein-Protein Interaction Data.

Protein interactions play a crucial role among the different functions of a cell and are central to our understanding of cellular processes both in health and disease. Here we present Galaxy InteractoMIX (http://galaxy.interactomix.com) a platform composed of thirteen different computational tools each addressing specific aspects of the study of protein-protein interactions, ranging from large-scale cross-specie protein-wide interactomes to atomic resolution level of protein complexes. Galaxy InteractoMIX provides an intuitive interface where users can retrieve consolidated interactomics data distributed across several databases or uncover links between diseases and genes by analyzing the interactomes underlying these diseases. The platform makes possible large-scale prediction and curation protein interactions using the conservation of motifs, interology, or presence or absence of key sequence signatures. The range of structure-based tools includes modeling and analysis of protein complexes, delineation of interfaces and the modeling of peptides acting as inhibitors of protein-protein interactions. Galaxy InteractoMIX includes a range of ready-to-use workflows to run complex analyses requiring minimal intervention by users. The potential range of applications of the platform covers different aspects of Life Science, Biomedicine, Biotechnology and drug discovery where protein associations are studied.

[1]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[2]  Joan Segura,et al.  A holistic in silico approach to predict functional sites in protein structures , 2012, Bioinform..

[3]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[4]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[5]  Arnaud Céol,et al.  3did: identification and classification of domain-based interactions of known three-dimensional structure , 2010, Nucleic Acids Res..

[6]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[7]  Pamela F. Jones,et al.  Improving the prediction of protein binding sites by combining heterogeneous data and Voronoi diagrams , 2011, BMC Bioinformatics.

[8]  Baldomero Oliva,et al.  Biana: a software framework for compiling biological interactions and analyzing networks , 2010, BMC Bioinformatics.

[9]  Tim J. P. Hubbard,et al.  SCOP database in 2002: refinements accommodate structural genomics , 2002, Nucleic Acids Res..

[10]  Ruth Nussinov,et al.  PatchDock and SymmDock: servers for rigid and symmetric docking , 2005, Nucleic Acids Res..

[11]  Rafael C. Jimenez,et al.  The IntAct molecular interaction database in 2012 , 2011, Nucleic Acids Res..

[12]  Baldomero Oliva,et al.  ArchDB 2014: structural classification of loops in proteins , 2013, Nucleic Acids Res..

[13]  A. Barabasi,et al.  An empirical framework for binary interactome mapping , 2008, Nature Methods.

[14]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[15]  S. Brunak,et al.  A scored human protein–protein interaction network to catalyze genomic interpretation , 2017, Nature Methods.

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  Alfonso Valencia,et al.  Towards the prediction of protein interaction partners using physical docking , 2011, Molecular systems biology.

[18]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[19]  Salam A. Assi,et al.  PCRPi: Presaging Critical Residues in Protein interfaces, a new computational tool to chart hot spots in protein interfaces , 2009, Nucleic acids research.

[20]  J. Bonet,et al.  Using collections of structural models to predict changes of binding affinity caused by mutations in protein–protein interactions , 2020, Protein science : a publication of the Protein Society.

[21]  Jimin Pei,et al.  AL2CO: calculation of positional conservation in a protein sequence alignment , 2001, Bioinform..

[22]  Ralf Herwig,et al.  Analyzing and interpreting genome data at the network level with ConsensusPathDB , 2016, Nature Protocols.

[23]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[24]  D. Eisenberg,et al.  Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins. , 2001, Journal of molecular biology.

[25]  Ben M. Webb,et al.  Comparative Protein Structure Modeling Using MODELLER , 2007, Current protocols in protein science.

[26]  Pamela F. Jones,et al.  VORFFIP-Driven Dock: V-D2OCK, a Fast and Accurate Protein Docking Strategy , 2015, PloS one.

[27]  E. Sprinzak,et al.  Correlated sequence-signatures as markers of protein-protein interaction. , 2001, Journal of molecular biology.

[28]  Adam Godzik,et al.  Clustering of highly homologous sequences to reduce the size of large protein databases , 2001, Bioinform..

[29]  Jaime Prilusky,et al.  Automated analysis of interatomic contacts in proteins , 1999, Bioinform..

[30]  Elisenda Feliu,et al.  On the analysis of protein–protein interactions via knowledge‐based potentials for the prediction of protein–protein docking , 2011, Protein science : a publication of the Protein Society.

[31]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[32]  H. Wolfson,et al.  FiberDock: Flexible induced‐fit backbone refinement in molecular docking , 2010, Proteins.

[33]  Baldomero Oliva,et al.  GUILDify: a web server for phenotypic characterization of genes through biological data integration and network-based prioritization algorithms , 2014, Bioinform..

[34]  Damian Szklarczyk,et al.  STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets , 2018, Nucleic Acids Res..

[35]  B. Kuhlman,et al.  A comparison of successful and failed protein interface designs highlights the challenges of designing buried hydrogen bonds , 2013, Protein science : a publication of the Protein Society.

[36]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2011 update , 2010, Nucleic Acids Res..

[37]  Baldomero Oliva,et al.  Knowledge-based modeling of peptides at protein interfaces: PiPreD , 2015, Bioinform..

[38]  Baldomero Oliva,et al.  On the mechanisms of protein interactions: predicting their affinity from unbound tertiary structures , 2017, Bioinform..

[39]  E. Guney,et al.  iFrag: A Protein-Protein Interface Prediction Server Based on Sequence Fragments. , 2017, Journal of molecular biology.

[40]  Oriol Fornes,et al.  InteractoMIX: a suite of computational tools to exploit interactomes in biological and clinical research. , 2016, Biochemical Society transactions.

[41]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[42]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[43]  András Fiser,et al.  Comparative protein structure modeling by combining multiple templates and optimizing sequence-to-structure alignments , 2007, Bioinform..

[44]  I. Jurisica,et al.  Unequal evolutionary conservation of human protein interactions in interologous networks , 2007, Genome Biology.

[45]  E. Guney,et al.  GUILDify v2.0: A Tool to Identify Molecular Networks Underlying Human Diseases, Their Comorbidities and Their Druggable Targets. , 2019, Journal of molecular biology.

[46]  Baldomero Oliva,et al.  BIPS: BIANA Interolog Prediction Server. A tool for protein–protein interaction inference , 2012, Nucleic Acids Res..

[47]  Elisenda Feliu,et al.  Understanding protein-protein interactions using local structural features. , 2013, Journal of molecular biology.

[48]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[49]  Salam A. Assi,et al.  Presaging Critical Residues in Protein interfaces-Web Server (PCRPi-W): A Web Server to Chart Hot Spots in Protein Interfaces , 2010, PloS one.

[50]  Baldomero Oliva,et al.  Frag'r'Us: knowledge-based sampling of protein backbone conformations for de novo structure-based protein design , 2014, Bioinform..

[51]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[52]  Dmitrij Frishman,et al.  Negatome 2.0: a database of non-interacting proteins derived by literature mining, manual annotation and protein structure analysis , 2013, Nucleic Acids Res..

[53]  T. Clackson,et al.  A hot spot of binding energy in a hormone-receptor interface , 1995, Science.

[54]  M. Šikić,et al.  PSAIA – Protein Structure and Interaction Analyzer , 2008, BMC Structural Biology.

[55]  Zhiping Weng,et al.  ZRANK: Reranking protein docking predictions with an optimized energy function , 2007, Proteins.

[56]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[57]  Baldomero Oliva,et al.  iLoops: a protein-protein interaction prediction server based on structural features , 2013, Bioinform..

[58]  András Fiser,et al.  M4T: a comparative protein structure modeling server , 2007, Nucleic Acids Res..

[59]  Patrick Aloy,et al.  Ten thousand interactions for the molecular biologist , 2004, Nature Biotechnology.

[60]  T L Blundell,et al.  Comparison of solvent-inaccessible cores of homologous proteins: definitions useful for protein modelling. , 1987, Protein engineering.

[61]  Raphael A. G. Chaleil,et al.  Updates to the Integrated Protein-Protein Interaction Benchmarks: Docking Benchmark Version 5 and Affinity Benchmark Version 2. , 2015, Journal of molecular biology.