Molecular Interaction Search Tool (MIST): an integrated resource for mining gene and protein interaction data

Abstract: Model organism and human databases are rich with information about genetic and physical interactions. These data can be used to interpret and guide the analysis of results from new studies and to develop new hypotheses. Here, we report the development of the Molecular Interaction Search Tool (MIST; http://fgrtools.hms.harvard.edu/MIST/). The MIST database integrates biological interaction data from yeast, nematode, fly, zebrafish, frog, rat and mouse model systems, as well as human. For individual genes or short gene lists, the MIST user interface can be used to identify interacting partners based on protein–protein and genetic interaction (GI) data from the species of interest, as well as on inferred interactions, known as interologs, and to view a corresponding network. The data, interologs and search tools at MIST are also useful for analyzing ‘omics datasets. In addition to describing the integrated database, we demonstrate how MIST can be used to identify an appropriate cut-off value that balances false-positive and false-negative discovery, and we present use-cases for additional types of analysis. Altogether, the MIST database and search tools support visualization and navigation of existing protein and GI data, as well as comparison of new and existing data.
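To make the notion of an interolog concrete, the following minimal Python sketch maps interactions observed in one species onto another species through an ortholog table. It illustrates the general idea only and is not MIST's actual pipeline: the function name `infer_interologs`, the data structures, and the gene identifiers (`flyA`, `humB1`, and so on) are all hypothetical placeholders.

```python
from itertools import product


def infer_interologs(source_interactions, orthologs):
    """Infer target-species interactions from source-species interactions.

    source_interactions: iterable of (gene_a, gene_b) pairs in the source species.
    orthologs: dict mapping a source gene to a set of target-species orthologs.
    Returns a set of frozensets, each an unordered inferred pair in the target
    species (a one-element frozenset denotes an inferred self-interaction).
    """
    inferred = set()
    for gene_a, gene_b in source_interactions:
        # Genes without a known ortholog contribute no inferred pairs.
        targets_a = orthologs.get(gene_a, set())
        targets_b = orthologs.get(gene_b, set())
        # Every combination of orthologs is a candidate interolog.
        for t_a, t_b in product(targets_a, targets_b):
            inferred.add(frozenset((t_a, t_b)))
    return inferred


# Hypothetical toy data: identifiers are placeholders, not real genes.
interactions = [("flyA", "flyB"), ("flyB", "flyC")]
ortholog_map = {"flyA": {"humA"}, "flyB": {"humB1", "humB2"}}

pairs = sorted(tuple(sorted(p)) for p in infer_interologs(interactions, ortholog_map))
print(pairs)  # [('humA', 'humB1'), ('humA', 'humB2')]
```

Because `flyC` has no entry in the ortholog table, the second interaction yields no inferred pair. A real resource would presumably also weight each inferred pair by ortholog confidence and supporting evidence, which this sketch omits.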
