The Alliance of Genome Resources: Building a Modern Data Ecosystem for Model Organism Databases

Model organisms are essential experimental platforms for discovering gene functions, defining protein and genetic networks, uncovering functional consequences of human genome variation, and for modeling human disease. For decades, researchers who use model organisms have relied on Model Organism Databases (MODs) and the Gene Ontology Consortium (GOC) for expertly curated annotations, and for access to integrated genomic and biological information obtained from the scientific literature and public data archives. Through the development and enforcement of data and semantic standards, these genome resources provide rapid access to the collected knowledge of model organisms in human readable and computation-ready formats that would otherwise require countless hours for individual researchers to assemble on their own. Since their inception, the MODs for the predominant biomedical model organisms [Mus sp. (laboratory mouse), Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, Danio rerio, and Rattus norvegicus] along with the GOC have operated as a network of independent, highly collaborative genome resources. In 2016, these six MODs and the GOC joined forces as the Alliance of Genome Resources (the Alliance). By implementing shared programmatic access methods and data-specific web pages with a unified “look and feel,” the Alliance is tackling barriers that have limited the ability of researchers to easily compare common data types and annotations across model organisms. To adapt to the rapidly changing landscape for evaluating and funding core data resources, the Alliance is building a modern, extensible, and operationally efficient “knowledge commons” for model organisms using shared, modular infrastructure.

[1]  Anushya Muruganujan,et al.  Alliance of Genome Resources Portal: unified model organism research platform , 2019, Nucleic Acids Res..

[2]  Yan Wang,et al.  Advances and Applications in the Quest for Orthologs , 2019, Molecular biology and evolution.

[3]  Maxim V. Kuleshov,et al.  modEnrichr: a suite of gene set enrichment analysis tools for model organisms , 2019, Nucleic Acids Res..

[4]  J. Nadeau,et al.  The virtuous cycle of human genetics and mouse models in drug discovery , 2019, Nature Reviews Drug Discovery.

[5]  James C. Hu,et al.  The Gene Ontology Resource: 20 years and still GOing strong , 2019 .

[6]  Kara Dolinski,et al.  The BioGRID interaction database: 2019 update , 2018, Nucleic Acids Res..

[7]  Judith A. Blake,et al.  Mouse Genome Database (MGD) 2019 , 2018, Nucleic Acids Res..

[8]  The UniProt Consortium,et al.  UniProt: a worldwide hub of protein knowledge , 2018, Nucleic Acids Res..

[9]  Giulia Antonazzo,et al.  FlyBase 2.0: the next generation , 2018, Nucleic Acids Res..

[10]  E. Bolton,et al.  The Rat: A Model Used in Biomedical Research. , 2019, Methods in molecular biology.

[11]  P. Ingham From Drosophila segmentation to human cancer therapy , 2018, Development.

[12]  Kimberly Van Auken,et al.  WormBase 2017: molting into a new stage , 2017, Nucleic Acids Res..

[13]  Victoria Petri,et al.  A Primer for the Rat Genome Database (RGD). , 2018, Methods in molecular biology.

[14]  J. Apfeld,et al.  What Can We Learn About Human Disease from the Nematode C. elegans? , 2018, Methods in molecular biology.

[15]  R. Appel,et al.  Funding knowledgebases: Towards a sustainable funding model for the UniProt use case , 2017, F1000Research.

[16]  A. Golden From phenologs to silent suppressors: Identifying potential therapeutic targets for human disease , 2017, Molecular reproduction and development.

[17]  Michael F. Wangler,et al.  Model Organisms Facilitate Rare Disease Diagnosis and Therapeutic Research , 2017, Genetics.

[18]  Michael F. Wangler,et al.  MARRVEL: Integration of Human and Model Organism Genetic Resources to Facilitate Functional Annotation of the Human Genome. , 2017, American journal of human genetics.

[19]  Yanhui Hu,et al.  Gene2Function: An Integrated Online Resource for Gene Function Discovery , 2017, G3: Genes, Genomes, Genetics.

[20]  F. Arnaud,et al.  From core referencing to data re-use: two French national initiatives to reinforce paleodata stewardship (National Cyber Core Repository and LTER France Retro-Observatory) , 2017 .

[21]  W. Anderson Data management: A global coalition to sustain core data , 2017, Nature.

[22]  Elissa J Chesler,et al.  Integrative Functional Genomics for Systems Genetics in GeneWeaver.org. , 2017, Methods in molecular biology.

[23]  R. Appel,et al.  Funding knowledgebases: Towards a sustainable funding model for the UniProt use case , 2017, F1000Research.

[24]  R. T. Cox,et al.  Fly Models of Human Diseases: Drosophila as a Model for Understanding Human Mitochondrial Mutations and Disease. , 2017, Current topics in developmental biology.

[25]  Charles E. Cook,et al.  Identifying ELIXIR Core Data Resources , 2017, F1000Research.

[26]  S. Berger,et al.  The Sustained Impact of Model Organisms—in Genetics and Epigenetics , 2016, Genetics.

[27]  F. Collins,et al.  Meeting Report: The Allied Genetics Conference 2016 , 2016, G3: Genes, Genomes, Genetics.

[28]  K. Strange,et al.  Drug Discovery in Fish, Flies, and Worms. , 2016, ILAR journal.

[29]  Tudor Groza,et al.  The Monarch Initiative: an integrative data and analytic platform connecting phenotypes to genotypes across species , 2016, bioRxiv.

[30]  Alan Ruttenberg,et al.  The Cell Ontology 2016: enhanced content, modularization, and ontology interoperability , 2016, J. Biomed. Semant..

[31]  Robert Stevens,et al.  A Survey of Bioinformatics Database and Software Usage through Mining the Literature , 2016, PloS one.

[32]  Midori A. Harris,et al.  Model organism databases: essential resources that need the support of both funders and users , 2016, BMC Biology.

[33]  Erika Check Hayden,et al.  Funding for model-organism databases in trouble , 2016 .

[34]  Hugo J. Bellen,et al.  COLLECTION : TRANSLATIONAL IMPACT OF DROSOPHILA Drosophila tools and assays for the study of human diseases , 2016 .

[35]  Judith A. Blake,et al.  Mouse genome database 2016 , 2015, Nucleic Acids Res..

[36]  Jocelyn Kaiser,et al.  BIOMEDICAL RESOURCES. Funding for key data resources in jeopardy. , 2016, Science.

[37]  Corey Nislow,et al.  Complementation of Yeast Genes with Human Genes as an Experimental Platform for Functional Testing of Human Genetic Variants , 2015, Genetics.

[38]  Sergio Contrino,et al.  Cross‐organism analysis using InterMine , 2015, Genesis.

[39]  Austin G. Meyer,et al.  Systematic humanization of yeast genes reveals conserved functions and genetic modularity , 2015, Science.

[40]  Jeffrey L. Privette,et al.  A Unified Framework for Measuring Stewardship Practices Applied to Digital Environmental Datasets , 2015, Data Sci. J..

[41]  M. Westerfield,et al.  Zebrafish models in translational research: tipping the scales toward advancements in human health , 2014, Disease Models & Mechanisms.

[42]  Johannes Goll,et al.  Protein interaction data curation: the International Molecular Exchange (IMEx) consortium , 2012, Nature Methods.

[43]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[44]  S. Lewis,et al.  Uberon, an integrative multi-species anatomy ontology , 2012, Genome Biology.

[45]  H. Jacob The rat: a model used in biomedical research. , 2010, Methods in molecular biology.

[46]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[47]  H. Jacob,et al.  Rats! , 2009, Disease Models & Mechanisms.

[48]  Scott Cain,et al.  GMODWeb: a web framework for the generic model organism database , 2008, Genome Biology.

[49]  Judith A. Blake,et al.  Beyond the data deluge: Data integration and bio-ontologies , 2006, J. Biomed. Informatics.

[50]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[51]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[52]  Westerfield,et al.  An on-line database for zebrafish development and genetics research , 1997, Seminars in cell & developmental biology.

[53]  Dennis A. Benson,et al.  GenBank , 2017, Nucleic Acids Res..