The minimum information about a genome sequence (MIGS) specification

With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed under the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the 'transparency' of the information contained in existing genomic databases.

Chris F. Taylor | Allyson L. Lister | Samuel V. Angiuoli | Richard L. Moxon | G. Cochrane | M. Ashburner | S. Lewis | T. Tatusova | N. Maltsev | Nelson Axelrod | S. Kravitz | Adrian J. Tett | D. Ussery | P. Dawyndt | F. Glöckner | N. Morrison | N. Kyrpides | E. Kolker | J. Gilbert | P. Vos | D. Haft | H. Hermjakob | Susanna-Assunta Sansone | Y. Tateno | V. Markowitz | K. Nelson | J. Selengut | J. Leebens-Mack | C. dePamphilis | J. Parkhill | C. Hertz-Fowler | R. Guralnick | P. Hugenholtz | R. Edwards | P. Gilna | P. Lord | A. Wipat | N. Ward | R. Stevens | J. Kennedy | J. Boore | D. Field | Kelvin Li | P. Sterk | Leonid Kagan | B. Methé | L. Proctor | G. Garrity | J. Cole | N. Faruque | Tanya Gray | N. Thomson | M. Allen | S. Baldauf | Stuart Ballard | Robert Feldman | P. Goldstein | David Hancock | I. Joint | M. Kane | G. Kowalchuk | R. Kottmann | J. Martiny | I. Mizrachi | Owen White | A. Spiers | P. Swift | S. Turner | Bob Vaughan | T. Whetzel | Ingio San Gil | G. Wilson | Robert A. Feldman | Jennifer B. H. Martiny | Adrian Tett | Jim Leebens-Mack | Nadeem Faruque | Trish Whetzel | Nick R. Thomson | Michael Ashburner | Suzanna E. Lewis | Henning Hermjakob | Eugene Kolker | Dawn Field | George Garrity | Norman Morrison | Jeremy Selengut | Tatiana Tatusova | Nicholas Thomson | Michael J. Allen | Sandra Baldauf | Jeffrey Boore | Guy Cochrane | James Cole | Claude dePamphilis | Dan Haft | Christiane Hertz-Fowler | Phil Hugenholtz | Jessie Kennedy | George Kowalchuk | Phillip Lord | Natalia Maltsev | Jennifer Martiny | Barbara Methé | Richard Moxon | Karen Nelson | Andrew Spiers | Robert Stevens | Paul Swift | Chris Taylor | Yoshio Tateno | Sarah Turner | Naomi Ward | Gareth Wilson
