Whole Genome Sequencing: Bacterial Typing Revolutionized

Bacterial genotyping, or identifying bacteria at the genetic level, is particularly important for diagnosis, treatment, and epidemiological surveillance of bacterial infections. This is especially the case for bacteria exhibiting high levels of antibiotic resistance or virulence, and those involved in nosocomial or pandemic infections. Genotyping also has applications in studying bacterial population dynamics. Over the last two decades, molecular methods have progressively replaced phenotypic assays to type bacterial strains. Whole-genome sequencing of bacteria has recently emerged as a cost-effective and convenient approach for addressing many microbiological questions. Here, we briefly review the state of the art within this field and provide a step-by-step introduction to the workflow involved in genome sequencing, assembly and annotation. We also discussed the application of this technique in a clinical microbiology laboratory, focusing on three essential tasks: (1) identifying the species of an isolate, (2) testing its properties, such as resistance to antibiotics and virulence, and (3) monitoring the emergence and spread of bacterial pathogens. We predict that the application of whole generations sequencing will soon be sufficiently fast, accurate and cheap to be used in routine clinical microbiology practice, where it could replace many complex current techniques with a single, more efficient workflow.

[1]  J. Wolf,et al.  A field guide to whole-genome sequencing, assembly and annotation , 2014, Evolutionary applications.

[2]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.

[3]  R. Nielsen,et al.  Unlocking the vault: next‐generation museum population genomics , 2013, Molecular ecology.

[4]  Michael Roberts,et al.  The MaSuRCA genome assembler , 2013, Bioinform..

[5]  Monica C Munoz-Torres,et al.  Web Apollo: a web-based genomic annotation editing platform , 2013, Genome Biology.

[6]  Greg C. Lee,et al.  Assembler for de novo assembly of large genomes , 2013, Proceedings of the National Academy of Sciences.

[7]  Bin Chen,et al.  Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool , 2013, Bioinform..

[8]  S. Narum,et al.  Genotyping‐by‐sequencing in ecological and conservation genomics , 2013, Molecular ecology.

[9]  Douglas G. Scofield,et al.  The Norway spruce genome sequence and conifer genome evolution , 2013, Nature.

[10]  Ola Spjuth,et al.  Lessons learned from implementing a national infrastructure in Sweden for storage and analysis of next-generation sequencing data , 2013, GigaScience.

[11]  Petr Novák,et al.  RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads , 2013, Bioinform..

[12]  O. Ryder,et al.  Conservation genomics of threatened animal species. , 2013, Annual review of animal biosciences.

[13]  Nagarjun Vijay,et al.  Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA‐seq experiments , 2013, Molecular ecology.

[14]  M. Pop,et al.  Sequence assembly demystified , 2013, Nature Reviews Genetics.

[15]  Inanç Birol,et al.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species , 2013, GigaScience.

[16]  David W. Cheung,et al.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler , 2012, GigaScience.

[17]  R. Gibbs,et al.  Mind the Gap: Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology , 2012, PloS one.

[18]  Pall I. Olason,et al.  The genomic landscape of species divergence in Ficedula flycatchers , 2012, Nature.

[19]  Qiang Wang,et al.  The oyster genome reveals stress adaptation and complexity of shell formation , 2012, Nature.

[20]  Daniel J. Wilson,et al.  Transforming clinical microbiology with bacterial genome sequencing , 2012, Nature Reviews Genetics.

[21]  David Haussler,et al.  Tissue sampling methods and standards for vertebrate genomics , 2012, GigaScience.

[22]  W. Pirovano,et al.  Toward almost closed genomes with GapFiller , 2012, Genome Biology.

[23]  M. Schatz,et al.  Hybrid error correction and de novo assembly of single-molecule sequencing reads , 2012, Nature Biotechnology.

[24]  M. Yandell,et al.  A beginner's guide to eukaryotic genome annotation , 2012, Nature Reviews Genetics.

[25]  Mehrdad Hajibabaei,et al.  Next‐generation sequencing technologies for environmental DNA research , 2012, Molecular ecology.

[26]  Keith A. Jolley,et al.  Ribosomal multilocus sequence typing: universal characterization of bacteria from domain to strain , 2012, Microbiology.

[27]  Heng Li,et al.  Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly , 2012, Bioinform..

[28]  M. Blaser,et al.  The human microbiome: at the interface of health and disease , 2012, Nature Reviews Genetics.

[29]  Peter Donnelly,et al.  Evolutionary dynamics of Staphylococcus aureus during progression from carriage to disease , 2012, Proceedings of the National Academy of Sciences.

[30]  R. Durbin,et al.  Efficient de novo assembly of large genomes using compressed data structures. , 2012, Genome research.

[31]  J. Wasserheit,et al.  The emerging threat of untreatable gonococcal infection. , 2012, The New England journal of medicine.

[32]  Ole Lund,et al.  Multilocus Sequence Typing of Total-Genome-Sequenced Bacteria , 2012, Journal of Clinical Microbiology.

[33]  A. Künstner,et al.  ConDeTri - A Content Dependent Read Trimmer for Illumina Data , 2011, PloS one.

[34]  Nuno A. Fonseca,et al.  Assemblathon 1: a competitive assessment of de novo short read assembly methods. , 2011, Genome research.

[35]  R. Durbin,et al.  Inference of human population history from individual whole-genome sequences. , 2011, Nature.

[36]  J. Galindo,et al.  Applications of next generation sequencing in molecular ecology of non-model organisms , 2011, Heredity.

[37]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[38]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[39]  B. Mishra,et al.  Comparing De Novo Genome Assembly: The Long and Short of It , 2011, PloS one.

[40]  C. Herrera,et al.  Untangling individual variation in natural populations: ecological, genetic and epigenetic correlates of long‐term inequality in herbivory , 2011, Molecular ecology.

[41]  Carl Kingsford,et al.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers , 2011, Bioinform..

[42]  W. Pirovano,et al.  Scaffolding pre-assembled contigs using SSPACE , 2011, Bioinform..

[43]  James H. Bullard,et al.  The origin of the Haitian cholera outbreak strain. , 2011, The New England journal of medicine.

[44]  A. Gnirke,et al.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data , 2010, Proceedings of the National Academy of Sciences.

[45]  Martin C. J. Maiden,et al.  BIGSdb: Scalable analysis of bacterial genome variation at the population level , 2010, BMC Bioinformatics.

[46]  Giovanna Morelli,et al.  Phylogenetic diversity and historical patterns of pandemic spread of Yersinia pestis , 2010, Nature Genetics.

[47]  François Laviolette,et al.  Ray: Simultaneous Assembly of Reads from a Mix of High-Throughput Sequencing Technologies , 2010, J. Comput. Biol..

[48]  G. Luikart,et al.  Genomics and the future of conservation genetics , 2010, Nature Reviews Genetics.

[49]  M. Schatz,et al.  Assembly of large genomes using second-generation sequencing. , 2010, Genome research.

[50]  Michael C. Schatz,et al.  Cloud Computing and the DNA Data Race , 2010, Nature Biotechnology.

[51]  S. Koren,et al.  Assembly algorithms for next-generation sequencing data. , 2010, Genomics.

[52]  M. Berriman,et al.  Improving draft assemblies by iterative mapping and assembly of short reads to eliminate gaps , 2010, Genome Biology.

[53]  N. J. Ouborg,et al.  Conservation genetics in transition to conservation genomics. , 2010, Trends in genetics : TIG.

[54]  A. Cheung,et al.  Faculty Opinions recommendation of Evolution of MRSA during hospital transmission and intercontinental spread. , 2010 .

[55]  David M. Aanensen,et al.  EpiCollect: Linking Smartphones to Web Applications for Epidemiology, Ecology and Community Data Collection , 2009, PloS one.

[56]  Didier Raoult,et al.  Bacterial strain typing in the genomic era. , 2009, FEMS microbiology reviews.

[57]  Jeanette C Papp,et al.  The value of avian genomics to the conservation of wildlife , 2009, BMC Genomics.

[58]  Steven J. M. Jones,et al.  Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .

[59]  C. Primmer From Conservation Genetics to Conservation Genomics , 2009, Annals of the New York Academy of Sciences.

[60]  E. Mardis Next-generation DNA sequencing methods. , 2008, Annual review of genomics and human genetics.

[61]  David Hernández,et al.  De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. , 2008, Genome research.

[62]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[63]  J. Lupski,et al.  The complete genome of an individual by massively parallel DNA sequencing , 2008, Nature.

[64]  Aaron L. Halpern,et al.  Consensus generation and variant detection by Celera Assembler , 2008, Bioinform..

[65]  M. Schatz,et al.  Genome assembly forensics: finding the elusive mis-assembly , 2008, Genome Biology.

[66]  Robert D Schnabel,et al.  SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries , 2008, Nature Methods.

[67]  Sofia M. C. Robb,et al.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. , 2007, Genome research.

[68]  Keith Bradnam,et al.  CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes , 2007, Bioinform..

[69]  René L. Warren,et al.  Assembling millions of short DNA sequences using SSAKE , 2006, Bioinform..

[70]  E. Ostrander,et al.  Genomics and conservation genetics. , 2006, Trends in ecology & evolution.

[71]  H. Tettelin,et al.  The microbial pan-genome. , 2005, Current opinion in genetics & development.

[72]  C. Fraser-Liggett,et al.  Insights on biology and evolution from microbial genome sequencing. , 2005, Genome research.

[73]  M. Weizenegger,et al.  Use of the Genotype MTBDR Assay for Rapid Detection of Rifampin and Isoniazid Resistance in Mycobacterium tuberculosis Complex Isolates , 2005, Journal of Clinical Microbiology.

[74]  S. Pääbo,et al.  Genetic analyses from ancient DNA. , 2004, Annual review of genetics.

[75]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[76]  G. Weinstock,et al.  The Atlas genome assembly system. , 2004, Genome research.

[77]  L. Bernatchez,et al.  Adaptive evolutionary conservation: towards a unified concept for defining conservation units , 2001, Molecular ecology.

[78]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[79]  H. Ellegren Genome sequencing and population genomics in non-model organisms. , 2014, Trends in ecology & evolution.

[80]  Claude-Alain H. Roten,et al.  Fast and accurate short read alignment with Burrows–Wheeler transform , 2009, Bioinform..

[81]  P. Vandamme,et al.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. , 2007, International journal of systematic and evolutionary microbiology.

[82]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[83]  B. Berger,et al.  ARACHNE: a whole-genome shotgun assembler. , 2002, Genome research.

[84]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..