TGS-TB: Total Genotyping Solution for Mycobacterium tuberculosis Using Short-Read Whole-Genome Sequencing

Whole-genome sequencing (WGS) with next-generation DNA sequencing (NGS) is an increasingly accessible and affordable method for genotyping hundreds of Mycobacterium tuberculosis (Mtb) isolates, enabling more effective epidemiological studies based on single nucleotide variations (SNVs) in core genomic sequences and their molecular evolution. We developed an all-in-one web-based tool for genotyping Mtb, the Total Genotyping Solution for TB (TGS-TB), which derives multiple genotypes from NGS reads through a simple, user-friendly click interface: spoligotyping, phylogeny based on core genomic SNVs, IS6110 insertion sites, and variable number tandem repeat (VNTR) typing at 43 customized loci. TGS-TB also incorporates a KvarQ script to predict Mycobacterium tuberculosis complex (MTBC) lineages/sublineages and potential antimicrobial resistance. Seven Mtb isolates (JP01 to JP07) in this study that shared the same VNTR profile were accurately discriminated by median-joining network analysis using SNVs unique to those isolates. An additional IS6110 insertion was detected in one of these isolates, providing genetic information that supports the core genomic SNV results. The results of in silico analyses using TGS-TB were consistent with those obtained using conventional molecular genotyping methods, suggesting that NGS short reads can provide multiple genotypes to discriminate Mtb strains. Full genotyping on the TGS-TB web site requires longer NGS reads (≥300-mer), but most available short reads (~100-mer) can still be used to discriminate isolates based on the core genome phylogeny. TGS-TB provides more accurate and discriminative strain typing for clinical and epidemiological investigations, and NGS strain typing offers a total genotyping solution for Mtb outbreak investigation and surveillance. TGS-TB web site: https://gph.niid.go.jp/tgs-tb/.
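To illustrate how core genomic SNVs can separate isolates that share the same VNTR profile, the following minimal Python sketch computes pairwise SNV distances between concatenated SNV profiles. It is not the TGS-TB implementation: the function names, the isolate labels (`iso_A` to `iso_D`), and the toy sequences are all invented for illustration and do not represent JP01 to JP07 or any real data. A distance matrix of this kind is the sort of input a median-joining network analysis would typically start from.

```python
from itertools import combinations


def snv_distance(a: str, b: str) -> int:
    """Count positions where two equal-length SNV profiles differ.

    Positions where either profile has a non-ACGT call (e.g. 'N') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("profiles must have equal length")
    return sum(
        1
        for x, y in zip(a, b)
        if x != y and x in "ACGT" and y in "ACGT"
    )


def pairwise_distances(profiles: dict) -> dict:
    """Return {(name1, name2): distance} for every unordered pair of isolates."""
    return {
        (p, q): snv_distance(profiles[p], profiles[q])
        for p, q in combinations(sorted(profiles), 2)
    }


# Hypothetical toy profiles: one character per core-genome SNV site.
toy_profiles = {
    "iso_A": "ACGTAC",
    "iso_B": "ACGTAT",
    "iso_C": "ATGTAT",
    "iso_D": "ACGTAC",
}

distances = pairwise_distances(toy_profiles)
for pair, d in distances.items():
    print(pair, d)
```

In this toy example, `iso_A` and `iso_D` are indistinguishable (distance 0), whereas `iso_C` differs from `iso_A` at two sites, so SNV distances add resolution only where the isolates actually carry unique variants.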
