Global Phylogeny of Mycobacterium tuberculosis Based on Single Nucleotide Polymorphism (SNP) Analysis: Insights into Tuberculosis Evolution, Phylogenetic Accuracy of Other DNA Fingerprinting Systems, and Recommendations for a Minimal Standard SNP Set

ABSTRACT We analyzed a global collection of Mycobacterium tuberculosis strains using 212 single nucleotide polymorphism (SNP) markers. SNP nucleotide diversity was high (average across all SNPs, 0.19), and 96% of the SNP locus pairs were in complete linkage disequilibrium. Cluster analyses identified six deeply branching, phylogenetically distinct SNP cluster groups (SCGs) and five subgroups. The SCGs were strongly associated with the geographical origin of the M. tuberculosis samples and the birthplace of the human hosts. The most ancestral cluster (SCG-1) predominated in patients from the Indian subcontinent, while SCG-1 and another ancestral cluster (SCG-2) predominated in patients from East Asia, suggesting that M. tuberculosis first arose in the Indian subcontinent and spread worldwide through East Asia. Restricted SCG diversity and the prevalence of less ancestral SCGs in indigenous populations in Uganda and Mexico suggested a more recent introduction of M. tuberculosis into these regions. The East African Indian and Beijing spoligotypes were concordant with SCG-1 and SCG-2, respectively; X and Central Asian spoligotypes were also associated with one SCG or subgroup combination. Other clades had less consistent associations with SCGs. Mycobacterial interspersed repetitive unit (MIRU) analysis provided less robust phylogenetic information, and only 6 of the 12 MIRU microsatellite loci were highly differentiated between SCGs as measured by GST. Finally, an algorithm was devised to identify two minimal sets of either 45 or 6 SNPs that could be used in future investigations to enable global collaborations for studies on evolution, strain differentiation, and biological differences of M. tuberculosis.
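The two summary statistics named above, per-locus nucleotide diversity and GST, can be illustrated with a minimal sketch. This is not the authors' pipeline: the abstract does not state which estimators were used, so the code assumes Nei's unbiased gene diversity and Nei's GST computed from unweighted group allele frequencies. The function names and toy allele lists are hypothetical.

```python
from collections import Counter


def gene_diversity(alleles):
    """Unbiased Nei gene diversity at one locus: n/(n-1) * (1 - sum(p_i^2)).

    `alleles` is a list with one allele call per strain (assumed haploid).
    """
    n = len(alleles)
    if n < 2:
        return 0.0
    counts = Counter(alleles)
    homozygosity = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - homozygosity)


def gst(groups):
    """Nei's G_ST = (H_T - H_S) / H_T at one locus.

    `groups` maps a group label (e.g. an SCG) to a non-empty list of alleles.
    Assumption: groups are weighted equally and no small-sample correction
    is applied.
    """
    freqs = []
    for alleles in groups.values():
        n = len(alleles)
        counts = Counter(alleles)
        freqs.append({a: c / n for a, c in counts.items()})

    all_alleles = set().union(*freqs)

    # H_S: mean of the within-group diversities.
    h_s = sum(1.0 - sum(p ** 2 for p in f.values()) for f in freqs) / len(freqs)

    # H_T: diversity computed from the mean allele frequencies across groups.
    mean_p = {a: sum(f.get(a, 0.0) for f in freqs) / len(freqs) for a in all_alleles}
    h_t = 1.0 - sum(p ** 2 for p in mean_p.values())

    return 0.0 if h_t == 0 else (h_t - h_s) / h_t


if __name__ == "__main__":
    # Toy data only; these are not values from the study.
    print(gene_diversity(["A", "A", "G", "G"]))
    print(gst({"SCG-1": ["A", "A"], "SCG-2": ["G", "G"]}))
```

Under these assumptions, a locus fixed for different alleles in each group gives a GST of 1, and a locus with identical allele frequencies in every group gives 0. That is the sense in which a marker is "highly differentiated between SCGs".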
