Accommodating error analysis in comparison and clustering of molecular fingerprints.

Molecular epidemiologic studies of infectious diseases rely on pathogen genotype comparisons, which usually yield patterns comprising sets of DNA fragments (DNA fingerprints). We use a highly developed genotyping system, IS6110-based restriction fragment length polymorphism analysis of Mycobacterium tuberculosis, to develop a computational method that automates comparison of large numbers of fingerprints. Because error in fragment length measurements is proportional to fragment length and is positively correlated for fragments within a lane, an align-and-count method that compensates for relative scaling of lanes reliably counts matching fragments between lanes. Results of a two-step method we developed to cluster identical fingerprints agree closely with 5 years of computer-assisted visual matching among 1,335 M. tuberculosis fingerprints. Fully documented and validated methods of automated comparison and clustering will greatly expand the scope of molecular epidemiology.
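The align-and-count idea described above might be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' published procedure: the abstract does not give the algorithm, so the function names (`count_matches`, `_count_at_scale`), the relative tolerance `rel_tol`, the allowed lane-to-lane scaling `max_shift`, and the search over pairwise length ratios are all hypothetical choices. The sketch rescales one lane by a single factor (reflecting error that is correlated within a lane) and accepts a match when two fragment lengths agree within a tolerance proportional to length.

```python
def _count_at_scale(a, b, scale, rel_tol):
    """Greedy one-to-one match count between sorted lanes a and b,
    after multiplying every length in a by `scale`."""
    i = j = matched = 0
    while i < len(a) and j < len(b):
        x = a[i] * scale
        y = b[j]
        # Tolerance grows with fragment length (error proportional to length).
        if abs(x - y) <= rel_tol * y:
            matched += 1
            i += 1
            j += 1
        elif x < y:
            i += 1
        else:
            j += 1
    return matched


def count_matches(lane_a, lane_b, rel_tol=0.01, max_shift=0.05):
    """Return the largest match count over candidate relative scalings.

    rel_tol and max_shift are illustrative values, not taken from the paper.
    """
    a = sorted(lane_a)
    b = sorted(lane_b)
    # Candidate scale factors: no scaling, plus every pairwise ratio that
    # stays within the assumed lane-to-lane scaling limit.
    candidates = [1.0]
    for x in a:
        for y in b:
            ratio = y / x
            if abs(ratio - 1.0) <= max_shift:
                candidates.append(ratio)
    return max(_count_at_scale(a, b, s, rel_tol) for s in candidates)


if __name__ == "__main__":
    lane_a = [1000.0, 2000.0, 4000.0, 8000.0]
    lane_b = [1030.0, 2060.0, 4120.0, 8240.0]  # same pattern, lane read 3% long
    print(count_matches(lane_a, lane_b))                 # with alignment
    print(count_matches(lane_a, lane_b, max_shift=0.0))  # without alignment
```

In this toy example, a lane whose fragments all read 3% long matches nothing under a 1% tolerance when compared directly, but matches on every fragment once the relative scaling is allowed for. A fixed absolute tolerance would behave differently at the two ends of the size range, which is why the tolerance here is relative.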
