Evaluation of Machine Learning and Rules-Based Approaches for Predicting Antimicrobial Resistance Profiles in Gram-negative Bacilli from Whole Genome Sequence Data

The time-to-result for culture-based microorganism recovery and phenotypic antimicrobial susceptibility testing necessitates initial use of empiric (frequently broad-spectrum) antimicrobial therapy. Suboptimal empiric therapy can lead to adverse patient outcomes and contribute to increasing antibiotic resistance in pathogens. New, more rapid technologies are emerging to shorten this time-to-result. Many of them identify resistance genes rather than directly assaying resistance phenotypes, and thus require interpretation to translate the genotype into treatment recommendations. These interpretations, like other parts of clinical diagnostic workflows, are likely to be increasingly automated in the future. We set out to evaluate the two major approaches that are amenable to automated pipelines: rules-based methods and machine-learning methods. The rules-based algorithm makes predictions based upon current, curated knowledge of Enterobacteriaceae resistance genes. The machine-learning algorithm predicts resistance and susceptibility based on a model built from a training set of variably resistant isolates. As our test set, we used whole genome sequence data from 78 clinical Enterobacteriaceae isolates, previously identified to represent a variety of phenotypes, from fully susceptible to pan-resistant strains for the antibiotics tested. We tested three antibiotic resistance determinant databases for their utility in identifying the complete resistome of each isolate. The predictions of the rules-based and machine-learning algorithms for these isolates were compared to the results of phenotype-based diagnostics. The rules-based and machine-learning predictions achieved agreement with standard-of-care phenotypic diagnostics of 89.0% and 90.3%, respectively, across twelve antibiotic agents from six major antibiotic classes. Several sources of disagreement between the algorithms were identified. Novel variants of known resistance factors and incomplete genome assembly confounded the rules-based algorithm, resulting in predictions based on gene family rather than on knowledge of the specific variant found. Low-frequency resistance caused errors in the machine-learning algorithm because the responsible genes were absent from, or seen only infrequently in, the test set. We also identified an example of variability in the phenotype-based results that led to disagreement with both genotype-based methods. Genotype-based antimicrobial susceptibility testing shows great promise as a diagnostic tool, and we outline specific research goals to further refine this methodology.
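
The agreement figures reported above are, in essence, the share of isolate/antibiotic pairs for which a genotype-based prediction matches the phenotypic result. A minimal sketch of that comparison follows. The isolate names, antibiotic names, and calls are invented for illustration and are not data from the study, and the function names are hypothetical. The study's exact procedure (for example, how intermediate results were handled) is not stated here, so this sketch assumes simple resistant/susceptible calls.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: (isolate, antibiotic) -> "R" (resistant) or "S" (susceptible).
Calls = Dict[Tuple[str, str], str]


def percent_agreement(predicted: Calls, phenotype: Calls) -> float:
    """Percentage of shared isolate/antibiotic pairs where both calls match."""
    shared = predicted.keys() & phenotype.keys()
    if not shared:
        raise ValueError("no isolate/antibiotic pairs in common")
    matches = sum(predicted[pair] == phenotype[pair] for pair in shared)
    return 100.0 * matches / len(shared)


def disagreements(predicted: Calls, phenotype: Calls) -> List[Tuple[str, str]]:
    """Sorted list of shared pairs where the prediction differs from the phenotype."""
    shared = predicted.keys() & phenotype.keys()
    return sorted(pair for pair in shared if predicted[pair] != phenotype[pair])


# Invented example data (not from the study).
phenotype: Calls = {
    ("isolate_1", "ciprofloxacin"): "R",
    ("isolate_1", "meropenem"): "S",
    ("isolate_2", "ciprofloxacin"): "S",
    ("isolate_2", "meropenem"): "R",
}
predicted: Calls = {
    ("isolate_1", "ciprofloxacin"): "R",
    ("isolate_1", "meropenem"): "S",
    ("isolate_2", "ciprofloxacin"): "S",
    ("isolate_2", "meropenem"): "S",  # missed resistance
}

agreement = percent_agreement(predicted, phenotype)
mismatched = disagreements(predicted, phenotype)
print(f"{agreement:.1f}% agreement; disagreements: {mismatched}")
```

Listing the mismatched pairs alongside the overall percentage mirrors the abstract's emphasis on tracing each disagreement back to a cause, such as a novel gene variant or variability in the phenotypic result.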
