Forensic Signature Detection of Yersinia pestis Culturing Practices across Institutions Using a Bayesian Network

The field of bioforensics is focused on the analysis of evidence from a biocrime. Existing laboratory analyses can identify the specific strain of an organism, as well signatures of the specific culture batch of organisms, such as low-frequency contaminants or indicators of growth and processing methods. To link these disparate types of physical data to potential suspects, investigators may need to identify institutions or individuals whose access to strains and culturing practices match those identified from the evidence. In this work, we present a Bayesian statistical network to fuse different types of analytical measurements that predict the production environment of a Yersinia pestis (Y. pestis) sample under investigation with automated text processing of scientific publications to identify institutions with a history of growing Y. pestis under similar conditions. Furthermore, the textual and experimental signatures were evaluated recursively to determine the overall sensitivity of the network across all levels of false positives. We illustrate that institutions associated with several specific culturing practices can be accurately selected based on the experimental signature from only a few analytical measurements. These findings demonstrate that similar Bayesian networks can be generated generically for many organisms of interest and their deployment is not prohibitive due to either computational or experimental factors.

[1]  Lee Ann McCue,et al.  Fusion of laboratory and textual data for investigative bioforensics. , 2013, Forensic science international.

[2]  J. Ehleringer,et al.  Hydrogen and oxygen isotope ratios in human hair are related to geography , 2008, Proceedings of the National Academy of Sciences.

[3]  N. Valentine,et al.  The effect of growth medium on B. anthracis Sterne spore carbohydrate content. , 2011, Journal of microbiological methods.

[4]  Tyler B. Coplen,et al.  NEW GUIDELINES FOR REPORTING STABLE HYDROGEN, CARBON, AND OXYGEN ISOTOPE-RATIO DATA , 1996 .

[5]  Helen W. Kreuzer-Martin,et al.  Stable isotope ratios as a tool in microbial forensics--Part 1. Microbial isotopic composition as a function of growth medium. , 2004, Journal of forensic sciences.

[6]  Paolo Garbolino,et al.  Evaluation of scientific evidence using Bayesian networks. , 2002, Forensic science international.

[7]  Heather A. Colburn,et al.  Bayesian-Integrated Microbial Forensics , 2008, Applied and Environmental Microbiology.

[8]  Bruce Budowle,et al.  Building Microbial Forensics as a Response to Bioterrorism , 2003, Science.

[9]  Randall S Murch Microbial forensics: building a national capacity to investigate bioterrorism. , 2003, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[10]  Lucila Ohno-Machado,et al.  The use of receiver operating characteristic curves in biomedical informatics , 2005, J. Biomed. Informatics.

[11]  Simone Gittelson,et al.  Bayesian Networks and the Value of the Evidence for the Forensic Two‐Trace Transfer Problem * , 2012, Journal of forensic sciences.

[12]  G. M. Mong,et al.  Impurity profiling to match a nerve agent to its precursor source for chemical forensics applications. , 2011, Analytical chemistry.

[13]  J. Hilden Statistical diagnosis based on conditional independence does not require it. , 1984, Computers in biology and medicine.

[14]  Joel G. Pounds,et al.  Pacific Symposium on Biocomputing 14:451-463 (2009) A BAYESIAN INTEGRATION MODEL OF HIGH- THROUGHPUT PROTEOMICS AND METABOLOMICS DATA FOR IMPROVED EARLY DETECTION OF MICROBIAL INFECTIONS , 2022 .

[15]  Helen W. Kreuzer-Martin,et al.  Microbe forensics: Oxygen and hydrogen stable isotope ratios in Bacillus subtilis cells and spores , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Xuegong Zhang,et al.  Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data , 2006, BMC Bioinformatics.

[17]  Ji-Hyun Kim,et al.  Estimating classification error rate: Repeated cross-validation, repeated hold-out and bootstrap , 2009, Comput. Stat. Data Anal..

[18]  N. Valentine,et al.  Characterization of residual medium peptides from Yersinia pestis cultures. , 2013, Analytical chemistry.

[19]  F Taroni,et al.  A general approach to Bayesian networks for the interpretation of evidence. , 2004, Forensic science international.

[20]  Helen Kreuzer,et al.  Bayesian Integration of Isotope Ratio for Geographic Sourcing of Castor Beans , 2012, Journal of biomedicine & biotechnology.

[21]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.