Fusion of laboratory and textual data for investigative bioforensics.

Chemical and biological forensic programs focus on the identification of a threat and acquisition of laboratory measurements to determine how a threat agent may have been produced. However, to generate investigative leads, it might also be useful to identify institutions where the same agent has been produced by the same or a very similar process, since the producer of the agent may have learned methods at a university or similar institution. We have developed a Bayesian network framework that fuses hard and soft data sources to assign probability to production practices. It combines the results of laboratory measurements with an automatic text reader to scan scientific literature and rank institutions that had published papers on the agent of interest in order of the probability that the institution has the capability to generate the sample of interest based on laboratory data. We demonstrate the Bayesian network on an example case from microbial forensics, predicting the methods used to produce Bacillus anthracis spores based on mass spectrometric measurements and identifying institutions that have a history of growing Bacillus spores using the same or highly similar methods. We illustrate that the network model can assign a higher posterior probability than expected by random chance to appropriate institutions when trained using only a small set of manually analyzed documents. This is the first example of an automated methodology to integrate experimental and textual data for the purpose of investigative forensics.

[1]  Philippe Besse,et al.  Statistical Applications in Genetics and Molecular Biology A Sparse PLS for Variable Selection when Integrating Omics Data , 2011 .

[2]  Helen W. Kreuzer-Martin,et al.  Microbe forensics: Oxygen and hydrogen stable isotope ratios in Bacillus subtilis cells and spores , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  N. Valentine,et al.  Determination of post-culture processing with carbohydrates by MALDI-MS and TMS derivatization GC-MS. , 2011, Talanta.

[4]  G. S. Walker,et al.  Rapid Screening for the Detection and Differentiation of Gamma‐Hydroxybutyrate using Ion Chromatography *,† , 2011, Journal of forensic sciences.

[5]  Weiwen Zhang,et al.  Integrating multiple 'omics' analysis for microbial biology: application and methodologies. , 2010, Microbiology.

[6]  Heather A. Colburn,et al.  Bayesian-Integrated Microbial Forensics , 2008, Applied and Environmental Microbiology.

[7]  Nunzianda Frascione,et al.  Detection and identification of body fluid stains using antibody-nanoparticle conjugates. , 2012, The Analyst.

[8]  Bruce Budowle,et al.  Building Microbial Forensics as a Response to Bioterrorism , 2003, Science.

[9]  Randall S Murch Microbial forensics: building a national capacity to investigate bioterrorism. , 2003, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[10]  J. Ballantyne,et al.  Specific and sensitive mRNA biomarkers for the identification of skin in 'touch DNA' evidence. , 2012, Forensic science international. Genetics.

[11]  Kelly Virkler,et al.  Analysis of body fluids for forensic purposes: from laboratory testing to non-destructive rapid confirmatory identification at a crime scene. , 2009, Forensic science international.

[12]  F Taroni,et al.  Bayesian networks for evaluating forensic DNA profiling evidence: a review and guide to literature. , 2012, Forensic science international. Genetics.

[13]  N. Valentine,et al.  The effect of growth medium on B. anthracis Sterne spore carbohydrate content. , 2011, Journal of microbiological methods.

[14]  Simone Gittelson,et al.  Bayesian Networks and the Value of the Evidence for the Forensic Two‐Trace Transfer Problem * , 2012, Journal of forensic sciences.

[15]  M. Montgomery,et al.  Trace Detection of Meglumine and Diatrizoate from Bacillus Spore Samples Using Liquid Chromatography/Mass Spectrometry *,§ , 2012, Journal of forensic sciences.

[16]  D. Newbury,et al.  Postdetonation nuclear debris for attribution , 2010, Proceedings of the National Academy of Sciences.

[17]  Kristin H. Jarman,et al.  Stable Isotope Ratios and Forensic Analysis of Microorganisms , 2007, Applied and Environmental Microbiology.

[18]  B Salbu,et al.  Characterization of U/Pu particles originating from the nuclear weapon accidents at Palomares, Spain, 1966 and Thule, Greenland, 1968. , 2007, The Science of the total environment.

[19]  Joel G. Pounds,et al.  Pacific Symposium on Biocomputing 14:451-463 (2009) A BAYESIAN INTEGRATION MODEL OF HIGH- THROUGHPUT PROTEOMICS AND METABOLOMICS DATA FOR IMPROVED EARLY DETECTION OF MICROBIAL INFECTIONS , 2022 .

[20]  Paolo Garbolino,et al.  Evaluation of scientific evidence using Bayesian networks. , 2002, Forensic science international.

[21]  M. Ketterer,et al.  Multi-technique characterization of a nuclear bomb particle from the Palomares accident. , 2006, Journal of environmental radioactivity.

[22]  Richard Simon,et al.  A comparison of bootstrap methods and an adjusted bootstrap approach for estimating the prediction error in microarray classification , 2007, Statistics in medicine.

[23]  F Taroni,et al.  A general approach to Bayesian networks for the interpretation of evidence. , 2004, Forensic science international.

[24]  M. Sciotti,et al.  Development and characterization of an enzymatic method for the rapid determination of gamma hydroxybutyric acid. , 2010, Chimia.

[25]  F Taroni,et al.  Learning about Bayesian networks for forensic interpretation: an example based on the 'the problem of multiple propositions'. , 2012, Science & justice : journal of the Forensic Science Society.

[26]  F Taroni,et al.  Bayesian networks and probabilistic reasoning about scientific evidence when there is a lack of data. , 2006, Forensic science international.

[27]  M. Sciotti,et al.  An Enzymatic Method to Determine γ-Hydroxybutyric Acid in Serum and Urine , 2011, Therapeutic drug monitoring.

[28]  H. Kreuzer,et al.  Multiple Stable Isotope Characterization as a Forensic Tool to Distinguish Acid Scavenger Samples * , 2012, Journal of forensic sciences.

[29]  G. M. Mong,et al.  Impurity profiling to match a nerve agent to its precursor source for chemical forensics applications. , 2011, Analytical chemistry.