Oak Ridge Bio-surveillance Toolkit (ORBiT): Integrating Big-Data Analytics with Visual Analysis for Public Health Dynamics

In this position paper, we describe the design and implementation of the Oak Ridge Bio-surveillance Toolkit (ORBiT): a collection of novel statistical and machine learning tools implemented for (1) integrating heterogeneous traditional (e.g. emergency room visits, prescription sales data, etc.) and non-traditional (social media such as Twitter and Instagram) data sources, (2) analyzing large-scale datasets and (3) presenting the results from the analytics as a visual interface for the end-user to interact and provide feedback. We present examples of how ORBiT can be used to summarize ex- tremely large-scale datasets effectively and how user interactions can translate into the data analytics process for bio-surveillance. We also present a strategy to estimate parameters relevant to dis- ease spread models from near real time data feeds and show how these estimates can be integrated with disease spread models for large-scale populations. We conclude with a perspective on how integrating data and visual analytics could lead to better forecasting and prediction of disease spread as well as improved awareness of disease susceptible regions.

[1]  Andrej J. Savol,et al.  Event detection and sub‐state discovery from biomolecular simulations using higher‐order statistics: Application to enzyme adenylate kinase , 2012, Proteins.

[2]  Arvind Ramanathan,et al.  On-the-Fly Identification of Conformational Substates from Molecular Dynamics Simulations. , 2011, Journal of chemical theory and computation.

[3]  Emily H. Chan,et al.  Using Web Search Query Data to Monitor Dengue Epidemics: A New Model for Neglected Tropical Disease Surveillance , 2011, PLoS neglected tropical diseases.

[4]  David S. Ebert,et al.  A pandemic influenza modeling and visualization tool☆ , 2011, Journal of Visual Languages & Computing.

[5]  John R. Goodall,et al.  Interactive Visual Analysis of High Throughput Text Streams , 2012 .

[6]  J. Marc Overhage,et al.  The Indiana Public Health Emergency Surveillance System: Ongoing Progress, Early Findings, and Future Directions , 2006, AMIA.

[7]  Colleen A Bradley,et al.  BioSense: implementation of a National Early Event Detection and Situational Awareness System. , 2005, MMWR supplements.

[8]  George A. Muller,et al.  U.S. airport entry screening in response to pandemic influenza: Modeling and analysis , 2009, Travel Medicine and Infectious Disease.

[9]  Howard S. Burkom,et al.  Statistical Challenges Facing Early Outbreak Detection in Biosurveillance , 2010, Technometrics.

[10]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[11]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[12]  L. Hutwagner,et al.  The bioterrorism preparedness and response Early Aberration Reporting System (EARS) , 2003, Journal of Urban Health.

[13]  Laura L. Pullum,et al.  Integrating Heterogeneous Healthcare Datasets and Visual Analytics for Disease Bio-surveillance and Dynamics , 2013 .

[14]  Ryan Hafen,et al.  Syndromic surveillance: STL for modeling, visualizing, and monitoring disease counts , 2009, BMC Medical Informatics Decis. Mak..

[15]  Wendy W. Chapman,et al.  Natural Language Processing for Biosurveillance , 2006, Handbook of Biosurveillance.

[16]  Allan D. Jepson,et al.  Sparse PCA: Extracting Multi-scale Structure from Data , 2001, ICCV.

[17]  Patrick Kelley,et al.  Identification and investigation of disease outbreaks by ESSENCE , 2006, Journal of Urban Health.

[18]  Kenneth D. Mandl,et al.  HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports , 2008, Journal of the American Medical Informatics Association.

[19]  David S. Ebert,et al.  LAHVA: Linked Animal-Human Health Visual Analytics , 2007, 2007 IEEE Symposium on Visual Analytics Science and Technology.

[20]  Russ P Lopez Disease Surveillance: A Public Health Informatics Approach , 2007 .

[21]  Russ Burtner,et al.  INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS REVIEW Open Access , 2022 .

[22]  J. Small,et al.  The AFHSC-Division of GEIS Operations Predictive Surveillance Program: a multidisciplinary approach for the early detection and response to disease outbreaks , 2011, BMC public health.

[23]  Marko A. Rodriguez,et al.  Exposing multi-relational networks to single-relational network analysis algorithms , 2008, J. Informetrics.

[24]  J. Brownstein,et al.  Digital disease detection--harnessing the Web for public health surveillance. , 2009, The New England journal of medicine.

[25]  National Electronic Disease Surveillance System (NEDSS): a standards-based approach to connect public health and clinical medicine. , 2001, Journal of public health management and practice : JPHMP.

[26]  Allan D. Jepson,et al.  Half-Lives of EigenFlows for Spectral Clustering , 2002, NIPS.