Data-driven efficient network and surveillance-based immunization

Given a contact network and coarse-grained diagnostic information such as electronic Healthcare Reimbursement Claims (eHRC) data, can we develop efficient intervention policies from data to control an epidemic? Immunization is an important problem in multiple areas, especially epidemiology and public health. However, most existing studies rely on assuming prior epidemiological models to develop pre-emptive strategies, which may fail to adapt to the change in new epidemiological patterns and the availability of rich data such as eHRC. In practice, disease spread is usually complicated, hence assuming an underlying model may deviate from true spreading patterns, leading to possibly inaccurate interventions. Additionally, the abundance of health care surveillance data (such as eHRC) makes it possible to study data-driven strategies without too many restrictive assumptions. Hence, such a data-driven intervention approach can help public-health experts take more practical decisions. In this paper, we take into account propagation log and contact networks for controlling propagation. Different from previous model-based approaches, our solutions are solely data driven in a sense that we develop immunization strategies directly from the network and eHRC without assuming classical epidemiological models. In particular, we formulate the novel and challenging data-driven immunization problem. To solve it, we first propose an efficient sampling approach to align surveillance data with contact networks, then develop an efficient algorithm with the provably approximate guarantee for immunization. Finally, we show the effectiveness and scalability of our methods via extensive experiments on multiple datasets, and conduct case studies on nation-wide real medical surveillance data.

[1]  Madhav V. Marathe,et al.  Generation and analysis of large synthetic social contact networks , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[2]  Aaron F. McDaid,et al.  Clustering in networks with the collapsed Stochastic Block Model , 2012, 1203.3083.

[3]  Svetha Venkatesh,et al.  Effective sparse imputation of patient conditions in electronic medical records for emergency risk predictions , 2017, Knowledge and Information Systems.

[4]  Aravind Srinivasan,et al.  Modelling disease outbreaks in realistic urban social networks , 2004, Nature.

[5]  Eunha Shim,et al.  Optimal strategies of social distancing and vaccination against seasonal influenza. , 2013, Mathematical biosciences and engineering : MBE.

[6]  Christos Faloutsos,et al.  Fractional Immunization in Networks , 2013, SDM.

[7]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[8]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[9]  Herbert W. Hethcote,et al.  The Mathematics of Infectious Diseases , 2000, SIAM Rev..

[10]  D. Stevanović,et al.  Decreasing the spectral radius of a graph by link removals. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[12]  Donald F. Towsley,et al.  The effect of network topology on the spread of epidemics , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[13]  Michalis Faloutsos,et al.  Threshold conditions for arbitrary cascade models on arbitrary networks , 2011, 2011 IEEE 11th International Conference on Data Mining.

[14]  Yao Zhang,et al.  DAVA: Distributing Vaccines over Networks under Prior Information , 2014, SDM.

[15]  Michalis Faloutsos,et al.  Gelling, and melting, large graphs by edge manipulation , 2012, CIKM.

[16]  Cécile Viboud,et al.  Spatial Transmission of 2009 Pandemic Influenza in the US , 2014, PLoS Comput. Biol..

[17]  Reuven Cohen,et al.  Efficient immunization strategies for computer networks and populations. , 2002, Physical review letters.

[18]  Christos Faloutsos,et al.  On the Vulnerability of Large Graphs , 2010, 2010 IEEE International Conference on Data Mining.

[19]  Laks V. S. Lakshmanan,et al.  A Data-Based Approach to Social Influence Maximization , 2011, Proc. VLDB Endow..

[20]  Shweta Bansal,et al.  Eight challenges for network epidemic models. , 2015, Epidemics.

[21]  James Aspnes,et al.  Inoculation strategies for victims of viruses and the sum-of-squares partition problem , 2005, SODA '05.

[22]  A. J. Hall Infectious diseases of humans: R. M. Anderson & R. M. May. Oxford etc.: Oxford University Press, 1991. viii + 757 pp. Price £50. ISBN 0-19-854599-1 , 1992 .

[23]  J. Medlock,et al.  Optimizing Influenza Vaccine Distribution , 2009, Science.

[24]  Arvind Ramanathan,et al.  Augmenting Epidemiological Models with Point-Of-Care Diagnostics Data , 2016, PloS one.

[25]  Philippe Flajolet,et al.  Probabilistic Counting Algorithms for Data Base Applications , 1985, J. Comput. Syst. Sci..

[26]  Anita Driessen-Mol,et al.  Age-Dependent Changes in Geometry, Tissue Composition and Mechanical Properties of Fetal to Adult Cryopreserved Human Heart Valves , 2016, PloS one.

[27]  Avinash R. Patwardhan,et al.  Comparison: Flu Prescription Sales Data from a Retail Pharmacy in the US with Google Flu Trends and US ILINet (CDC) Data as Flu Activity Indicator , 2012, PloS one.

[28]  Christos Faloutsos,et al.  ANF: a fast and scalable tool for data mining in massive graphs , 2002, KDD.

[29]  Le Song,et al.  Scalable diffusion-aware optimization of network topology , 2014, KDD.

[30]  Arvind Ramanathan,et al.  ORBiT: Oak Ridge biosurveillance toolkit for public health dynamics , 2015, BMC Bioinformatics.

[31]  Yao Zhang,et al.  Controlling Propagation at Group Scale on Networks , 2015, 2015 IEEE International Conference on Data Mining.

[32]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[33]  Laura L. Pullum,et al.  Sequential pattern mining of electronic healthcare reimbursement claims: Experiences and challenges in uncovering how patients are treated by physicians , 2015, 2015 IEEE International Conference on Big Data (Big Data).