Data-Driven Immunization

Given a contact network and coarse-grained diagnostic information like electronic Healthcare Reimbursement Claims (eHRC) data, can we develop efficient intervention policies to control an epidemic? Immunization is an important problem in multiple areas especially epidemiology and public health. However, most existing studies focus on developing pre-emptive strategies assuming prior epidemiological models. In practice, disease spread is usually complicated, hence assuming an underlying model may deviate from true spreading patterns, leading to possibly inaccurate interventions. Additionally, the abundance of health care surveillance data (like eHRC) makes it possible to study data-driven strategies without too many restrictive assumptions. Hence, such an approach can help public-health experts take more practical decisions. In this paper, we take into account propagation log and contact networks for controlling propagation. We formulate the novel and challenging Data-Driven Immunization problem without assuming classical epidemiological models. To solve it, we first propose an efficient sampling approach to align surveillance data with contact networks, then develop an efficient algorithm with the provably approximate guarantee for immunization. Finally, we show the effectiveness and scalability of our methods via extensive experiments on multiple datasets, and conduct case studies on nation-wide real medical surveillance data.

[1]  Cécile Viboud,et al.  Spatial Transmission of 2009 Pandemic Influenza in the US , 2014, PLoS Comput. Biol..

[2]  Reuven Cohen,et al.  Efficient immunization strategies for computer networks and populations. , 2002, Physical review letters.

[3]  J. Medlock,et al.  Optimizing Influenza Vaccine Distribution , 2009, Science.

[4]  Avinash R. Patwardhan,et al.  Comparison: Flu Prescription Sales Data from a Retail Pharmacy in the US with Google Flu Trends and US ILINet (CDC) Data as Flu Activity Indicator , 2012, PloS one.

[5]  Herbert W. Hethcote,et al.  The Mathematics of Infectious Diseases , 2000, SIAM Rev..

[6]  D. Stevanović,et al.  Decreasing the spectral radius of a graph by link removals. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[8]  Arvind Ramanathan,et al.  Augmenting Epidemiological Models with Point-Of-Care Diagnostics Data , 2016, PloS one.

[9]  Philippe Flajolet,et al.  Probabilistic Counting Algorithms for Data Base Applications , 1985, J. Comput. Syst. Sci..

[10]  Aravind Srinivasan,et al.  Modelling disease outbreaks in realistic urban social networks , 2004, Nature.

[11]  Aaron F. McDaid,et al.  Clustering in networks with the collapsed Stochastic Block Model , 2012, 1203.3083.

[12]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[13]  Ravi Kumar,et al.  On the Bursty Evolution of Blogspace , 2003, WWW '03.

[14]  Shweta Bansal,et al.  Eight challenges for network epidemic models. , 2015, Epidemics.

[15]  James Aspnes,et al.  Inoculation strategies for victims of viruses and the sum-of-squares partition problem , 2005, SODA '05.

[16]  Madhav V. Marathe,et al.  Generation and analysis of large synthetic social contact networks , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[17]  Christos Faloutsos,et al.  Fractional Immunization in Networks , 2013, SDM.

[18]  Arvind Ramanathan,et al.  ORBiT: Oak Ridge biosurveillance toolkit for public health dynamics , 2015, BMC Bioinformatics.

[19]  Yao Zhang,et al.  Controlling Propagation at Group Scale on Networks , 2015, 2015 IEEE International Conference on Data Mining.

[20]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[21]  Laura L. Pullum,et al.  Sequential pattern mining of electronic healthcare reimbursement claims: Experiences and challenges in uncovering how patients are treated by physicians , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[22]  Michalis Faloutsos,et al.  Gelling, and melting, large graphs by edge manipulation , 2012, CIKM.

[23]  Donald F. Towsley,et al.  The effect of network topology on the spread of epidemics , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[24]  Christos Faloutsos,et al.  On the Vulnerability of Large Graphs , 2010, 2010 IEEE International Conference on Data Mining.

[25]  Laks V. S. Lakshmanan,et al.  A Data-Based Approach to Social Influence Maximization , 2011, Proc. VLDB Endow..

[26]  Le Song,et al.  Scalable diffusion-aware optimization of network topology , 2014, KDD.

[27]  Yao Zhang,et al.  DAVA: Distributing Vaccines over Networks under Prior Information , 2014, SDM.

[28]  Eunha Shim,et al.  Optimal strategies of social distancing and vaccination against seasonal influenza. , 2013, Mathematical biosciences and engineering : MBE.

[29]  Michalis Faloutsos,et al.  Threshold conditions for arbitrary cascade models on arbitrary networks , 2011, 2011 IEEE 11th International Conference on Data Mining.

[30]  Christos Faloutsos,et al.  ANF: a fast and scalable tool for data mining in massive graphs , 2002, KDD.

[31]  A. J. Hall Infectious diseases of humans: R. M. Anderson & R. M. May. Oxford etc.: Oxford University Press, 1991. viii + 757 pp. Price £50. ISBN 0-19-854599-1 , 1992 .