Towards Understanding Global Spread of Disease from Everyday Interpersonal Interactions

Monitoring and forecast of global spread of infectious diseases is difficult, mainly due to lack of fine-grained and timely data. Previous work in computational epidemiology has shown that mining data from the web can improve the predictability of high-level aggregate patterns of epidemics. By contrast, this paper explores how individuals contribute to the global spread of disease. We consider the important task of predicting the prevalence of flu-like illness in a given city based on interpersonal interactions of the city's residents with the outside world. We use the geo-tagged status updates of traveling Twitter users to infer properties of the flow of individuals between cities. While previous research considered only the raw volume of passengers, we estimate a number of latent variables, including the number of sick (symptomatic) travelers and the number of sick individuals to whom each traveler was exposed. We show that AI techniques provide insights into the mechanisms of disease spread and significantly improve predictability of future flu outbreaks. Our experiments involve over 51,000 individuals traveling between 75 cities prior and during a severe ongoing flu epidemic (October 2012 - January 2013). Our model leverages the text and interpersonal interactions recorded in over 6.5 million online status updates without any active user participation, enabling scalable public health applications.

[1]  J. H. Ellis,et al.  Erratum: Assessing the Impact of Airline Travel on the Geographic Spread of Pandemic Influenza , 2004, European Journal of Epidemiology.

[2]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[4]  Henry A. Kautz,et al.  Predicting Disease Transmission from Geo-Tagged Micro-Blog Data , 2012, AAAI.

[5]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[6]  Nello Cristianini,et al.  Flu Detector - Tracking Epidemics on Twitter , 2010, ECML/PKDD.

[7]  Marta C. González,et al.  A Metric of Influential Spreading during Contagion Dynamics through the Air Transportation Network , 2012, PloS one.

[8]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[9]  Nigel Collier,et al.  OMG U got flu? Analysis of shared health messages for bio-surveillance , 2011, Semantic Mining in Biomedicine.

[10]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[11]  Mark Dredze,et al.  A Model for Mining , 2011 .

[12]  Thorsten Joachims,et al.  A support vector method for multivariate performance measures , 2005, ICML.

[13]  J. Brownstein,et al.  Empirical Evidence for the Effect of Airline Travel on Inter-Regional Influenza Spread in the United States , 2006, PLoS medicine.

[14]  J. Brownstein,et al.  Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. , 2012, The American journal of tropical medicine and hygiene.

[15]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[16]  Emily H. Chan,et al.  Participatory Epidemiology: Use of Mobile Phones for Community-Based Health Reporting , 2010, PLoS medicine.

[17]  T R Bender,et al.  An outbreak of influenza aboard a commercial airliner. , 1979, American journal of epidemiology.

[18]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[19]  Aravind Srinivasan,et al.  Modelling disease outbreaks in realistic urban social networks , 2004, Nature.

[20]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[21]  L. Bettencourt,et al.  A unified theory of urban living , 2010, Nature.

[22]  Alessandro Vespignani,et al.  The role of the airline transportation network in the prediction and predictability of global epidemics , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[23]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[24]  Henry A. Kautz,et al.  Modeling Spread of Disease from Social Interactions , 2012, ICWSM.

[25]  S. Ruan The effect of global travel on the spread of SARS. , 2005, Mathematical biosciences and engineering : MBE.

[26]  Joshua M. Epstein,et al.  Controlling Pandemic Flu: The Value of International Air Travel Restrictions , 2007, PloS one.

[27]  N. Ferguson,et al.  Frequent Travelers and Rate of Spread of Epidemics , 2007, Emerging infectious diseases.