Vision: towards real time epidemic vigilance through online social networks: introducing SNEFT -- social network enabled flu trends

Our vision is to achieve faster and near real time detection and prediction of the emergence and spread of an influenza epidemic, through sophisticated data collection and analysis of Online Social Networks (OSNs) such as Facebook, MySpace, and Twitter. In particular, we present the design of a system called SNEFT (Social Network Enabled Flu Trends), which will be developed in a 12-month SBIR (Small Business Innovation Research) project funded by the National Institutes of Health (NIH). We describe the innovative technologies that will be developed in this project for collecting and aggregating OSN data, extracting information from it, and integrating it with mathematical models of influenza. One of the monitoring tools used by the Centers for Disease Control and Prevention (CDC) is reports of Influenza-Like Illness (ILI) cases; these reports are authoritative but typically have a delay of one to two weeks due to the largely manual process. We describe the SNEFT prototype in the context of predicting ILI cases well in advance of the CDC reports. We observe that OSN data is individually noisy but collectively revealing, and speculate on other applications that can potentially be enabled by OSN data collection and analysis.

[1]  J. Robins,et al.  Transmission Dynamics and Control of Severe Acute Respiratory Syndrome , 2003, Science.

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[4]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[5]  D. Brockmann,et al.  Human Mobility and Spatial Disease Dynamics , 2010 .

[6]  A. Roddam Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation O Diekmann and JAP Heesterbeek, 2000, Chichester: John Wiley pp. 303, £39.95. ISBN 0-471-49241-8 , 2001 .

[7]  Pejman Rohani,et al.  Appropriate Models for the Management of Infectious Diseases , 2005, PLoS medicine.

[8]  J. O. Irwin,et al.  MATHEMATICAL EPIDEMIOLOGY , 1958 .

[9]  T. Geisel,et al.  Forecast and control of epidemics in a globalized world. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[11]  Alessandro Vespignani,et al.  influenza A(H1N1): a Monte Carlo likelihood analysis based on , 2009 .

[12]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[13]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[14]  A. Nizam,et al.  Containing Pandemic Influenza at the Source , 2005, Science.

[15]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[16]  D. Cummings,et al.  Strategies for containing an emerging influenza pandemic in Southeast Asia , 2005, Nature.

[17]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.