The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic

Twitter is a free social networking and micro-blogging service that enables its millions of users to send and read each other's “tweets,” or short, 140-character messages. The service has more than 190 million registered users and processes about 55 million tweets per day. Useful information about news and geopolitical events lies embedded in the Twitter stream, which embodies, in the aggregate, Twitter users' perspectives and reactions to current events. By virtue of sheer volume, content embedded in the Twitter stream may be useful for tracking or even forecasting behavior if it can be extracted in an efficient manner. In this study, we examine the use of information embedded in the Twitter stream to (1) track rapidly-evolving public sentiment with respect to H1N1 or swine flu, and (2) track and measure actual disease activity. We also show that Twitter can be used as a measure of public interest or concern about health-related events. Our results show that estimates of influenza-like illness derived from Twitter chatter accurately track reported disease levels.

[1]  P. Krause,et al.  Sales of Nonprescription Cold Remedies: A Unique Method of Influenza Surveillance , 1979, Pediatric Research.

[2]  S B Thacker,et al.  Application of multiple time series analysis to the estimation of pneumonia and influenza mortality by age 1962-1983. , 1988, Statistics in medicine.

[3]  Sholom M. Weiss,et al.  Computer Systems That Learn , 1990 .

[4]  D D Lenaway,et al.  Evaluation of a school-based influenza surveillance system. , 1995, Public health reports.

[5]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[6]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[7]  J. Marc Overhage,et al.  Research Paper: Detection of Pediatric Respiratory and Diarrheal Outbreaks from Sales of Over-the-counter Electrolyte Products , 2003, J. Am. Medical Informatics Assoc..

[8]  Joe Suyama,et al.  Surveillance of infectious disease occurrences in the community: an analysis of symptom presentation in the emergency department. , 2003, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[9]  S. Magruder Evaluation of Over-the-Counter Pharmaceutical Sales As a Possible Early Warning Indicator of Human Disease , 2003 .

[10]  Michael M. Wagner,et al.  Telephone Triage: A Timely Data Source for Surveillance of Influenza-like Diseases , 2003, AMIA.

[11]  C. Irvin,et al.  Syndromic analysis of computerized emergency department patients' chief complaints: an opportunity for bioterrorism and influenza surveillance. , 2003, Annals of emergency medicine.

[12]  G. R. Davies,et al.  Sales of over-the-counter remedies as an early warning system for winter bed crises. , 2003, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[13]  Christine Yuan,et al.  Syndromic surveillance at hospital emergency departments--southeastern Virginia. , 2004, MMWR supplements.

[14]  Crystale Purvis Cooper,et al.  Cancer Internet Search Activity on a Major Search Engine, United States 2001-2003 , 2005, Journal of medical Internet research.

[15]  W. Thompson,et al.  Epidemiology of seasonal influenza: use of surveillance data and statistical models to estimate the burden of disease. , 2006, The Journal of infectious diseases.

[16]  F. Nelson,et al.  Use of prediction markets to forecast infectious disease activity. , 2007, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[17]  Lynne Dailey,et al.  Research Paper: Timeliness of Data Sources Used for Influenza Surveillance , 2007, J. Am. Medical Informatics Assoc..

[18]  David M. Pennock,et al.  Using internet searches for influenza surveillance. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[19]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[20]  Nello Cristianini,et al.  Tracking the flu pandemic by monitoring the social web , 2010, 2010 2nd International Workshop on Cognitive Information Processing.

[21]  Patty Kostkova,et al.  The potential of social networks for early warning nad outbreak detection systems: the swine flu Twitter study , 2010 .

[22]  Celeste Biever Twitter mood maps reveal emotional states of America , 2010 .

[23]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[24]  Jim Giles Barcodes help objects tell their stories , 2010 .

[25]  Jim Giles Blogs and tweets could predict the future , 2010 .

[26]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.