Classifying Information from Microblogs during Epidemics

At the outbreak of an epidemic, affected communities want/need to get aware of disease symptoms, preventive measures, and treatment strategies. On the other hand, health organizations try to get situational updates to assess the severity of the outbreak, known affected cases, and other details. Recent emergence of social media platforms such as Twitter provide convenient ways and fast access to disseminate and consume information to/from a wider audience. Research studies have shown potential of this online information to address information needs of concerned authorities during outbreaks, epidemics, and pandemics. In this work, we target three communities (i) people who are not affected yet and are looking for prevention-related information (ii) people who are affected and looking for treatment-related information, and (iii) health organizations like WHO, who are interested in gaining situational awareness to make timely decisions. We use Twitter data from two recent outbreaks (Ebola and MERS) to built an automatic classification approach using low level lexical features which are useful to categorize tweets into different disease-related categories.

[1]  E. Larson,et al.  Dissemination of health information through social networks: twitter and antibiotics. , 2010, American journal of infection control.

[2]  George Hripcsak,et al.  Automated encoding of clinical documents based on natural language processing. , 2004, Journal of the American Medical Informatics Association : JAMIA.

[3]  Pengzhu Zhang,et al.  Exploring Health-Related Topics in Online Health Community Using Cluster Analysis , 2013, 2013 46th Hawaii International Conference on System Sciences.

[4]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[5]  Niloy Ganguly,et al.  Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach , 2015, CIKM.

[6]  Elad Yom-Tov Ebola data from the Internet: An Opportunity for Syndromic Surveillance or a News Event? , 2015, Digital Health.

[7]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[8]  Sanda M. Harabagiu,et al.  Medical Question Answering for Clinical Decision Support , 2016, CIKM.

[9]  Kent A. Spackman,et al.  SNOMED clinical terms: overview of the development process and project status , 2001, AMIA.

[10]  Muhammad Imran,et al.  Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages , 2016, LREC.

[11]  Xiaodong Wang,et al.  When MetaMap Meets Social Media in Healthcare: Are the Word Labels Correct? , 2016, AIRS.

[12]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[13]  Daniel T. Heinze,et al.  Mining free-text medical records , 2001, AMIA.

[14]  Xin Tu,et al.  Social Structure and Depression in TrevorSpace , 2014, CSCW.

[15]  Theocharis Kyriacou,et al.  #hayfever; A Longitudinal Study into Hay Fever Related Tweets in the UK , 2016, Digital Health.

[16]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[17]  D. Milne,et al.  The role of the Internet in supporting and informing carers of people with cancer: a literature review , 2010, Supportive Care in Cancer.

[18]  M. Gribaudo,et al.  2002 , 2001, Cell and Tissue Research.

[19]  C. Dolea,et al.  World Health Organization , 1949, International Organization.

[20]  Munmun De Choudhury,et al.  Anorexia on Tumblr: A Characterization Study , 2015, Digital Health.

[21]  J. Austin,et al.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. , 2002, Radiology.

[22]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[23]  Amber M. Angell,et al.  The social life of health records: understanding families' experiences of autism. , 2014, Social science & medicine.

[24]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[25]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[26]  A. James 2010 , 2011, Philo of Alexandria: an Annotated Bibliography 2007-2016.