A review of influenza detection and prediction through social networking sites

Early prediction of seasonal epidemics such as influenza may reduce their impact on daily life. The web is now a viable source for disease surveillance: search engines and social networking sites can be used to track disease trends seven to ten days faster than government agencies such as the Centers for Disease Control and Prevention (CDC). The CDC relies on the Influenza-Like Illness Surveillance Network (ILINet), a program that monitors reports of influenza-like illness (ILI) submitted by thousands of health care providers in order to detect influenza outbreaks. ILINet is reliable, but it is slow and expensive. For that reason, many studies aim to develop methods that track ILI in real time using social networking sites. Data from social media platforms such as Twitter can be used to predict the spread of flu in the population and to provide early warnings. Social networking sites (SNS) are now widely used to share thoughts and even health status. SNS therefore provide an efficient resource for disease surveillance and an effective channel for communication aimed at preventing disease outbreaks. The goal of this study is to review existing alternative solutions that track flu outbreaks in real time using social networking sites and web blogs. Many studies have shown that social networking sites can support real-time analysis that improves predictions.
