Role of Online Data from Search Engine and Social Media in Healthcare Informatics

Search engines and social media are two different online data sources where search engines can provide health related queries logs and Internet users’ discuss their diseases, symptoms, causes, preventions and even suggest treatment by sharing their views, experiences and opinions on social media. This chapter hypothesizes that online data from Google and Twitter can provide vital first-hand healthcare information. An approach is provided for collecting twitter data by exploring contextual information gleaned from Google search queries logs. Furthermore, it is investigated that whether it is possible to use tweets to track, monitor and predict diseases, especially Influenza epidemics. Obtained results show that healthcare institutes and professional’s uses social media to provide up-to date health related information and interact with public. Moreover, proposed approach is beneficial for extracting useful information regarding disease symptoms, side effects, medications and to track geographical location of epidemics affected area.

[1]  Jieping Ye,et al.  Tuberculosis Surveillance by Analyzing Google Trends , 2011, IEEE Transactions on Biomedical Engineering.

[2]  Wendy W. Chapman,et al.  Analysis of Web Access Logs for Surveillance of Influenza , 2004, MedInfo.

[3]  S. Magruder Evaluation of Over-the-Counter Pharmaceutical Sales As a Possible Early Warning Indicator of Human Disease , 2003 .

[4]  A. Flahault,et al.  More Diseases Tracked by Using Google Trends , 2009, Emerging infectious diseases.

[5]  Omar El Beqqali,et al.  Harnessing Semantic Features for Large-Scale Content-Based Hashtag Recommendations on Microblogging Platforms , 2017 .

[6]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[7]  Chang-Gun Lee,et al.  Estimating Influenza Outbreaks Using Both Search Engine Query Data and Social Media Data in South Korea , 2016, Journal of medical Internet research.

[8]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[9]  Terry Kind,et al.  Social Media and Clinical Care: Ethical, Professional, and Social Implications , 2013, Circulation.

[10]  Justin Zhijun Zhan,et al.  Sentiment analysis using product review data , 2015, Journal of Big Data.

[11]  George Forman,et al.  Learning from Little: Comparison of Classifiers Given Little Training , 2004, PKDD.

[12]  Kara Dawson,et al.  Protected Health Information on Social Networking Sites: Ethical and Legal Considerations , 2011, Journal of medical Internet research.

[13]  Mowafa Said Househ,et al.  The Use of Social Media in Healthcare: Organizational, Clinical, and Patient Perspectives , 2013, ITCH.

[14]  Basit Shahzad,et al.  Maximization of Tweet's viewership with respect to time , 2014, 2014 World Symposium on Computer Applications & Research (WSCAR).

[15]  Stephen S Morse,et al.  Public health surveillance and infectious disease detection. , 2012, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[16]  Claire Cardie,et al.  Early Stage Influenza Detection from Twitter , 2013, ArXiv.

[17]  Yiqun Liu,et al.  Predicting Epidemic Tendency through Search Behavior Analysis , 2011, IJCAI.

[18]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[19]  Sushruta Mishra,et al.  Impact of Swarm Intelligence Techniques in Diabetes Disease Risk Prediction , 2016, Int. J. Knowl. Discov. Bioinform..

[20]  Ram Gopal Raj,et al.  An application of case-based reasoning with machine learning for forensic autopsy , 2014, Expert Syst. Appl..

[21]  Umar Saif,et al.  FluBreaks: early epidemic detection from Google flu trends. , 2012, Journal of medical Internet research.

[22]  Arjun Mukherjee,et al.  Spotting fake reviewer groups in consumer reviews , 2012, WWW.

[23]  Basit Shahzad,et al.  When are Tweets Better Valued? An Empirical Study , 2014, J. Univers. Comput. Sci..

[24]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[25]  Trevor Strome,et al.  “Google Flu Trends” and Emergency Department Triage Data Predicted the 2009 Pandemic H1N1 Waves in Manitoba , 2011, Canadian journal of public health = Revue canadienne de sante publique.

[26]  Davis Jh,et al.  Continuing education electronic bulletin board system: provider readiness and interest. , 1990 .

[27]  Nigel Collier,et al.  Uncovering text mining: A survey of current work on web-based epidemic intelligence , 2012, Global public health.

[28]  C. L. Ventola Social media and health care professionals: benefits, risks, and best practices. , 2014, P & T : a peer-reviewed journal for formulary management.

[29]  David M. Pennock,et al.  Using internet searches for influenza surveillance. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[30]  G. Eysenbach,et al.  Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak , 2010, PloS one.

[31]  A. Hulth,et al.  Web Queries as a Source for Syndromic Surveillance , 2009, PloS one.

[32]  Son Doan,et al.  BioCaster: detecting public health rumors with a Web-based text mining system , 2008, Bioinform..

[33]  Eleftherios Mylonakis,et al.  Google trends: a web-based tool for real-time surveillance of disease outbreaks. , 2009, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[34]  Ralph Grishman,et al.  Information extraction for enhanced access to disease outbreak reports , 2002, J. Biomed. Informatics.

[35]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[36]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[37]  David Lazer,et al.  Twitter: big data opportunities--response. , 2014, Science.

[38]  Louis Leung Internet Embeddedness: Links with Online Health Information Seeking, Expectancy Value/Quality of Health Information Websites, and Internet Usage Patterns , 2008, Cyberpsychology Behav. Soc. Netw..

[39]  Kim-Fung Man,et al.  Cardiovascular diseases identification using electrocardiogram health identifier based on multiple criteria decision making , 2015, Expert Syst. Appl..

[40]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[41]  Jae Ho Lee,et al.  Correlation between National Influenza Surveillance Data and Google Trends in South Korea , 2013, PloS one.

[42]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[43]  Monika Henzinger The Past, Present, and Future of Web Search Engines p , 2004, ICALP.

[44]  Sotiris Kotsiantis,et al.  Text Classification Using Machine Learning Techniques , 2005 .

[45]  John H. Gerdes,et al.  Using web-based search data to predict macroeconomic statistics , 2005, CACM.

[46]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[47]  Qaiser Abbas,et al.  Comparative Study of Feature Selection Approaches for Urdu Text Categorization , 2015 .

[48]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[49]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[50]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[51]  Vincent S. Tseng,et al.  Paramaterless Clustering Techniques for Gene Expression Analysis , 2006 .

[52]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[53]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[54]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[55]  E. Gabrilovich,et al.  Postmarket Drug Surveillance Without Trial Costs: Discovery of Adverse Drug Reactions Through Large-Scale Analysis of Web Search Queries , 2013, Journal of medical Internet research.

[56]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[57]  Matthew Mohebbi,et al.  Assessing Google Flu Trends Performance in the United States during the 2009 Influenza Virus A (H1N1) Pandemic , 2011, PloS one.