Domain-Specific Use Cases for Knowledge-Enabled Social Media Analysis

Social media provides a virtual platform for users to share and discuss their daily life, activities, opinions, health, feelings, etc. Such personal accounts readily generate Big Data marked by velocity, volume, value, variety, and veracity challenges. This type of Big Data analytics already supports useful investigations ranging from research into data mining and developing public policy to actions targeting an individual in a variety of domains such as branding and marketing, crime and law enforcement, crisis monitoring and management, as well as public and personalized health management. However, using social media to solve domain-specific problem is challenging due to complexity of the domain, lack of context, colloquial nature of language, and changing topic relevance in temporally dynamic domain. In this article, we discuss the need to go beyond data-driven machine learning and natural language processing, and incorporate deep domain knowledge as well as knowledge of how experts and decision makers explore and perform contextual interpretation. Four use cases are used to demonstrate the role of domain knowledge in addressing each challenge.

[1]  M. Haselton,et al.  The Evolution of Cognitive Bias , 2015 .

[2]  Amit P. Sheth,et al.  Assisting coordination during crisis: a domain ontology based approach to infer resource needs from tweets , 2014, WebSci '14.

[3]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[4]  R. Spitzer,et al.  The PHQ-9 , 2001, Journal of General Internal Medicine.

[5]  Amit P. Sheth,et al.  EmojiNet: Building a Machine Readable Sense Inventory for Emoji , 2016, SocInfo.

[6]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[7]  Amit P. Sheth,et al.  Context and Domain Knowledge Enhanced Entity Spotting in Informal Text , 2009, SEMWEB.

[8]  Amit P. Sheth,et al.  PREDOSE: A semantic web platform for drug abuse epidemiology using social media , 2013, J. Biomed. Informatics.

[9]  Simone Paolo Ponzetto,et al.  Knowledge-based graph document modeling , 2014, WSDM.

[10]  Stephen H. Bach,et al.  Hinge-Loss Markov Random Fields and Probabilistic Soft Logic , 2015, J. Mach. Learn. Res..

[11]  Sören Auer,et al.  AGDISTIS - Graph-Based Disambiguation of Named Entities Using Linked Data , 2014, International Semantic Web Conference.

[12]  Amit P. Sheth,et al.  System and Method for Creating Semantic Web and Its Applications in Browsing, Searching, Profiling , 2001 .

[13]  Francois R. Lamy,et al.  "Time for dabs": Analyzing Twitter data on marijuana concentrates across the U.S. , 2015, Drug and alcohol dependence.

[14]  Krishnaprasad Thirunarayan,et al.  Knowledge will propel machine understanding of content: extrapolating from current examples , 2016, WI.

[15]  K. Vowles,et al.  Rates of opioid misuse, abuse, and addiction in chronic pain: a systematic review and data synthesis , 2015, Pain.

[16]  Marie Hudson,et al.  Comparison of the PHQ-9 and CES-D depression scales in systemic sclerosis: internal consistency reliability, convergent validity and clinical correlates. , 2010, Rheumatology.

[17]  Pushpak Bhattacharyya,et al.  TwiSent: a multistage system for analyzing sentiment in twitter , 2012, CIKM '12.

[18]  Ana-Maria Popescu,et al.  Democrats, republicans and starbucks afficionados: user classification in twitter , 2011, KDD.

[19]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.

[20]  Geert-Jan Houben,et al.  Twitcident: fighting fire with information from social web streams , 2012, WWW.

[21]  M. Martínez-Gutiérrez,et al.  Diversity of susceptible hosts in canine distemper virus infection: a systematic review and data synthesis , 2016, BMC Veterinary Research.

[22]  Frank M. Perna,et al.  The Benefits of Exercise for the Clinically Depressed. , 2004, Primary care companion to the Journal of clinical psychiatry.

[23]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[24]  Amit P. Sheth,et al.  Managing Semantic Content for the Web , 2002, IEEE Internet Comput..

[25]  Francis R. Bach,et al.  Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression , 2016, J. Mach. Learn. Res..

[26]  Sarasi Lalithsena,et al.  Domain-specific Knowledge Extraction from the Web of Data , 2018 .

[27]  Naomie Salim,et al.  Recognition of side effects as implicit-opinion words in drug reviews , 2016, Online Inf. Rev..

[28]  S. Gelman,et al.  Mapping the Mind: Domain Specificity In Cognition And Culture , 1994 .

[29]  Loren G. Terveen,et al.  "Blissfully Happy" or "Ready toFight": Varying Interpretations of Emoji , 2016, ICWSM.

[30]  Amit P. Sheth,et al.  What Are People Tweeting About Zika? An Exploratory Study Concerning Its Symptoms, Treatment, Transmission, and Prevention , 2017, JMIR public health and surveillance.

[31]  Amit P. Sheth,et al.  Are Twitter Users Equal in Predicting Elections? A Study of User Groups in Predicting 2012 U.S. Republican Presidential Primaries , 2012, SocInfo.

[32]  Amit P. Sheth,et al.  User Interests Identification on Twitter Using a Hierarchical Knowledge Base , 2014, ESWC.

[33]  Amit P. Sheth,et al.  Implicit Entity Linking in Tweets , 2016, ESWC.

[34]  Mark Strembeck,et al.  Identifying Emotions in Social Media: Comparison of Word-Emotion Lexicons , 2017, 2017 5th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[35]  Petra Kralj Novak,et al.  Sentiment of Emojis , 2015, PloS one.

[36]  Fernando Diaz,et al.  Emergency-relief coordination on social media: Automatically matching resource requests and offers , 2013, First Monday.

[37]  Amit P. Sheth,et al.  Domain-specific hierarchical subgraph extraction: A recommendation use case , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[38]  Amit P. Sheth,et al.  Semi-Supervised Approach to Monitoring Clinical Depressive Symptoms in Social Media , 2017, ASONAM.

[39]  Amit P. Sheth,et al.  Challenges of Sentiment Analysis for Dynamic Events , 2017, IEEE Intelligent Systems.

[40]  Tommaso Di Noia,et al.  Top-N recommendations from implicit feedback leveraging linked open data , 2013, IIR.

[41]  Andrew N. Smith,et al.  How Does Brand-related User-generated Content Differ across YouTube, Facebook, and Twitter? , 2012 .