Sentinel: A Codesigned Platform for Semantic Enrichment of Social Media Streams

We introduce the Sentinel platform that supports semantic enrichment of streamed social media data for the purposes of situational understanding. The platform is the result of a codesign effort between computing and social scientists, iteratively developed through a series of pilot studies. The platform is founded upon a knowledge-based approach, in which input streams (channels) are characterized by spatial and terminological parameters, collected media is preprocessed to identify significant terms (signals), and data are tagged (framed) in relation to an ontology. Interpretation of processed media is framed in terms of the 5W framework (who, what, when, where, and why). The platform is designed to be open to the incorporation of new processing modules, building on the knowledge-based elements (channels, signals, and framing ontology) and accessible via a set of user-facing apps. We present the conceptual architecture for the platform, discuss the design and implementation challenges of the underlying stream-processing system, and present a number of apps developed in the context of the pilot studies, highlighting the strengths and importance of the codesign approach and indicating promising areas for future research.

[1]  Mike Uschold,et al.  Building Ontologies: Towards a Unified Methodology , 1996 .

[2]  Russ B. Altman,et al.  RiboWeb: An Ontology-Based System for Collaborative Molecular Biology , 1999, IEEE Intell. Syst..

[3]  Barry Smith,et al.  Biodynamic ontology: applying BFO in the biomedical domain. , 2004, Studies in health technology and informatics.

[4]  Steve Vinoski,et al.  Advanced Message Queuing Protocol , 2006, IEEE Internet Computing.

[5]  Midori A. Harris,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm112 Databases and ontologies OBO-Edit—an ontology editor for biologists , 2007 .

[6]  Alun D. Preece,et al.  The International Technology Alliance in Network and Information Sciences , 2007, IEEE Intelligent Systems.

[7]  Amit P. Sheth,et al.  Spatio-Temporal-Thematic Analysis of Citizen Sensor Data: Challenges and Experiences , 2009, WISE.

[8]  David A. Shamma,et al.  Tweetgeist : Can the Twitter Timeline Reveal the Structure of Broadcast Events ? , 2009 .

[9]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[10]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[11]  Michael S. Bernstein,et al.  Twitinfo: aggregating and visualizing microblogs for event exploration , 2011, CHI.

[12]  Huiji Gao,et al.  Harnessing the Crowdsourcing Power of Social Media for Disaster Relief , 2011, IEEE Intelligent Systems.

[13]  Daniel Gayo-Avello,et al.  A Meta-Analysis of State-of-the-Art Electoral Prediction From Twitter Data , 2012, ArXiv.

[14]  Marta Sabou,et al.  Visualizing Contextual and Dynamic Features of Micropost Streams , 2012, #MSM.

[15]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[16]  Mani Srivastava,et al.  Human-centric sensing , 2012, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[17]  Geert-Jan Houben,et al.  Semantics + filtering + search = twitcident. exploring information in social web streams , 2012, HT '12.

[18]  Martin Innes,et al.  Can we speak in confidence? Community intelligence and neighbourhood policing v2.0 , 2012 .

[19]  Tarek F. Abdelzaher,et al.  On truth discovery in social sensing: A maximum likelihood estimation approach , 2012, International Symposium on Information Processing in Sensor Networks.

[20]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[21]  Tram Truong Huu,et al.  Bundle and Pool Architecture for Multi-Language, Robust, Scalable Workflow Executions , 2013, Journal of Grid Computing.

[22]  Alun D. Preece,et al.  FlexiTerm: a flexible term recognition method , 2013, J. Biomed. Semant..

[23]  Pericles A. Mitkas,et al.  Event identification in web social media through named entity recognition and topic modeling , 2013, Data Knowl. Eng..

[24]  Yiannis Kompatsiaris,et al.  Sensing Trending Topics in Twitter , 2013, IEEE Transactions on Multimedia.

[25]  Martin Innes,et al.  Signal Crimes: Social Reactions to Crime, Disorder, and Control , 2014 .

[26]  Iadh Ounis,et al.  Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media , 2014, ACL.

[27]  Lada A. Adamic,et al.  Computational Social Science , 2009, Science.

[28]  Mark Dredze,et al.  Facebook, Twitter and Google Plus for Breaking News: Is There a Winner? , 2014, ICWSM.

[29]  Charu C. Aggarwal,et al.  Using humans as sensors: An estimation-theoretic perspective , 2014, IPSN-14 Proceedings of the 13th International Symposium on Information Processing in Sensor Networks.

[30]  Jeffrey Nichols,et al.  Home Location Identification of Twitter Users , 2014, TIST.

[31]  Kalina Bontcheva,et al.  Making sense of social media streams through semantics: A survey , 2014, Semantic Web.

[32]  Chang-Tien Lu,et al.  Misinformation Propagation in the Age of Twitter , 2014, Computer.

[33]  Dong Wang,et al.  Social Sensing: Building Reliable Systems on Unreliable Data , 2015 .

[34]  Ammatzia Peled,et al.  Real‐Time Major Events Monitoring and Alert System Through Social Networks , 2015 .

[35]  Sarah Vieweg,et al.  Processing Social Media Messages in Mass Emergency , 2014, ACM Comput. Surv..

[36]  Alun Preece,et al.  Ten “Rs” of Social Reaction: Using Social Media to Analyse the “Post-Event” Impacts of the Murder of Lee Rigby , 2016 .

[37]  Michael Grossniklaus,et al.  An evaluation of the run-time and task-based performance of event detection techniques for Twitter , 2015, Inf. Syst..

[38]  Michael Grossniklaus,et al.  Situation monitoring of urban areas using social media data streams , 2016, Inf. Syst..

[39]  Daniel A. Keim,et al.  A Survey on Visual Analytics of Social Media Data , 2016, IEEE Transactions on Multimedia.

[40]  Tien Pham,et al.  Network and Information Sciences (NIS) International Technology Alliance (ITA) , 2016 .

[41]  Kalina Bontcheva,et al.  A framework for real-time semantic social media analysis , 2017, J. Web Semant..

[42]  David Rogers,et al.  After Woolwich: analyzing open source communications to understand the interactive and multi-polar dynamics of the arc of conflict , 2018 .

[43]  P. Pirolli,et al.  The Sensemaking Process and Leverage Points for Analyst Technology as Identified Through Cognitive Task Analysis , 2007 .