Unfolding social content evolution along time and semantics

Abstract In the context of social media, the unstructured and dynamic nature of exchanged data and the information overload contribute to the growth of the number of research works proposing methods to improve performance of intelligent analytics services considering both time and semantics of the shared content. The presented paper focuses on the definition of a knowledge tracking framework to answer questions, such as “What is the semantic evolution of a topic (or news) along the time?”, “How did we arrive to a specific event?”, “What is the evolution of the topics of interest of a user?”, and so on. Our interest is about the elicitation of temporal patterns revealing the evolution of concepts along the time from a social media data stream; we focus on Twitter. Such patterns can be extracted at different levels of abstraction by considering different-sized time intervals and different scopes driven by the conceptualization of users’ queries. To address the proposed aim, we extend Temporal Concept Analysis and we use Description Logic to reason on semantically represented tweet streams. The evaluation activity reveals promising results from both sides quantitative and qualitative.

[1]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[2]  James Allan,et al.  Topic detection and tracking: event-based information organization , 2002 .

[3]  Peter F. Patel-Schneider,et al.  Ontology Constraints in Incomplete and Complete Data , 2012, International Semantic Web Conference.

[4]  Karl Erich Wolff,et al.  States, Transitions, and Life Tracks in Temporal Concept Analysis , 2005, Formal Concept Analysis.

[5]  Freddy Chong Tat Chua,et al.  Automatic Summarization of Events from Social Media , 2013, ICWSM.

[6]  Mimmo Parente,et al.  Online query-focused twitter summarizer through fuzzy lattice , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[7]  Nafees Ur Rehman,et al.  Discovering OLAP dimensions in semi-structured data , 2014, Inf. Syst..

[8]  Vincenzo Loia,et al.  Hybrid methodologies to foster ontology-based knowledge management platform , 2013, 2013 IEEE Symposium on Intelligent Agents (IA).

[9]  Ian Horrocks,et al.  The Even More Irresistible SROIQ , 2006, KR.

[10]  Daniele Braga,et al.  C-SPARQL: SPARQL for continuous querying , 2009, WWW '09.

[11]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[12]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[13]  Jason J. Jung,et al.  Real-time Event Detection on Social Data Stream , 2014, Mobile Networks and Applications.

[14]  Marco Rospocher,et al.  A 2-phase frame-based knowledge extraction framework , 2016, SAC.

[15]  Jimmy J. Lin,et al.  Smoothing techniques for adaptive online language models: topic tracking in tweet streams , 2011, KDD.

[16]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[17]  Boris Motik,et al.  OWL 2 Web Ontology Language: structural specification and functional-style syntax , 2008 .

[18]  Sunitha Abburu,et al.  A Survey on Ontology Reasoners and Comparison , 2012 .

[19]  Wenjie Li,et al.  Sequential Summarization: A Full View of Twitter Trending Topics , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[20]  Geoffrey C. Fox,et al.  Twister: a runtime for iterative MapReduce , 2010, HPDC '10.

[21]  Jugal K. Kalita,et al.  Experiments in Microblog Summarization , 2010, 2010 IEEE Second International Conference on Social Computing.

[22]  Mimmo Parente,et al.  Towards OLAP Analysis of Multidimensional Tweet Streams , 2015, DOLAP.

[23]  Gabriella Pasi,et al.  Lattice navigation for collaborative filtering by means of (fuzzy) formal concept analysis , 2013, SAC '13.

[24]  Kazufumi Watanabe,et al.  Jasmine: a real-time local-event detection system based on geolocation information propagated to microblogs , 2011, CIKM '11.

[25]  Ambuj K. Singh,et al.  The social media genome: Modeling individual topic-specific behavior in social media , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[26]  Paola Velardi,et al.  Efficient temporal mining of micro-blog texts and its application to event discovery , 2015, Data Mining and Knowledge Discovery.

[27]  Ruairí de Fréin,et al.  Distributed Formal Concept Analysis Algorithms Based on an Iterative MapReduce Framework , 2012, ICFCA.

[28]  Ahmed Y. Tawfik,et al.  Towards a Temporal Extension of Formal Concept Analysis , 2001, Canadian Conference on AI.

[29]  Vincenzo Loia,et al.  Hierarchical web resources retrieval by exploiting Fuzzy Formal Concept Analysis , 2012, Inf. Process. Manag..

[30]  Ilknur Celik,et al.  Leveraging the Semantics of Tweets for Adaptive Faceted Search on Twitter , 2011, SEMWEB.

[31]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[32]  Mimmo Parente,et al.  Time Aware Knowledge Extraction for microblog summarization on Twitter , 2015, Inf. Fusion.

[33]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[34]  Christopher M. Danforth,et al.  Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter , 2011, PloS one.

[35]  Jason J. Jung,et al.  Social big data: Recent achievements and new challenges , 2015, Information Fusion.

[36]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[37]  Jonas Poelmans,et al.  Analyzing Chat Conversations of Pedophiles with Temporal Relational Semantic Systems , 2012, 2012 European Intelligence and Security Informatics Conference.

[38]  Werner Nutt,et al.  Basic Description Logics , 2003, Description Logic Handbook.