EuroGames16: Evaluating Change Detection in Online Conversation

We introduce the challenging task of detecting changes from an online conversation. Our goal is to detect significant changes in, for example, sentiment or topic in a stream of messages that are part of an ongoing conversation. Our approach relies on first applying linguistic preprocessing or collecting simple statistics on the messages in the conversation in order to build a time series. Change point detection algorithms are then applied to identify the location of significant changes in the distribution of the underlying time series. We present a collection of sport events on which we can evaluate the performance of our change detection method. Our experiments, using several change point detection algorithms and several types of time series, show that it is possible to detect salient changes in an on-line conversation with relatively high accuracy.

[1]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[2]  Tetsuya Sakai,et al.  TREC 2014 Temporal Summarization Track Overview , 2014, TREC.

[3]  Fei Yao,et al.  BJUT at TREC 2015 Temporal Summarization Track , 2015, TREC.

[4]  Michael Fimin,et al.  Breaking bad: avoiding the 10 worst IT admin habits , 2016, Netw. Secur..

[5]  Cyril Goutte,et al.  Detecting Changes in Twitter Streams using Temporal Clusters of Hashtags , 2017, NEWS@ACL.

[6]  Stefan Selzer,et al.  Storyline detection and tracking using Dynamic Latent Dirichlet Allocation , 2016, Proceedings of the 2nd Workshop on Computing News Storylines (CNS 2016).

[7]  Gábor J. Székely,et al.  Hierarchical Clustering via Joint Between-Within Distances: Extending Ward's Minimum Variance Method , 2005, J. Classif..

[8]  Jakub Piskorski,et al.  On the Creation of a Security-Related Event Corpus , 2017, NEWS@ACL.

[9]  Chandra Erdman,et al.  bcp: An R Package for Performing a Bayesian Analysis of Change Point Problems , 2007 .

[10]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[11]  G. Fry Standards and technology. , 1979, Journal of the American Optometric Association.

[12]  J. Hartigan,et al.  A Bayesian Analysis for Change Point Problems , 1993 .

[13]  David S. Matteson,et al.  Leveraging cloud data to mitigate user experience from ‘breaking bad’ , 2014, 2016 IEEE International Conference on Big Data (Big Data).

[14]  Marti A. Hearst,et al.  newsLens: building and visualizing long-ranging news stories , 2017, NEWS@ACL.

[15]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[16]  Ryan P. Adams,et al.  Bayesian Online Changepoint Detection , 2007, 0710.3742.

[17]  V Wensley Koch,et al.  Breaking bad. , 2014, Journal of the American Veterinary Medical Association.

[18]  Fry Ga,et al.  Standards and technology. , 1979 .

[19]  David S. Matteson,et al.  ecp: An R Package for Nonparametric Multiple Change Point Analysis of Multivariate Data , 2013, 1309.3295.

[20]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..