A survey on location estimation techniques for events detected in Twitter

Detecting events from voluntarily generated content in microblogs has been the objective of numerous recent studies. One essential challenge tackled in these studies is estimating the locations of events. In this paper, we review the state-of-the-art location estimation techniques used to localize events detected in microblogs, particularly in Twitter, which is one of the most popular microblogging platforms worldwide. We analyze these techniques with respect to the targeted event type, the granularity of the estimated locations, the location-related features selected as sources of spatial evidence, and the method used to make aggregate decisions based on the extracted evidence. We discuss the strengths of alternative solutions to various problems related to location estimation, as well as their preconditions and limitations. We examine the most widely used evaluation methods for assessing the accuracy of the estimates and present the results reported in the literature. We also discuss our findings and highlight important research challenges that need further attention.
