SNSJam: Road traffic analysis and prediction by fusing data from multiple social networks

Abstract The increased popularity of micro-blogging applications together with the widespread of location-aware devices have resulted in the creation of large streams of geo-tagged data. Such data provides a great opportunity for researchers to explore event detection and prediction. In particular, road traffic detection and prediction are of great importance to various applications, i.e. Intelligent Transportation Systems. Current works proposed traffic jam detection from a single data source with a single language. However, for countries where the residents are speaking two, or more, languages and are interacting with more than one online social platform, single-language and single-source systems are insufficient to capture the necessary online information. Therefore, in this paper, we introduce SNSJam, an effective system to detect and predict road traffic jams using cross-lingual (English and Arabic) data collected from multiple dynamic sources, such as Twitter and Instagram. SNSJam classifier not only detect traffic events, but also identifies the causes of traffic jams. To identify the location of a traffic event, a Location Recognizer is developed that extracts locations from text and GPS of the post. Additionally, the Location Recognizer supports user-defined locations, which are common names among people. Our experiments show that by combining Arabic and English data streams, the accuracies of traffic events detection and prediction are significantly improved as compared with that of the individual languages. Additionally, combining data streams from multiple sources (Twitter and Instagram) further improved the accuracies of event detection and prediction over any individual source. A visualization interface was developed to show the detected spatio-temporal traffic events on a dynamic map. The detection and prediction results are validated against ground truth data obtained from the concerned authorities in the UAE.

[1]  Freddy Lécué,et al.  Westland row why so slow?: fusing social media and linked data sources for understanding real-time traffic conditions , 2013, IUI '13.

[2]  M. Williams,et al.  Knowing the Tweeters: Deriving Sociologically Relevant Demographics from Twitter , 2013 .

[3]  Xiao Wang,et al.  Detecting Traffic Information From Social Media Texts With Deep Learning Approaches , 2018, IEEE Transactions on Intelligent Transportation Systems.

[4]  Fabrizio Sebastiani,et al.  QT2S: A System for Monitoring Road Traffic Via Fine Grounding of Tweets , 2017, ICWSM.

[5]  Nirmalya Roy,et al.  Analyzing Social Media Texts and Images to Assess the Impact of Flash Floods in Cities , 2017, 2017 IEEE International Conference on Smart Computing (SMARTCOMP).

[6]  Benyuan Liu,et al.  Twitter Improves Seasonal Influenza Prediction , 2018, HEALTHINF.

[7]  Qingshan Liu,et al.  Spatiotemporal Satellite Image Fusion Using Deep Convolutional Neural Networks , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Medha A. Shah,et al.  Real time road traffic event detection using Twitter and spark , 2017, 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT).

[9]  Kevin Heaslip,et al.  Developing a Twitter-based traffic event detection model using deep learning architectures , 2019, Expert Syst. Appl..

[10]  Yeran Sun,et al.  On fine-grained geolocalisation of tweets and real-time traffic incident detection , 2019, Inf. Process. Manag..

[11]  Bruno Martins,et al.  Geocoding Textual Documents Through a Hierarchy of Linear Classifiers , 2015, EPIA.

[12]  Tarek F. Abdelzaher,et al.  ClariSense+: An enhanced traffic anomaly explanation service using social network feeds , 2016, Pervasive Mob. Comput..

[13]  Mehmet A. Orgun,et al.  Real-time event detection from the Twitter data stream using the TwitterNews+ Framework , 2019, Inf. Process. Manag..

[14]  Christian Jutten,et al.  Multimodal Data Fusion: An Overview of Methods, Challenges, and Prospects , 2015, Proceedings of the IEEE.

[15]  Antonio Alfredo Ferreira Loureiro,et al.  Traffic Event Detection Using Online Social Networks , 2017, 2017 13th International Conference on Distributed Computing in Sensor Systems (DCOSS).

[16]  Erik Cambria,et al.  Tensor Fusion Network for Multimodal Sentiment Analysis , 2017, EMNLP.

[17]  Zaher Al Aghbari,et al.  Synthizing Handwritten Arabic Text Using Active Shape Models , 2011, IP&C.

[18]  Hadi Kharrazi,et al.  Characterizing Diabetes, Diet, Exercise, and Obesity Comments on Twitter , 2017, Int. J. Inf. Manag..

[19]  Mokhairi Makhtar,et al.  Fusion model for ontology based image retrieval , 2015 .

[20]  Eleonora D'Andrea,et al.  Real-Time Detection of Traffic From Twitter Stream Analysis , 2015, IEEE Transactions on Intelligent Transportation Systems.

[21]  Abeer Alwan,et al.  Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing , 2019, IEEE J. Sel. Top. Signal Process..

[22]  Heri Ramampiaro,et al.  Locality-adapted kernel densities of term co-occurrences for location prediction of tweets , 2019, Inf. Process. Manag..

[23]  Wenli Zhang,et al.  Predicting Asthma-Related Emergency Department Visits Using Big Data , 2015, IEEE Journal of Biomedical and Health Informatics.

[24]  Craig MacDonald,et al.  Event attendance classification in social media , 2019, Inf. Process. Manag..

[25]  Zaher Al Aghbari,et al.  Analysis and prediction of influenza in the UAE based on Arabic tweets , 2018, 2018 IEEE 3rd International Conference on Big Data Analysis (ICBDA).

[26]  Sarah Alhumoud,et al.  Twitter Analysis for Intelligent Transportation , 2019, Comput. J..

[27]  Qingming Huang,et al.  Effective Multimodality Fusion Framework for Cross-Media Topic Detection , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Judith Gelernter,et al.  Geocoding location expressions in Twitter messages: A preference learning method , 2014, J. Spatial Inf. Sci..

[29]  Panagiotis Georgakis,et al.  Incident detection using data from social media , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[30]  Bruno Martins,et al.  Automated Geocoding of Textual Documents: A Survey of Current Approaches , 2017, Trans. GIS.

[31]  Erik Cambria,et al.  Fusing audio, visual and textual clues for sentiment analysis from multimodal content , 2016, Neurocomputing.

[32]  Cyrus Shahabi,et al.  Crowd sensing of traffic anomalies based on human mobility and social media , 2013, SIGSPATIAL/GIS.

[33]  Edward A. Fox,et al.  Read between the lines: A Machine Learning Approach for Disambiguating the Geo-location of Tweets , 2015, JCDL.

[34]  Vanessa Testoni,et al.  Multimodal data fusion for sensitive scene localization , 2019, Inf. Fusion.

[35]  Michael Gertz,et al.  EvenTweet: Online Localized Event Detection from Twitter , 2013, Proc. VLDB Endow..

[36]  Songnian Li,et al.  Sensing and detecting traffic events using geosocial media data: A review , 2018, Comput. Environ. Urban Syst..

[37]  Alok N. Choudhary,et al.  Real-time disease surveillance using Twitter data: demonstration on flu and cancer , 2013, KDD.

[38]  Erik Cambria,et al.  A review of affective computing: From unimodal analysis to multimodal fusion , 2017, Inf. Fusion.

[39]  Christan Grant,et al.  Tweet Geolocation Error Estimation , 2016 .

[40]  Zaher Al Aghbari,et al.  Gabor Wavelet Recognition Approach for Off-Line Handwritten Arabic Using Explicit Segmentation , 2013, IP&C.

[41]  Cécile Favre,et al.  Event detection, tracking, and visualization in Twitter: a mention-anomaly-based approach , 2015, Social Network Analysis and Mining.

[42]  Yutaka Matsuo,et al.  Tweet Analysis for Real-Time Event Detection and Earthquake Reporting System Development , 2013, IEEE Transactions on Knowledge and Data Engineering.

[43]  Josiane Mothe,et al.  Location extraction from tweets , 2018, Inf. Process. Manag..

[44]  Fabio Crestani,et al.  Event mining and timeliness analysis from heterogeneous news streams , 2019, Inf. Process. Manag..