Sensing Urban Transportation Events from Multi-Channel Social Signals with the Word2vec Fusion Model

Social sensors perceive the real world through social media and online web services, and they offer lower cost and wider coverage than traditional physical sensors. In intelligent transportation research, sensing and analyzing such social signals provides a new way to monitor, control and optimize transportation systems. However, current research largely focuses on single-channel online social signals for extracting and sensing traffic information. Sensing and exploiting multi-channel social signals could provide a deeper understanding of traffic incidents. In this paper, we use cross-platform online data, i.e., Sina Weibo and News, as multi-channel social signals, and we propose a word2vec-based event fusion (WBEF) model for sensing, detecting, representing, linking and fusing urban traffic incidents. Each traffic incident can thus be described comprehensively from multiple aspects, and the whole picture of urban traffic events can be obtained and visualized. The proposed WBEF architecture was trained on about 1.15 million items of multi-channel online data from Qingdao (a coastal city in China), and the experiments show that our method surpasses the baseline model, achieving an 88.1% F1 score in urban traffic incident detection. The model also demonstrates its effectiveness in the open scenario test.
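
To make the "representing" and "linking" steps concrete, the following is a minimal sketch, not the WBEF model itself. It assumes that a short text can be represented by the average of its word vectors and that a microblog post can be linked to a news item when the cosine similarity of the two representations exceeds a threshold. The toy vectors, the function names (`embed`, `cosine`, `link`) and the threshold of 0.9 are all hypothetical; a real system would load embeddings trained with word2vec on the collected corpus.

```python
import math

# Hypothetical toy embeddings, for illustration only. A real pipeline
# would use vectors trained with word2vec on the crawled corpus.
TOY_VECTORS = {
    "accident": [0.9, 0.1, 0.0],
    "crash": [0.8, 0.2, 0.1],
    "road": [0.5, 0.5, 0.0],
    "highway": [0.6, 0.4, 0.1],
    "concert": [0.0, 0.1, 0.9],
    "music": [0.1, 0.0, 0.8],
}


def embed(tokens, vectors):
    """Average the vectors of known tokens; return None if none are known."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 for a zero vector)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def link(posts, news, vectors, threshold=0.9):
    """Return (post_index, news_index, similarity) for pairs at or above threshold."""
    links = []
    for i, post in enumerate(posts):
        post_vec = embed(post, vectors)
        if post_vec is None:
            continue
        for j, item in enumerate(news):
            news_vec = embed(item, vectors)
            if news_vec is None:
                continue
            similarity = cosine(post_vec, news_vec)
            if similarity >= threshold:
                links.append((i, j, similarity))
    return links


# Two tokenized posts (one about traffic, one unrelated) and one news item.
posts = [["crash", "highway"], ["music", "concert"]]
news = [["accident", "road"]]
links = link(posts, news, TOY_VECTORS)
print(links)
```

With these toy vectors, only the traffic-related post is linked to the news item. Averaging word vectors is one simple choice among several for representing short texts and is used here only to keep the sketch self-contained.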
