Extraction of traffic information from social media interactions: Methods and experiments

With the rapid development of social media, User Generated Content (UGC) has spawned huge amount of information in the society today. In the age of Big Data, we can provide important information for peoples' transportation needs through exploring and making full use of traffic data in social media. This paper introduces techniques like Natural Language Processing, cloud and open platform, mobile Internet, and human computer interacions to extract traffic information from text-based data buried in social media. We utilized Sina Weibo(weibo.com, a Twitter equivalent in China) as our main source of data, and developed a prototype system that published and captured traffic status through an Android based app (application). Experiments showed that the prototype system ran well in real time.

[1]  Chen Chuan-bin Matching Urban Traffic Information in Chinese Natural Language with Road Network , 2009 .

[2]  Fei-Yue Wang,et al.  Data-Driven Intelligent Transportation Systems: A Survey , 2011, IEEE Transactions on Intelligent Transportation Systems.

[3]  James Purnama,et al.  Traffic Condition Information Extraction & Visualization from Social Media Twitter for Android Mobile Application , 2011, Proceedings of the 2011 International Conference on Electrical Engineering and Informatics.

[4]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[5]  Dan Jurafsky,et al.  Statistical Natural Language Processing , 2010, Encyclopedia of Machine Learning.

[6]  Lu Feng,et al.  A cross-step word segmentation algorithm for understanding traffic information represented in natural Chinese language , 2009 .

[7]  Fengxiang Qiao,et al.  Social Media Applications to Publish Dynamic Transportation Information on Campus , 2011 .

[8]  Zheng Shi Overview of Question-Answering , 2002 .

[9]  Jindřich Libovický Statistical Natural Language Processing Methods in Music Notation Analysis , 2013 .

[10]  Paul Scarponcini Generalized Model for Linear Referencing in Transportation , 2002, GeoInformatica.

[11]  Yutaka Matsuo,et al.  Tweet Analysis for Real-Time Event Detection and Earthquake Reporting System Development , 2013, IEEE Transactions on Knowledge and Data Engineering.

[12]  Xiao Wang,et al.  Traffic Congestion and Social Media in China , 2013, IEEE Intelligent Systems.