Real-time traffic classification with Twitter data mining

The growth of vehicles in Yogyakarta Province, Indonesia is not proportional to the growth of roads. This problem causes severe traffic jam in many main roads. Common traffic anomalies detection using surveillance camera requires manpower and costly, while traffic anomalies detection with crowdsourcing mobile applications are mostly owned by private. This research aims to develop a real-time traffic classification by harnessing the power of social network data, Twitter. In this study, Twitter data are processed to the stages of preprocessing, feature extraction, and tweet classification. This study compares classification performance of three machine learning algorithms, namely Naive Bayes (NB), Support Vector Machine (SVM), and Decision Tree (DT). Experimental results show that SVM algorithm produced the best performance among the other algorithms with 99.77% and 99.87% of classification accuracy in balanced and imbalanced data, respectively. This research implies that social network service may be used as an alternative source for traffic anomalies detection by providing information of traffic flow condition in real-time.

[1]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[2]  W. Jatmiko,et al.  Traffic intelligent system architecture based on social media information , 2012, 2012 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[3]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[4]  Y. Matsuo,et al.  Real-time event extraction for driving information from social sensors , 2012, 2012 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems (CYBER).

[5]  Edi Winarko,et al.  Klasifikasi Posting Twitter Kemacetan Lalu Lintas Kota Bandung Menggunakan Naive Bayesian Classification , 2013 .

[6]  Ricardo Jardim-Goncalves,et al.  Twitter mining for traffic events detection , 2015, 2015 Science and Information Conference (SAI).

[7]  Ir. Lukito Edi Nugroho,et al.  PENERAPAN ANALISIS SENTIMEN PADA TWITTER BERBAHASA INDONESIA SEBAGAI PEMBERI RATING , 2014 .

[8]  Vasile Palade,et al.  Class Imbalance Learning Methods for Support Vector Machines , 2013 .

[9]  Eleonora D'Andrea,et al.  Real-Time Detection of Traffic From Twitter Stream Analysis , 2015, IEEE Transactions on Intelligent Transportation Systems.

[10]  Ral Garreta,et al.  Learning scikit-learn: Machine Learning in Python , 2013 .

[11]  Feng Chen,et al.  From Twitter to detector: real-time traffic incident detection using social media data , 2016 .