Traffic Flow Breakdown Prediction using Machine Learning Approaches

Traffic flow breakdown is the abrupt shift from operation at free-flow conditions to congested conditions and is typically the result of complex interactions in traffic dynamics. Because of its stochastic nature, breakdown is commonly predicted only in a probabilistic manner. This paper focuses on using stationary aggregated traffic data to capture traffic dynamics, developing and testing machine learning (ML) approaches for traffic breakdown prediction and comparing them with the traditionally used probabilistic approaches. The contribution of this study is three-fold: it explores the usefulness of temporally and spatially lagged detector data in predicting traffic flow breakdown occurrence, it develops and tests ML approaches for traffic breakdown prediction using this data, and it compares the predictive power and performance of these approaches with the traditionally used probabilistic methods. Feature selection results indicate that breakdown prediction benefits greatly from the inclusion of temporally and spatially lagged variables. Comparing the performance of the ML methods with the probabilistic approaches, ML methods achieve better prediction performance in relation to the class-balanced accuracy, true positive rate (recall), true negative rate (specificity), and positive predictive value (precision). Depending on the application of the prediction approach, the method selection criteria may differ on a case-by-case basis. Overall, the best performance was achieved by the neural network and support vector machine approaches with class balancing, and with the random forest approach without class balancing. Recommendations on the choice of prediction approaches based on the specific application objectives are also given.

[1]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[2]  Nicholas G. Polson,et al.  Deep learning for short-term traffic flow prediction , 2016, 1604.04527.

[3]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[4]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[5]  George C. Runger,et al.  Gene selection with guided regularized random forest , 2012, Pattern Recognit..

[6]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[7]  Hani S. Mahmassani,et al.  Likelihood and Duration of Flow Breakdown , 2010 .

[8]  Hani S. Mahmassani,et al.  Stochastic modeling of traffic flow breakdown phenomenon: Application to predicting travel time reliability , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[9]  Martin Treiber,et al.  Traffic Flow Dynamics , 2013 .

[10]  D. Helbing,et al.  Theoretical vs. empirical classification and prediction of congested traffic states , 2009, 0903.0929.

[11]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[12]  Hani S. Mahmassani,et al.  Prediction and Mitigation of Flow Breakdown Occurrence for Weather Affected Networks: Case Study of Chicago, Illinois , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[13]  Martin Treiber,et al.  Traffic Flow Dynamics: Data, Models and Simulation , 2012 .

[14]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[15]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[16]  Sam Yagar,et al.  Exploration of the Breakdown Phenomenon in Freeway Traffic , 1998 .

[17]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[18]  FreundYoav,et al.  Large Margin Classification Using the Perceptron Algorithm , 1999 .

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Shan Suthaharan,et al.  Big data classification: problems and challenges in network intrusion prediction with machine learning , 2014, PERV.

[21]  George C. Runger,et al.  Feature selection via regularized trees , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[22]  Alireza Talebpour,et al.  Machine Learning Approach to Short-Term Traffic Congestion Prediction in a Connected Environment , 2018, Transportation Research Record: Journal of the Transportation Research Board.