Using machine learning and big data approaches to predict travel time based on historical and real-time data from Taiwan electronic toll collection

As the technology in automation and computation advances, traffic data can be easily collected from multiple sources, such as sensors and surveillance cameras. To extract value from the huge volumes of available data requires the capability to process and extract patterns in large datasets. In this paper, a machine learning method embedded within a big data analytics platform is constructed by using random forests method and Apache Hadoop to predict highway travel time based on data collected from highway electronic toll collection in Taiwan. Various prediction models are then developed for highway travel time based on historical and real-time data to provide drivers with estimated and adjusted travel time information.

[1]  Mu-Chen Chen,et al.  A data mining based approach for travel time prediction in freeway with non-recurrent congestion , 2014, Neurocomputing.

[2]  Sharath Chandra Guntuku,et al.  Big Data Analytics framework for Peer-to-Peer Botnet detection using Random Forests , 2014, Inf. Sci..

[3]  Mu-Chen Chen,et al.  Identifying important variables for predicting travel time of freeway with non-recurrent congestion with neural networks , 2012, Neural Computing and Applications.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Chung-Cheng Lu,et al.  A bayesian dynamic linear model approach for real-time short-term freeway travel time prediction , 2011 .

[6]  Majid Mirmehdi,et al.  Traffic sign recognition using MSER and Random Forests , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[7]  Nikolaos Geroliminis,et al.  Experienced travel time prediction for congested freeways , 2013 .

[8]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[9]  D. T. Lee,et al.  Travel-time prediction with support vector regression , 2004, IEEE Transactions on Intelligent Transportation Systems.

[10]  Jun Liu,et al.  Freeway path travel time prediction based on heterogeneous traffic data through nonparametric model , 2016, J. Intell. Transp. Syst..

[11]  van Lint,et al.  Reliable Real-Time Framework for Short-Term Freeway Travel Time Prediction , 2006 .

[12]  Steven I-Jy Chien,et al.  DYNAMIC TRAVEL TIME PREDICTION WITH REAL-TIME AND HISTORICAL DATA , 2003 .

[13]  Daniel Neagu,et al.  Using random forest and decision tree models for a new vehicle prediction approach in computational toxicology , 2015, Soft Computing.

[14]  Eeti Jain,et al.  Categorizing Twitter users on the basis of their interests using Hadoop/Mahout platform , 2014, 2014 9th International Conference on Industrial and Information Systems (ICIIS).

[15]  Eleni I. Vlahogianni,et al.  Short-term traffic forecasting: Where we are and where we’re going , 2014 .

[16]  Fu-Hsiang Chen,et al.  An alternative model for the analysis of detecting electronic industries earnings management using stepwise regression, random forest, and decision tree , 2016, Soft Comput..

[17]  Lin Wang,et al.  Metric forests based on Gaussian mixture model for visual image classification , 2018, Soft Comput..

[18]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[19]  João Falcão e Cunha,et al.  Health Twitter Big Bata Management with Hadoop Framework , 2015 .

[20]  Satu Innamaa,et al.  Short-Term Prediction of Travel Time using Neural Networks on an Interurban Highway , 2005 .

[21]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[22]  Pritam Gajkumar Shah,et al.  BIG DATA MINING TOOLS FOR UNSTRUCTURED DATA: A REVIEW , 2015 .

[23]  Francisco Herrera,et al.  On the use of MapReduce for imbalanced big data using Random Forest , 2014, Inf. Sci..

[24]  Eric C. Grunsky,et al.  Predictive lithological mapping of Canada's North using Random Forest classification applied to geophysical and geochemical data , 2015, Comput. Geosci..

[25]  Matthias Weidlich,et al.  Traveling time prediction in scheduled transportation with journey segments , 2017, Inf. Syst..

[26]  Margrit Betke,et al.  Comparing random forest approaches to segmenting and classifying gestures , 2017, Image Vis. Comput..

[27]  Saeid Nahavandi,et al.  A genetic algorithm-based method for improving quality of travel time prediction intervals , 2011 .

[28]  Xiaoyan Zhang,et al.  Short-term travel time prediction , 2003 .