Travel Time Prediction using Tree-Based Ensembles

In this paper, we consider the task of predicting travel times between two arbitrary points in an urban scenario. We view this problem from two temporal perspectives: long-term forecasting with a horizon of several days and short-term forecasting with a horizon of one hour. Both of these perspectives are relevant for planning tasks in the context of urban mobility and transportation services. We utilize tree-based ensemble methods that we train and evaluate on a dataset of taxi trip records from New York City. Through extensive data analysis, we identify relevant temporal and spatial features. We also engineer additional features based on weather and routing data. The latter is obtained via a routing solver operating on the road network. The computational results show that the addition of this routing data can be beneficial to the model performance. Moreover, employing different models for short and long-term prediction is useful as short-term models are better suited to mirror current traffic conditions. In fact, we show that accurate short-term predictions may be obtained with only little training data.

[1]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[2]  H. J. Van Zuylen,et al.  Accurate freeway travel time prediction with state-space neural networks under missing data , 2005 .

[3]  D. T. Lee,et al.  Travel-time prediction with support vector regression , 2004, IEEE Transactions on Intelligent Transportation Systems.

[4]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[5]  Fei-Yue Wang,et al.  Travel time prediction with LSTM neural network , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[6]  Eibe Frank,et al.  Accelerating the XGBoost algorithm using GPU computing , 2017, PeerJ Comput. Sci..

[7]  Shinichi Morishita,et al.  On Classification and Regression , 1998, Discovery Science.

[8]  Dongjoo Park,et al.  Dynamic multi-interval bus travel time prediction using bus transit data , 2010 .

[9]  Samiul Hasan,et al.  Modeling of Travel Time Variations on Urban Links in London , 2011 .

[10]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[11]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[12]  Marlin W. Ulmer,et al.  Approximate Dynamic Programming for Dynamic Vehicle Routing , 2017 .

[13]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Tarek F. Abdelzaher,et al.  On Limits of Travel Time Predictions: Insights from a New York City Case Study , 2014, 2014 IEEE 34th International Conference on Distributed Computing Systems.

[16]  Angshuman Guin,et al.  Travel Time Prediction Using a Seasonal Autoregressive Integrated Moving Average Time Series Model , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[17]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[18]  Che-Ming Chen,et al.  Long-term travel time prediction using gradient boosting , 2020, J. Intell. Transp. Syst..

[19]  Ruimin Li,et al.  Incorporating uncertainty into short-term travel time predictions , 2011 .

[20]  Juan Cheng,et al.  Research on Travel Time Prediction Model of Freeway Based on Gradient Boosting Decision Tree , 2019, IEEE Access.

[21]  Christopher M. Bishop,et al.  Classification and regression , 1997 .

[22]  Yanru Zhang,et al.  A gradient boosting method to improve travel time prediction , 2015 .

[23]  Baozhen Yao,et al.  Prediction of Bus Travel Time Using Random Forests Based on Near Neighbors , 2018, Comput. Aided Civ. Infrastructure Eng..