Situation Aware Multi-task Learning for Traffic Prediction

Owing to the recent wide availability of transportation traffic data, major research efforts have been devoted to traffic prediction, which is useful in many applications such as urban planning, traffic management, and navigation systems. Current prediction methods that train a model independently per traffic sensor cannot accurately predict traffic in every situation (e.g., rush hours, construction, and accidents) because a single sensor may not have sufficient training samples for all situations. To address this shortcoming, our core idea is to exploit the commonalities of prediction tasks across multiple sensors that behave similarly in a specific traffic situation. Instead of building a model independently per sensor, we propose a Multi-Task Learning (MTL) framework that first automatically identifies the traffic situations and then simultaneously builds one forecasting model per traffic situation for similarly behaving sensors. The key innovation is that, instead of the straightforward application of MTL in which each "task" corresponds to a sensor, we relate each MTL "task" to a traffic situation. Specifically, we first identify these traffic situations by running clustering algorithms on all sensors' data. Subsequently, to enforce the commonalities under each identified situation, we use group Lasso regularization in MTL to select a common set of features for the prediction tasks, and we adapt the efficient FISTA algorithm, which has a guaranteed convergence rate. We evaluated our methods on a large volume of real-world traffic sensor data; our results show that, by incorporating traffic situations, our proposed MTL framework performs consistently better than naively applying MTL per sensor. Moreover, across different traffic situations, our holistic approach outperforms the best traffic prediction approach for each given situation by up to 18% in short-term prediction and up to 30% in long-term prediction.
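The feature-selection step described above can be illustrated with a minimal sketch, which is not the paper's implementation. It assumes that the situations have already been identified by clustering, that each task is a least-squares regression given as a pair (X_t, y_t), and that the group Lasso penalty is the l2,1 norm over the rows of the weight matrix W (one row per feature, one column per task). The function names, the synthetic data, and the value of lam are all hypothetical choices made for illustration.

```python
import numpy as np


def prox_l21(W, tau):
    """Row-wise soft-thresholding: prox of tau * sum_j ||W[j, :]||_2."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - tau / np.maximum(norms, 1e-12))
    return scale * W


def fista_group_lasso(tasks, lam, n_iter=500):
    """Minimize sum_t 0.5 * ||X_t w_t - y_t||^2 + lam * sum_j ||W[j, :]||_2.

    tasks: list of (X_t, y_t); column t of the returned W belongs to task t.
    """
    d = tasks[0][0].shape[1]
    # Lipschitz constant of the smooth part: the Hessian is block-diagonal
    # with blocks X_t^T X_t, so take the largest squared spectral norm.
    lip = max(np.linalg.norm(X, 2) ** 2 for X, _ in tasks)
    W = np.zeros((d, len(tasks)))
    Z = W.copy()          # extrapolated point
    t_k = 1.0             # FISTA momentum parameter
    for _ in range(n_iter):
        grad = np.column_stack(
            [X.T @ (X @ Z[:, t] - y) for t, (X, y) in enumerate(tasks)]
        )
        W_next = prox_l21(Z - grad / lip, lam / lip)
        t_next = (1.0 + np.sqrt(1.0 + 4.0 * t_k ** 2)) / 2.0
        Z = W_next + ((t_k - 1.0) / t_next) * (W_next - W)
        W, t_k = W_next, t_next
    return W


def make_synthetic_tasks(n_tasks=3, n=200, d=6, seed=0):
    """Hypothetical data: every task depends only on the first two features."""
    rng = np.random.default_rng(seed)
    tasks = []
    for _ in range(n_tasks):
        w = np.zeros(d)
        w[:2] = rng.choice([-1.0, 1.0], size=2) * rng.uniform(1.0, 2.0, size=2)
        X = rng.standard_normal((n, d))
        y = X @ w + 0.1 * rng.standard_normal(n)
        tasks.append((X, y))
    return tasks


if __name__ == "__main__":
    W = fista_group_lasso(make_synthetic_tasks(), lam=15.0)
    # Row norms show which features are selected jointly across all tasks.
    print(np.round(np.linalg.norm(W, axis=1), 3))
```

Because the penalty acts on whole rows of W, a feature is either kept for all tasks or dropped for all of them, which is the sense in which the tasks share a common feature set. The step size 1/lip uses the largest squared spectral norm among the task design matrices, a standard choice for this kind of proximal gradient scheme.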
