Gaussian Process Decentralized Data Fusion and Active Sensing for Spatiotemporal Traffic Modeling and Prediction in Mobility-on-Demand Systems

Mobility-on-demand (MoD) systems have recently emerged as a promising paradigm of one-way vehicle sharing for sustainable personal urban mobility in densely populated cities. We assume the capability of a MoD system to be enhanced by deploying robotic shared vehicles that can autonomously cruise the streets to be hailed by users. A key challenge of the MoD system is that of real-time, fine-grained mobility demand and traffic flow sensing and prediction. This paper presents novel Gaussian process (GP) decentralized data fusion and active sensing algorithms for real-time, fine-grained traffic modeling and prediction with a fleet of MoD vehicles. The predictive performance of our decentralized data fusion algorithms are theoretically guaranteed to be equivalent to that of sophisticated centralized sparse GP approximations. We derive consensus filtering variants requiring only local communication between neighboring vehicles. We theoretically guarantee the performance of our decentralized active sensing algorithms. When they are used to gather informative data for mobility demand prediction, they can achieve a dual effect of fleet rebalancing to service mobility demands. Empirical evaluation on real-world datasets shows that our algorithms are significantly more time-efficient and scalable in the size of data and fleet while achieving predictive performance comparable to that of state-of-the-art algorithms. Note to Practitioners-Knowing, understanding, and predicting spatiotemporally varying traffic phenomena in real time has become increasingly important to the goal of achieving smooth-flowing, congestion-free traffic in densely populated urban cities, which motivates our work here. This paper addresses the following fundamental problem of data fusion and active sensing: How can a fleet of autonomous robotic vehicles or mobile probes actively cruise a road network to gather and assimilate the most informative data for predicting a spatiotemporally varying traffic phenomenon like a mobility demand pattern or traffic flow? Existing centralized solutions are poorly suited because they suffer from a single point of failure and incur huge communication, space, and time overheads with large data and fleet. This paper proposes novel efficient and scalable decentralized data fusion and active sensing algorithms with theoretical performance guarantees. The practical applicability of our algorithms is not restricted to traffic monitoring [1]-[4]; they can be used in other environmental sensing applications such as mineral prospecting [5], precision agriculture, monitoring of ocean/freshwater phenomena (e.g., plankton bloom) [6]-[9], forest ecosystems, pollution (e.g., oil spill), or contamination. Note that the decentralized data fusion component of our algorithms can also be used for static sensors and passive mobile probes and, interestingly, adapted to parallel implementations to be run on a cluster of machines for achieving efficient and scalable probabilistic prediction (i.e., with predictive uncertainty) with large data. Empirical results show that our algorithms can perform well with two datasets featuring real-world traffic phenomena in the densely-populated urban city of Singapore. A limitation of our algorithms is that the decentralized data fusion components assume independence between multiple traffic phenomena while the decentralized active sensing components only work for a single traffic phenomenon. So, in our future work, we will generalize our algorithms to perform active sensing of multiple traffic phenomena and remove the assumption of independence between them.

[1]  William L Eisele,et al.  TRAVEL TIME DATA COLLECTION HANDBOOK , 1998 .

[2]  Michael Edward Hohn Geostatistics and Petroleum Geology (2nd ed.) , 2000, Technometrics.

[3]  Jorge Cortés,et al.  Distributed Kriged Kalman Filter for Spatial Estimation , 2009, IEEE Transactions on Automatic Control.

[4]  Nicholas Jing Yuan,et al.  T-Finder: A Recommender System for Finding Passengers and Vacant Taxis , 2013, IEEE Transactions on Knowledge and Data Engineering.

[5]  Kian Hsiang Low,et al.  Gaussian Process-Based Decentralized Data Fusion and Active Sensing for Mobility-on-Demand System , 2013, Robotics: Science and Systems.

[6]  Y. Kamarianakis,et al.  Forecasting Traffic Flow Conditions in an Urban Network: Comparison of Multivariate and Univariate Approaches , 2003 .

[7]  Marko Wagner,et al.  Geostatistics For Environmental Scientists , 2016 .

[8]  Emilio Frazzoli,et al.  Robotic load balancing for mobility-on-demand systems , 2012, Int. J. Robotics Res..

[9]  Michael Edward Hohn,et al.  Geostatistics and Petroleum Geology , 1988 .

[10]  Kian Hsiang Low,et al.  Information-Theoretic Approach to Efficient Adaptive Path Planning for Mobile Robotic Environmental Sensing , 2009, ICAPS.

[11]  Mark Coates,et al.  Distributed particle filters for sensor networks , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[12]  Hoong Chuin Lau,et al.  Toward Large-Scale Agent Guidance in an Urban Taxi Service , 2012, UAI.

[13]  Peng Yang,et al.  Stability and Convergence Properties of Dynamic Average Consensus Estimators , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[14]  V. N. Bogaevski,et al.  Matrix Perturbation Theory , 1991 .

[15]  Athanasios K. Ziliaskopoulos,et al.  Foundations of Dynamic Traffic Assignment: The Past, the Present and the Future , 2001 .

[16]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[17]  Nicholas R. Jennings,et al.  Decentralised Coordination of Mobile Sensors Using the Max-Sum Algorithm , 2009, IJCAI.

[18]  Jane Yung-jen Hsu,et al.  Context-aware taxi demand hotspots prediction , 2010, Int. J. Bus. Intell. Data Min..

[19]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[20]  R.M. Murray,et al.  On a decentralized active sensing strategy using mobile sensor platforms in a network , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[21]  Gilbert Laporte,et al.  Dynamic pickup and delivery problems , 2010, Eur. J. Oper. Res..

[22]  Kian Hsiang Low,et al.  Hierarchical Bayesian Nonparametric Approach to Modeling and Learning the Wisdom of Crowds of Urban Traffic Route Planning Agents , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[23]  Kristian Kersting,et al.  Stacked Gaussian Process Learning , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[24]  Neil D. Lawrence,et al.  Fast Sparse Gaussian Process Methods: The Informative Vector Machine , 2002, NIPS.

[25]  Kian Hsiang Low,et al.  Recent Advances in Scaling Up Gaussian Process Predictive Models for Large Spatiotemporal Data , 2014, DyDESS.

[26]  Ilse C. F. Ipsen,et al.  Determinant Approximations , 2011, 1105.0437.

[27]  Mohan S. Kankanhalli,et al.  Nonmyopic \(\epsilon\)-Bayes-Optimal Active Learning of Gaussian Processes , 2014, ICML.

[28]  Andreas Krause,et al.  Toward Community Sensing , 2008, 2008 International Conference on Information Processing in Sensor Networks (ipsn 2008).

[29]  Kian Hsiang Low,et al.  Adaptive multi-robot wide-area exploration and mapping , 2008, AAMAS.

[30]  R. Olfati-Saber Distributed Tracking for Mobile Sensor Networks with Information-Driven Mobility , 2007, 2007 American Control Conference.

[31]  Markos Papageorgiou,et al.  Real-time freeway traffic state estimation based on extended Kalman filter: a general approach , 2005 .

[32]  Kian Hsiang Low,et al.  Generalized Online Sparse Gaussian Processes with Application to Persistent Mobile Robot Localization , 2014, ECML/PKDD.

[33]  Kian Hsiang Low,et al.  Decentralized active robotic exploration and mapping for probabilistic field classification in environmental sensing , 2012, AAMAS.

[34]  Kian Hsiang Low,et al.  Multi-robot informative path planning for active sensing of environmental phenomena: a tale of two algorithms , 2013, AAMAS.

[35]  A. Bayen,et al.  A traffic model for velocity data assimilation , 2010 .

[36]  Zoubin Ghahramani,et al.  Local and global sparse Gaussian process approximations , 2007, AISTATS.

[37]  R. Reese Geostatistics for Environmental Scientists , 2001 .

[38]  Hao Chen,et al.  Real-time freeway traffic state prediction: A particle filter approach , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[39]  KrauseAndreas,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008 .

[40]  Wanli Min,et al.  Real-time road traffic prediction with spatio-temporal correlations , 2011 .

[41]  D. Clawin,et al.  Wireless LAN performance under varied stress conditions in vehicular traffic scenarios , 2002, Proceedings IEEE 56th Vehicular Technology Conference.

[42]  Kian Hsiang Low,et al.  Parallel Gaussian Process Regression for Big Data: Low-Rank Representation Meets Markov Approximation , 2014, AAAI.

[43]  Neil D. Lawrence,et al.  Fast Forward Selection to Speed Up Sparse Gaussian Process Regression , 2003, AISTATS.

[44]  Carlos Guestrin,et al.  Robust Probabilistic Inference in Distributed Systems , 2004, UAI.

[45]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[46]  Hui Xiong,et al.  An energy-efficient mobile recommender system , 2010, KDD.

[47]  Kian Hsiang Low,et al.  Robot Boats as a Mobile Aquatic Sensor Network , 2009 .

[48]  Kian Hsiang Low,et al.  Telesupervised remote surface water quality sensing , 2010, 2010 IEEE Aerospace Conference.

[49]  Jongeun Choi,et al.  Explorative navigation of mobile sensor networks using sparse Gaussian processes , 2010, 49th IEEE Conference on Decision and Control (CDC).

[50]  R. Olfati-Saber,et al.  Consensus Filters for Sensor Networks and Distributed Sensor Fusion , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[51]  Márk Jelasity,et al.  Gossip-based aggregation in large dynamic networks , 2005, TOCS.

[52]  Gaurav S. Sukhatme,et al.  Decentralized Data Fusion and Active Sensing with Mobile Sensors for Modeling and Predicting Spatiotemporal Traffic Phenomena , 2012, UAI.

[53]  RasmussenCarl Edward,et al.  A Unifying View of Sparse Approximate Gaussian Process Regression , 2005 .

[54]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[55]  Kian Hsiang Low,et al.  Parallel Gaussian Process Regression with Low-Rank Covariance Matrix Approximations , 2013, UAI.

[56]  Andreas Krause,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008, J. Mach. Learn. Res..

[57]  Karthik K. Srinivasan,et al.  Determination of Number of Probe Vehicles Required for Reliable Travel Time Measurement in Urban Network , 1996 .

[58]  Kian Hsiang Low,et al.  Active Markov information-theoretic path planning for robotic environmental sensing , 2011, AAMAS.

[59]  Kian Hsiang Low,et al.  GP-Localize: Persistent Mobile Robot Localization using Online Sparse Gaussian Process Observation Model , 2014, AAAI.

[60]  Sebastian Thrun,et al.  Decentralized Sensor Fusion with Distributed Particle Filters , 2002, UAI.

[61]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[62]  H. Durrant-Whyte,et al.  The ANSER Project: Data Fusion Across Multiple Uninhabited Air Vehicles , 2003 .

[63]  H. Banks Center for Research in Scientific Computationにおける研究活動 , 1999 .

[64]  Richard M. Murray,et al.  DYNAMIC CONSENSUS FOR MOBILE NETWORKS , 2005 .

[65]  R. Olfati-Saber,et al.  Distributed Kalman Filter with Embedded Consensus Filters , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[66]  Carl E. Rasmussen,et al.  A Unifying View of Sparse Approximate Gaussian Process Regression , 2005, J. Mach. Learn. Res..

[67]  Mohan S. Kankanhalli,et al.  Active Learning Is Planning: Nonmyopic ε-Bayes-Optimal Active Learning of Gaussian Processes , 2014, ECML/PKDD.

[68]  Zhaohui Wu,et al.  Prediction of urban human mobility using large-scale taxi traces and its applications , 2012, Frontiers of Computer Science.

[69]  C. Guestrin,et al.  Distributed regression: an efficient framework for modeling sensor network data , 2004, Third International Symposium on Information Processing in Sensor Networks, 2004. IPSN 2004.

[70]  Kian Hsiang Low,et al.  Adaptive Sampling for Multi-Robot Wide-Area Exploration , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[71]  Kian Hsiang Low,et al.  Multi-robot active sensing of non-stationary gaussian process-based environmental phenomena , 2014, AAMAS.

[72]  William J. Mitchell,et al.  Reinventing the Automobile: Personal Urban Mobility for the 21st Century , 2010 .

[73]  Alberto Elfes,et al.  Cooperative aquatic sensing using the telesupervised adaptive ocean sensor fleet , 2009, Remote Sensing.