Learning Geo-Contextual Embeddings for Commuting Flow Prediction

Predicting commuting flows based on infrastructure and land-use information is critical for urban planning and public policy development. However, it is a challenging task given the complex patterns of commuting flows. Conventional models, such as gravity model, are mainly derived from physics principles and limited by their predictive power in real-world scenarios where many factors need to be considered. Meanwhile, most existing machine learning-based methods ignore the spatial correlations and fail to model the influence of nearby regions. To address these issues, we propose Geo-contextual Multitask Embedding Learner (GMEL), a model that captures the spatial correlations from geographic contextual information for commuting flow prediction. Specifically, we first construct a geo-adjacency network containing the geographic contextual information. Then, an attention mechanism is proposed based on the framework of graph attention network (GAT) to capture the spatial correlations and encode geographic contextual information to embedding space. Two separate GATs are used to model supply and demand characteristics. A multitask learning framework is used to introduce stronger restrictions and enhance the effectiveness of the embedding representation. Finally, a gradient boosting machine is trained based on the learned embeddings to predict commuting flows. We evaluate our model using real-world datasets from New York City and the experimental results demonstrate the effectiveness of our proposal against the state of the art.

[1]  Alberto Vancheri The dynamics of complex urban systems : an interdisciplinary approach , 2010 .

[2]  Alexander J. Smola,et al.  Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs , 2019, ArXiv.

[3]  Arthur Cayley,et al.  The Collected Mathematical Papers: On Monge's “Mémoire sur la théorie des déblais et des remblais” , 2009 .

[4]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[5]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[6]  Maxime Lenormand,et al.  Systematic comparison of trip distribution laws and models , 2015, 1506.04889.

[7]  Tomoharu Iwata,et al.  Neural Collective Graphical Models for Estimating Spatio-Temporal Population Flow from Aggregated Data , 2019, AAAI.

[8]  J. Bram,et al.  The Evolution of Commuting Patterns in the New York City Metro Area , 2005 .

[9]  C. Bamford,et al.  Geography of transport , 1978 .

[10]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Reconstructing commuters network using machine learning and urban indicators , 2019, Scientific Reports.

[11]  Zhenhui Li,et al.  Region Representation Learning via Mobility Flow , 2017, CIKM.

[12]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[13]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[14]  Nathan Eagle,et al.  Limits of Predictability in Commuting Flows in the Absence of Data for Calibration , 2014, Scientific Reports.

[15]  Junbo Zhang,et al.  Flow Prediction in Spatio-Temporal Networks Based on Multitask Deep Learning , 2020, IEEE Transactions on Knowledge and Data Engineering.

[16]  Jean-Claude Thill,et al.  Trip distribution modeling with Twitter data , 2019, Comput. Environ. Urban Syst..

[17]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[18]  B. Slack,et al.  The Geography of Transport Systems , 2006 .

[19]  Kai Zheng,et al.  Origin-Destination Matrix Prediction via Graph Convolution: a New Perspective of Passenger Demand Modeling , 2019, KDD.

[20]  Tie-Yan Liu,et al.  DeepGBM: A Deep Learning Framework Distilled by GBDT for Online Prediction Tasks , 2019, KDD.

[21]  G. Zipf The P 1 P 2 D Hypothesis: On the Intercity Movement of Persons , 1946 .

[22]  Bistra N. Dilkina,et al.  A Machine Learning Approach to Modeling Human Migration , 2017, COMPASS.

[23]  Kaan Ozbay,et al.  Dynamic Origin–Destination Matrix Prediction with Line Graph Neural Networks and Kalman Filter , 2019 .

[24]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[25]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[26]  Marta C. González,et al.  A universal model for mobility and migration patterns , 2011, Nature.

[27]  Dennis Luxen,et al.  Real-time routing with OpenStreetMap data , 2011, GIS.

[28]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[29]  A. E. Austin,et al.  The Cost of a Commute: A Multidisciplinary Approach to Osteoarthritis in New Kingdom Egypt , 2017 .

[30]  Junbo Zhang,et al.  Urban Traffic Prediction from Spatio-Temporal Data Using Deep Meta Learning , 2019, KDD.