UUKG: Unified Urban Knowledge Graph Dataset for Urban Spatiotemporal Prediction

Accurate Urban SpatioTemporal Prediction (USTP) is of great importance to the development and operation of the smart city. As an emerging building block, multi-sourced urban data are usually integrated as urban knowledge graphs (UrbanKGs) to provide critical knowledge for urban spatiotemporal prediction models. However, existing UrbanKGs are often tailored for specific downstream prediction tasks and are not publicly available, which limits the potential advancement. This paper presents UUKG, the unified urban knowledge graph dataset for knowledge-enhanced urban spatiotemporal predictions. Specifically, we first construct UrbanKGs consisting of millions of triplets for two metropolises by connecting heterogeneous urban entities such as administrative boroughs, POIs, and road segments. Moreover, we conduct qualitative and quantitative analysis on constructed UrbanKGs and uncover diverse high-order structural patterns, such as hierarchies and cycles, that can be leveraged to benefit downstream USTP tasks. To validate and facilitate the use of UrbanKGs, we implement and evaluate 15 KG embedding methods on the KG completion task and integrate the learned KG embeddings into 9 spatiotemporal models for five different USTP tasks. The extensive experimental results not only provide benchmarks of knowledge-enhanced USTP models under different task settings but also highlight the potential of state-of-the-art high-order structure-aware UrbanKG embedding methods. We hope the proposed UUKG fosters research on urban knowledge graphs and broad smart city applications. The dataset and source code are available at https://github.com/usail-hkust/UUKG/.

[1]  Depeng Jin,et al.  Urban Knowledge Graph Aided Mobile User Profiling , 2023, ACM Transactions on Knowledge Discovery from Data.

[2]  Depeng Jin,et al.  Hierarchical Knowledge Graph Learning Enabled Socioeconomic Indicator Prediction in Location-Based Social Network , 2023, WWW.

[3]  Yanjie Fu,et al.  UrbanKG: An Urban Knowledge Graph System , 2023, ACM Transactions on Intelligent Systems and Technology.

[4]  Yanxin Xi,et al.  Knowledge-infused Contrastive Learning for Urban Imagery-based Socioeconomic Prediction , 2023, WWW.

[5]  Yong Li,et al.  Developing knowledge graph based system for urban computing , 2022, Proceedings of the 1st ACM SIGSPATIAL International Workshop on Geospatial Knowledge Graphs.

[6]  G. Qi,et al.  An Urban Traffic Knowledge Graph-Driven Spatial-Temporal Graph Convolutional Network for Traffic Flow Prediction , 2022, IJCKG.

[7]  Yizhou Sun,et al.  Dual-Geometric Space Embedding Model for Two-View Knowledge Graphs , 2022, KDD.

[8]  Qingming Huang,et al.  Geometry Interaction Knowledge Graph Embeddings , 2022, AAAI.

[9]  Shirui Pan,et al.  Ultrahyperbolic Knowledge Graph Embeddings , 2022, KDD.

[10]  J. Lian,et al.  OntoProtein: Protein Pretraining With Gene Ontology Embedding , 2022, ICLR.

[11]  Francesco Di Giovanni,et al.  Understanding over-squashing and bottlenecks on graphs via curvature , 2021, ICLR.

[12]  Yong Li,et al.  Knowledge-driven Site Selection via Urban Knowledge Graph , 2021, ArXiv.

[13]  Huandong Wang,et al.  Spatio-Temporal Urban Knowledge Graph Enabled Mobility Prediction , 2021, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[14]  Jure Leskovec,et al.  Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones , 2021, NeurIPS.

[15]  Fei Teng,et al.  Urban Flow Pattern Mining Based on Multi-Source Heterogeneous Data Fusion and Knowledge Graph Embedding , 2021, IEEE Transactions on Knowledge and Data Engineering.

[16]  Jiannong Cao,et al.  CityNet: A Multi-city Multi-modal Dataset for Smart City Applications , 2021, ArXiv.

[17]  Qingming Huang,et al.  Dual Quaternion Knowledge Graph Embeddings , 2021, AAAI.

[18]  Yanfeng Sun,et al.  Hierarchical Graph Convolution Network for Traffic Forecasting , 2021, AAAI.

[19]  Ramesh Nallapati,et al.  Mixed-Curvature Multi-Relational Graph Neural Network for Knowledge Graph Completion , 2021, WWW.

[20]  Jiyuan Tan,et al.  Research on the Construction of a Knowledge Graph and Knowledge Reasoning Model in the Field of Urban Traffic , 2021, Sustainability.

[21]  Sameh K. Mohamed,et al.  BioKG: A Knowledge Graph for Relational Learning On Biological Data , 2020, CIKM.

[22]  Yu Liu,et al.  T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction , 2018, IEEE Transactions on Intelligent Transportation Systems.

[23]  Lina Yao,et al.  Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting , 2020, NeurIPS.

[24]  Kunpeng Liu,et al.  Incremental Mobile User Profiling: Reinforcement Learning with Spatial Knowledge Graph for Modeling Event Streams , 2020, KDD.

[25]  Xiaojun Chang,et al.  Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks , 2020, KDD.

[26]  Tiantian Zhu,et al.  Temporal Multi-Graph Convolutional Network for Traffic Flow Prediction , 2020, IEEE Transactions on Intelligent Transportation Systems.

[27]  Lars Juhl Jensen,et al.  Clinical Knowledge Graph Integrates Proteomics Data into Clinical Decision-Making , 2020, bioRxiv.

[28]  Christopher R'e,et al.  Low-Dimensional Hyperbolic Knowledge Graph Embeddings , 2020, ACL.

[29]  Yun Chen,et al.  Urban Multi-Source Spatio-Temporal Data Analysis Aware Knowledge Graph Embedding , 2020, Symmetry.

[30]  Timothy M. Hospedales,et al.  Multi-relational Poincaré Graph Embeddings , 2019, NeurIPS.

[31]  Hai Yang,et al.  Hexagon-Based Convolutional Neural Network for Supply-Demand Forecasting of Ride-Sourcing Services , 2019, IEEE Transactions on Intelligent Transportation Systems.

[32]  Yu Meng,et al.  Spherical Text Embedding , 2019, NeurIPS.

[33]  Jure Leskovec,et al.  Hyperbolic Graph Convolutional Neural Networks , 2019, NeurIPS.

[34]  Ning Feng,et al.  Attention Based Spatial-Temporal Graph Convolutional Networks for Traffic Flow Forecasting , 2019, AAAI.

[35]  Timothy M. Hospedales,et al.  TuckER: Tensor Factorization for Knowledge Graph Completion , 2019, EMNLP.

[36]  Nitesh V. Chawla,et al.  DeepCrime: Attentive Hierarchical Recurrent Networks for Crime Prediction , 2018, CIKM.

[37]  Jian-Yun Nie,et al.  RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space , 2018, ICLR.

[38]  Nicolas Usunier,et al.  Canonical Tensor Decomposition for Knowledge Base Completion , 2018, ICML.

[39]  Linpeng Huang,et al.  A Neural Attention Model for Urban Air Quality Inference: Learning the Weights of Monitoring Stations , 2018, AAAI.

[40]  Christopher De Sa,et al.  Representation Tradeoffs for Hyperbolic Embeddings , 2018, ICML.

[41]  Chao Zhang,et al.  DeepMove: Predicting Human Mobility with Attentional Recurrent Networks , 2018, WWW.

[42]  Jules White,et al.  DxNAT — Deep neural networks for explaining non-recurring traffic congestion , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[43]  Jinzhi Lei,et al.  A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction , 2017, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[44]  Zhanxing Zhu,et al.  Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting , 2017, IJCAI.

[45]  Nicholas Jing Yuan,et al.  Understanding People Lifestyles: Construction of Urban Movement Knowledge Graph from GPS Trajectory , 2017, IJCAI.

[46]  Pasquale Minervini,et al.  Convolutional 2D Knowledge Graph Embeddings , 2017, AAAI.

[47]  Xiuwen Yi,et al.  DNN-based prediction model for spatio-temporal data , 2016, SIGSPATIAL/GIS.

[48]  Daniel Kifer,et al.  Crime Rate Inference with Big Data , 2016, KDD.

[49]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[50]  Guillaume Bouchard,et al.  Complex Embeddings for Simple Link Prediction , 2016, ICML.

[51]  Xuan Song,et al.  Learning Deep Representation from Big and Heterogeneous Data for Traffic Accident Inference , 2016, AAAI.

[52]  Han Xiao,et al.  From One Point to a Manifold: Knowledge Graph Embedding for Precise Link Prediction , 2015, IJCAI.

[53]  Gang Pan,et al.  Bike sharing station placement leveraging heterogeneous urban open data , 2015, UbiComp.

[54]  Yunpeng Wang,et al.  Large-Scale Transportation Network Congestion Evolution Prediction Using Deep Learning Theory , 2015, PloS one.

[55]  Shane D. Johnson,et al.  UK open source crime data: accuracy and possibilities for research , 2015 .

[56]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[57]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[58]  Jianfeng Gao,et al.  Embedding Entities and Relations for Learning and Inference in Knowledge Bases , 2014, ICLR.

[59]  Chris Mungall,et al.  Global biotic interactions: An open infrastructure to share and analyze species-interaction datasets , 2014, Ecol. Informatics.

[60]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[61]  Zhen Wang,et al.  Knowledge Graph Embedding by Translating on Hyperplanes , 2014, AAAI.

[62]  Christian P. Robert,et al.  Statistics for Spatio-Temporal Data , 2014 .

[63]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[64]  Jason Weston,et al.  Learning Structured Embeddings of Knowledge Bases , 2011, AAAI.

[65]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[66]  Geoffrey E. Hinton,et al.  Transforming Auto-Encoders , 2011, ICANN.

[67]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[68]  A. O. Houcine On hyperbolic groups , 2006 .

[69]  C. Willmott,et al.  Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance , 2005 .

[70]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[71]  Jeff Z. Pan,et al.  Construction and Applications of Open Business Knowledge Graph , 2022 .

[72]  Xiaoming Fu,et al.  Inferring Individual Human Mobility From Sparse Check-in Data: A Temporal-Context-Aware Approach , 2024, IEEE Transactions on Computational Social Systems.

[73]  Yuzhong Qu,et al.  CKGG: A Chinese Knowledge Graph for High-School Geography Education and Beyond , 2021, SEMWEB.

[74]  Moritz Rodenhausen,et al.  Geometric Group Theory , 2010 .