Grab-Posisi: An Extensive Real-Life GPS Trajectory Dataset in Southeast Asia

Real-world GPS trajectory datasets are essential for geographical applications such as map inference, map matching, traffic detection, etc. Currently only a handful of GPS trajectory datasets are publicly available and the quality of these datasets varies. Most of the existing datasets have limited geographical coverage (a focus on China or the USA), have low sampling rates and less contextual information of the GPS pings. This paper presents Grab-Posisi, the first GPS trajectory dataset of Southeast Asia from both developed countries (Singapore) and developing countries (Jakarta, Indonesia). The data were collected very recently in April 2019 with a 1 second sampling rate, which is the highest amongst all the existing open source datasets. It also has richer contextual information, including the accuracy level, bearing, speed and labels trajectories by data acquisition source (Android or iOS phones) and driving mode (Car or Motorcycle). The dataset contains more than 88 million pings and covers more than 1 million kms. Experiments on the dataset demonstrate new challenges for various geographical applications. The dataset is of great value and a significant resource for the community to benchmark and revisit existing algorithms.

[1]  Lin Zhang,et al.  One-month beijing taxi GPS trajectory dataset with taxi IDs and vehicle status , 2018, DATA@SenSys.

[2]  John Krumm,et al.  Hidden Markov map matching through noise and sparseness , 2009, GIS.

[3]  Robert B. Noland,et al.  A High Accuracy Fuzzy Logic Based Map Matching Algorithm for Road Transport , 2006, J. Intell. Transp. Syst..

[4]  Tieniu Tan,et al.  Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts , 2016, AAAI.

[5]  Andy Hopper,et al.  Scalable, Distributed, Real-Time Map Generation , 2006, IEEE Pervasive Computing.

[6]  Leonidas J. Guibas,et al.  Knowledge-based trajectory completion from sparse GPS samples , 2016, SIGSPATIAL/GIS.

[7]  Qingquan Li,et al.  A Review of GPS Trajectories Classification Based on Transportation Mode , 2018, Sensors.

[8]  Chao Zhang,et al.  SERM: A Recurrent Model for Next Location Prediction in Semantic Trajectories , 2017, CIKM.

[9]  Kay W. Axhausen,et al.  Processing Raw Data from Global Positioning Systems without Additional Information , 2009 .

[10]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[11]  Leonidas J. Guibas,et al.  City-Scale Map Creation and Updating using GPS Collections , 2016, KDD.

[12]  Tao Cheng,et al.  Inferring hybrid transportation modes from sparse GPS data using a moving window SVM classification , 2012, Comput. Environ. Urban Syst..

[13]  Rajesh Krishna Balan,et al.  Real-time trip information service for a large taxi fleet , 2011, MobiSys '11.

[14]  김기현,et al.  디지탈 영상에서 Adaptive Thresholding에 관한 연구 , 1986 .

[15]  Yu Zheng GPS Trajectories with transportation mode labels , 2010 .

[16]  Yang Wang,et al.  Identifying Different Transportation Modes from Trajectory Data Using Tree-Based Ensemble Classifiers , 2017, ISPRS Int. J. Geo Inf..

[17]  Fredrik Gustafsson,et al.  Particle filters for positioning, navigation, and tracking , 2002, IEEE Trans. Signal Process..

[18]  Roger Zimmermann,et al.  Eddy: an error-bounded delay-bounded real-time map matching algorithm using HMM and online Viterbi decoder , 2014, SIGSPATIAL/GIS.

[19]  Johann Schrammel,et al.  Comparison of Travel Diaries Generated from Smartphone Data and Dedicated GPS Devices , 2015 .

[20]  Kevin Heaslip,et al.  Inferring transportation modes from GPS trajectories using a convolutional neural network , 2018, ArXiv.

[21]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[22]  Yanhua Li,et al.  Planning Bike Lanes based on Sharing-Bikes' Trajectories , 2017, KDD.

[23]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[24]  Daqing Zhang,et al.  Measuring social functions of city regions from large-scale taxi behaviors , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[25]  Bruno Martins,et al.  Predicting future locations with hidden Markov models , 2012, UbiComp.

[26]  Zhanxing Zhu,et al.  Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting , 2017, IJCAI.

[27]  Weiwei Sun,et al.  COBWEB: a robust map update system using GPS trajectories , 2015, UbiComp.

[28]  Dieter Pfoser,et al.  A comparison and evaluation of map construction algorithms using vehicle tracking data , 2014, GeoInformatica.

[29]  Yang Yang,et al.  Optimization Model of Taxi Fleet Size Based on GPS Tracking Data , 2019, Sustainability.

[30]  Roger Zimmermann,et al.  Content vs. Context , 2015, ACM Trans. Multim. Comput. Commun. Appl..

[31]  Pasi Fränti,et al.  CellNet: Inferring Road Networks from GPS Trajectories , 2018, ACM Trans. Spatial Algorithms Syst..

[32]  Sher Paul Singh,et al.  Measurement and Analysis of Vibration Levels in Two and Three Wheel Delivery Vehicles in Southeast Asia , 2015 .

[33]  Roger Zimmermann,et al.  A general feature-based map matching framework with trajectory simplification , 2016, IWGS@SIGSPATIAL.

[34]  Yu Liu,et al.  T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction , 2018, IEEE Transactions on Intelligent Transportation Systems.

[35]  Lin Zhang,et al.  A multi-period analysis of taxi drivers' behaviors based on GPS trajectories , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[36]  Xing Xie,et al.  T-drive: driving directions based on taxi trajectories , 2010, GIS '10.

[37]  James Biagioni,et al.  Map inference in the face of noise and disparity , 2012, SIGSPATIAL/GIS.