Exploiting Taxi Demand Hotspots Based on Vehicular Big Data Analytics

In the urban transportation system, the unbalanced relationship between taxi demand and the number of running taxis reduces the drivers' income and the levels of passengers' satisfaction. With the help of vehicular global positioning system (GPS) data, the taxi demand distribution of city can be analyzed to provide advice for drivers. A clustering algorithm called Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is suitable for discovering demand hotspots. However, the execution efficiency is still a big challenge when DBSCAN is applied on big databases. In this paper, we propose an improved density-based clustering algorithm called Grid and Kd-tree for DBSCAN (GD-DBSCAN), which integrates partitioning method with kd-tree structure to improve the computational performance of DBSCAN. Furthermore, this algorithm can take advantages of multi cores and shared memory to parallelize related functions. The experiment shows GD- DBSCAN is efficient, it has an improvement of at least 10% in performance compared with DBSCAN.

[1]  Shih-Fen Cheng,et al.  Density Peaks Clustering Approach for Discovering Demand Hot Spots in City-scale Taxi Fleet Dataset , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[2]  João Gama,et al.  An online recommendation system for the taxi stand choice problem (Poster) , 2012, 2012 IEEE Vehicular Networking Conference (VNC).

[3]  Zhaohui Wu,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 1 Land-Use Classification Using Taxi GPS Traces , 2022 .

[4]  Lin Sun,et al.  Understanding Taxi Service Strategies From Taxi GPS Traces , 2015, IEEE Transactions on Intelligent Transportation Systems.

[5]  João Gama,et al.  Predicting Taxi–Passenger Demand Using Streaming Data , 2013, IEEE Transactions on Intelligent Transportation Systems.

[6]  Haoyu Tan,et al.  MR-DBSCAN: a scalable MapReduce-based DBSCAN algorithm for heavily skewed data , 2013, Frontiers of Computer Science.

[7]  I. Wald,et al.  On building fast kd-Trees for Ray Tracing, and on doing that in O(N log N) , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[8]  Rong Du,et al.  Effective Urban Traffic Monitoring by Vehicular Sensor Networks , 2015, IEEE Transactions on Vehicular Technology.

[9]  Rafael Sachetto Oliveira,et al.  G-DBSCAN: A GPU Accelerated Algorithm for Density-based Clustering , 2013, ICCS.