Inferring Trip Occupancies in the Rise of Ride-Hailing Services

The knowledge of all occupied and unoccupied trips made by self-employed drivers are essential for optimized vehicle dispatch by ride-hailing services (e.g., Didi Dache, Uber, Lyft, Grab, etc.). However, the occupancy status of vehicles is not always known to the service operators due to adoption of multiple ride-hailing apps. In this paper, we propose a novel framework, Learning to INfer Trips (LINT), to infer occupancy of car trips by exploring characteristics of observed occupied trips. Two main research steps, stop point classification and structural segmentation, are included in LINT. In the stop point classification step, we represent a vehicle trajectory as a sequence of stop points, and assign stop points with pick-up, drop-off, and intermediate labels. The classification of vehicle trajectory stop points produces a stop point label sequence. For structural segmentation, we further propose several segmentation algorithms, including greedy segmentation (GS), efficient greedy segmentation (EGS), and dynamic programming-based segmentation (DP) to infer occupied trip from stop point label sequences. Our comprehensive experiments on real vehicle trajectories from self-employed drivers show that (1) the proposed stop point classifier predicts stop point labels with high accuracy, and (2) the proposed segmentation algorithm GS delivers the best accuracy performance with efficient running time.

[1]  Rich Caruana,et al.  Predicting good probabilities with supervised learning , 2005, ICML.

[2]  Christian S. Jensen,et al.  Travel Cost Inference from Sparse, Spatio-Temporally Correlated Time Series Using Markov Models , 2013, Proc. VLDB Endow..

[3]  Nicholas Jing Yuan,et al.  Sensing the Pulse of Urban Refueling Behavior , 2015, ACM Trans. Intell. Syst. Technol..

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  Li Yuan Wang,et al.  A Novel Map Matching Algorithm , 2014 .

[6]  Nicholas Jing Yuan,et al.  T-Finder: A Recommender System for Finding Passengers and Vacant Taxis , 2013, IEEE Transactions on Knowledge and Data Engineering.

[7]  Ramayya Krishnan,et al.  Understanding taxi drivers’ routing choices from spatial and social traces , 2015, Frontiers of Computer Science.

[8]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[9]  Xianfeng Huang,et al.  Identifying Spatial Structure of Urban Functional Centers Using Travel Survey Data: A Case Study of Singapore , 2013, COMP '13.

[10]  Christian S. Jensen,et al.  Stochastic skyline route planning under time-varying uncertainty , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[11]  Hui Xiong,et al.  Exploiting Heterogeneous Human Mobility Patterns for Intelligent Bus Routing , 2014, 2014 IEEE International Conference on Data Mining.

[12]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[13]  Xing Xie,et al.  Inferring Taxi Status Using GPS Trajectories , 2012, ArXiv.

[14]  Guannan Liu,et al.  A cost-effective recommender system for taxi drivers , 2014, KDD.

[15]  Christian S. Jensen,et al.  Finding lowest-cost paths in settings with safe and preferred zones , 2017, The VLDB Journal.

[16]  Xing Xie,et al.  T-drive: driving directions based on taxi trajectories , 2010, GIS '10.

[17]  Philip S. Yu,et al.  When and Where: Predicting Human Movements Based on Social Spatial-Temporal Events , 2014, SDM.

[18]  Padhraic Smyth,et al.  Modeling human location data with mixtures of kernel densities , 2014, KDD.

[19]  Xing Xie,et al.  Understanding transportation modes based on GPS data for web applications , 2010, TWEB.

[20]  Christian S. Jensen,et al.  Path Cost Distribution Estimation Using Trajectory Data , 2016, Proc. VLDB Endow..

[21]  Dino Pedreschi,et al.  Unveiling the complexity of human mobility by querying and mining massive trajectory data , 2011, The VLDB Journal.

[22]  Christian S. Jensen,et al.  Multi-Cost Optimal Route Planning Under Time-Varying Uncertainty , 2013 .

[23]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[24]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[25]  Yu Zheng,et al.  Travel time estimation of a path using sparse trajectories , 2014, KDD.

[26]  Masamichi Shimosaka,et al.  Forecasting urban dynamics with mobility logs by bilinear Poisson regression , 2015, UbiComp.