Taxi-Passenger-Demand Modeling Based on Big Data from a Roving Sensor Network

Investigating passenger demand is essential for the taxicab business. Existing solutions are typically based on offline data collected by manual investigations, which are often dated and inaccurate for real-time analysis. To address this issue, we propose Dmodel, employing roving taxicabs as real-time mobile sensors to (i) infer passenger arriving moments by interactions of vacant taxicabs, and then (ii) infer passenger demand by customized online training with both historical and real-time data. Dmodel utilizes a novel parameter called pickup pattern based on an entropy of pickup events (accounts for various real-world logical information, e.g., bad weather) to reduce the size of big historical taxicab data to be processed. We evaluate Dmodel with a real-world 450 GB dataset of 14,000 taxicabs for a half year, and results show that compared to the ground truth, Dmodel achieves 83 percent accuracy and outperforms a statistical model by 42 percent. We further present an application where Dmodel is used to dispatch vacant taxicabs to achieve an equilibrium between passenger demand and taxicab supply across urban regions.

[1]  Yu Zheng,et al.  Constructing popular routes from uncertain trajectories , 2012, KDD.

[2]  Xing Xie,et al.  Where to find my next passenger , 2011, UbiComp '11.

[3]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[4]  Sejoon Lim,et al.  City-scale traffic estimation from a roving sensor network , 2012, SenSys '12.

[5]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[6]  Zhi-Hua Zhou,et al.  iBAT: detecting anomalous taxi trajectories from GPS traces , 2011, UbiComp '11.

[7]  Jiawei Han,et al.  Adaptive Fastest Path Computation on a Road Network: A Traffic Mining Approach , 2007, VLDB.

[8]  Rajesh Krishna Balan,et al.  Real-time trip information service for a large taxi fleet , 2011, MobiSys '11.

[9]  Gang Pan,et al.  Mining the semantics of origin-destination flows using taxi traces , 2012, UbiComp '12.

[10]  Yan Huang,et al.  Detecting regions of disequilibrium in taxi services under uncertainty , 2012, SIGSPATIAL/GIS.

[11]  Satish V. Ukkusuri,et al.  Urban Link Travel Time Estimation Using Large-scale Taxi Data with Partial Information , 2013 .

[12]  Guangzhong Sun,et al.  Driving with knowledge from the physical world , 2011, KDD.

[13]  Carlo Ratti,et al.  Exploring human movements in Singapore: a comparative analysis based on mobile phone and taxicab usages , 2013, UrbComp '13.

[14]  Kentaro Uesugi,et al.  Adaptive Routing of Cruising Taxis by Mutual Exchange of Pathways , 2008, KES.

[15]  Paolo Santi,et al.  Supporting Information for Quantifying the Benefits of Vehicle Pooling with Shareability Networks Data Set and Pre-processing , 2022 .

[16]  Carlo Ratti,et al.  Taxi-Aware Map: Identifying and Predicting Vacant Taxis in the City , 2010, AmI.

[17]  Hui Xiong,et al.  An energy-efficient mobile recommender system , 2010, KDD.

[18]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[19]  Hai Yang,et al.  Nonlinear pricing of taxi services , 2010 .

[20]  Schaller Consulting,et al.  The New York City Taxicab Fact Book , 2006 .

[21]  Sally Floyd,et al.  Wide area traffic: the failure of Poisson modeling , 1995, TNET.