Evaluating the Accuracy of GPS-based Taxi Trajectory Records

Taxi data are an underused source of travel information. A handful of research has been concerned with the processing of raw taxi GPS data to minimize random error. The study of methods that systematically detect erroneous data has, however, received less attention. Generally, an origin and a destination are identified when the taxi occupancy status (occupied/vacant) shifts. Information would be wrongly recorded if taxi drivers incorrectly operate their device or missed signals when the status of their taxi changes. It leads to extremely short trips or long trips. This study proposes a set of criteria to evaluate the accuracy of trips, imputed from taxi GPS data. In particular, attributes such as inaccurate signal, mismatch of movement and speed, abnormal average speed, and mismatch of trips length measured on maps and calculated from records are suggested. Taxi data should pass these tests if trips have been identified accurately. Using these suggested criteria, the accuracy of 150 million GPS records, collected in Guangzhou, China is evaluated.

[1]  Kay W. Axhausen,et al.  Processing Raw Data from Global Positioning Systems without Additional Information , 2009 .

[2]  S. Winter,et al.  Can Betweenness Centrality Explain Traffic Flow , 2009 .

[3]  M. Zaman-Allah,et al.  Ecological engineering : from concepts to applications Nodular diagnosis for ecological engineering of the symbiotic nitrogen fixation with legumes , 2011 .

[4]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[5]  Scott Duncan,et al.  Using global positioning systems in health research: a practical approach to data collection and processing. , 2011, American journal of preventive medicine.

[6]  Chengyang Zhang,et al.  Map-matching for low-sampling-rate GPS trajectories , 2009, GIS.

[7]  Yuan Tian,et al.  Understanding intra-urban trip patterns from taxi trajectory data , 2012, Journal of Geographical Systems.

[8]  John Krumm,et al.  Hidden Markov map matching through noise and sparseness , 2009, GIS.

[9]  Robert M. Richardson,et al.  HP Laboratories Technical Report , 2013 .

[10]  Soora Rasouli,et al.  Activity-based models of travel demand: promises, progress and prospects , 2014 .

[11]  S. Phithakkitnukoon,et al.  Urban mobility study using taxi traces , 2011, TDMA '11.

[12]  Oliver Pink,et al.  A statistical approach to map matching using road network geometry, topology and vehicular motion constraints , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[13]  Xing Xie,et al.  Reducing Uncertainty of Low-Sampling-Rate Trajectories , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[14]  R. Bertini,et al.  Transit Buses as Traffic Probes: Use of Geolocation Data for Empirical Evaluation , 2004 .

[15]  Jianting Zhang Smarter outlier detection and deeper understanding of large-scale taxi trip records: a case study of NYC , 2012, UrbComp '12.

[16]  Yanmin Zhu,et al.  Challenges and Opportunities in Exploiting Large-Scale GPS Probe Data , 2011 .

[17]  Toshiyuki Yamamoto,et al.  Feasibility of Using Taxi Dispatch System as Probes for Collecting Traffic Information , 2009, J. Intell. Transp. Syst..