Distributed Mining of Popular Paths in Road Networks

We consider the problem of finding large scale mobility patterns. A common challenge in mobility tracking systems is that large quantity of data is spread out spatially and temporally across many tracking sensors. We thus devise a spatial sampling and information exchange protocol that provides probabilistic guarantees on detecting prominent patterns. For this purpose, we define a general notion of significant popular paths that can capture many different types of motion. We design a summary sketch for the data at each tracking node, which can be updated efficiently, and then aggregated across devices to reconstruct the prominent paths in the global data. The algorithm is scalable, even with large number of mobile targets. It uses a hierarchic query system that automatically prioritizes important trajectories – those that are long and popular. We show further that this scheme can in fact give good results by sampling relatively few sensors and targets, and works for streaming spatial data. We prove differential privacy guarantees for the randomized algorithm. Extensive experiments on real GPS data show that the method is efficient and accurate, and is useful in predicting motion of travelers even with small samples.

[1]  Leonidas J. Guibas,et al.  Locating lucrative passengers for taxicab drivers , 2013, SIGSPATIAL/GIS.

[2]  Rouzbeh Razavi,et al.  Urban small cell deployments: Impact on the network energy consumption , 2012, 2012 IEEE Wireless Communications and Networking Conference Workshops (WCNCW).

[3]  Fang Zhao,et al.  Large-Scale Transit Network Optimization by Minimizing User Cost and Transfers , 2006 .

[4]  Maarten Löffler,et al.  Median Trajectories , 2010, Algorithmica.

[5]  Parameswaran Ramanathan,et al.  Distributed target classification and tracking in sensor networks , 2003 .

[6]  Helmut Alt,et al.  Computing the Fréchet distance between two polygonal curves , 1995, Int. J. Comput. Geom. Appl..

[7]  Leonidas J. Guibas,et al.  Discrete mobile centers , 2001, SCG '01.

[8]  Jie Gao,et al.  MinHash Hierarchy for Privacy Preserving Trajectory Sensing and Query , 2017, 2017 16th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[9]  Zhenhong Lin,et al.  Charging infrastructure planning for promoting battery electric vehicles: An activity-based approach using multiday travel data , 2014 .

[10]  Nisheeth Shrivastava,et al.  Target tracking with binary proximity sensors , 2009, TOSN.

[11]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[12]  Aaron Roth,et al.  The Algorithmic Foundations of Differential Privacy , 2014, Found. Trends Theor. Comput. Sci..

[13]  Lei Chen,et al.  Finding time period-based most frequent path in big trajectory data , 2013, SIGMOD '13.

[14]  Lucas Chi Kwong Hui,et al.  Color Set Size Problem with Application to String Matching , 1992, CPM.

[15]  Eli Upfal,et al.  Probability and Computing: Randomized Algorithms and Probabilistic Analysis , 2005 .

[16]  Dieter Pfoser,et al.  Novel Approaches to the Indexing of Moving Object Trajectories , 2000, VLDB.

[17]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[18]  Anand Rajaraman,et al.  Mining of Massive Datasets , 2011 .

[19]  Joachim Gudmundsson,et al.  Detecting Commuting Patterns by Clustering Subtrajectories , 2011, Int. J. Comput. Geom. Appl..

[20]  Cecilia Mascolo,et al.  Mining users' significant driving routes with low-power sensors , 2014, SenSys.

[21]  Yifan Li,et al.  Clustering moving objects , 2004, KDD.

[22]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .