NetClus: A Scalable Framework for Locating Top-K Sites for Placement of Trajectory-Aware Services

Optimal location queries identify the best locations at which to set up new facilities to serve their users. For several businesses, such as fuel stations and cellphone base stations, placement queries must take into account the mobility patterns (or trajectories) of the users. In this work, we formulate the TOPS (Trajectory-Aware Optimal Placement of Services) query, which locates the best k sites on a road network for the prevailing user trajectories. The problem is NP-hard. The greedy approach, which is the state-of-the-art technique for this problem, is neither scalable nor practical for real urban-scale scenarios, primarily because its memory footprint exceeds the capabilities of commodity machines. To overcome these challenges, we develop an indexing framework called NETCLUS that derives its power from a unique combination of FM sketches and network clustering. Empirical studies show that NETCLUS requires less than 100 s to answer the TOPS query on real datasets comprising more than 250,000 sites and 120,000 trajectories.
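To make the greedy baseline mentioned above concrete, the following is a minimal sketch of greedy top-k selection, not the authors' implementation. It assumes a simplified binary notion of coverage in which each candidate site is associated with a precomputed set of trajectory IDs it serves. The function name `greedy_top_k_sites`, the `coverage` mapping, and the toy data are all hypothetical and introduced only for illustration.

```python
from typing import Dict, Hashable, List, Set


def greedy_top_k_sites(coverage: Dict[Hashable, Set[int]], k: int) -> List[Hashable]:
    """Pick up to k sites greedily, each time taking the site that covers
    the most not-yet-covered trajectories.

    `coverage` maps a candidate site to the set of trajectory IDs it serves
    (an assumed, precomputed input; binary coverage is a simplification).
    """
    covered: Set[int] = set()
    chosen: List[Hashable] = []
    for _ in range(k):
        best_site, best_gain = None, 0
        for site, trajs in coverage.items():
            if site in chosen:
                continue
            gain = len(trajs - covered)  # marginal gain of adding this site
            if gain > best_gain:
                best_site, best_gain = site, gain
        if best_site is None:  # no remaining site adds any coverage
            break
        chosen.append(best_site)
        covered |= coverage[best_site]
    return chosen


# Toy data (hypothetical): four candidate sites, seven trajectories.
coverage = {
    "s1": {1, 2, 3},
    "s2": {3, 4},
    "s3": {4, 5, 6, 7},
    "s4": {1, 7},
}
selected = greedy_top_k_sites(coverage, 2)
print(selected)
```

In this sketch every site keeps an explicit set of trajectory IDs in memory, and every round rescans all sites. With hundreds of thousands of sites and trajectories, that storage and rescanning grows quickly, which illustrates the kind of cost the abstract attributes to the greedy approach.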
