Capacity constrained assignment in spatial databases

Given a point set <i>P</i> of customers (e.g., WiFi receivers) and a point set <i>Q</i> of service providers (e.g., wireless access points), where each <i>q</i> ∈ <i>Q</i> has a capacity <i>q.k</i>, the <i>capacity constrained assignment</i> (CCA) is a matching <i>M</i> ⊆ <i>Q</i> × <i>P</i> such that (i) each point <i>q</i> ∈ <i>Q</i> (<i>p</i> ∈ <i>P</i>) appears at most <i>k</i> times (at most once) in <i>M</i>, (ii) the size of <i>M</i> is maximized (i.e., it comprises min{|<i>P</i>|, ∑<sub><i>q</i>∈<i>Q</i></sub><sup><i>q</i>.<i>k</i></sup>} pairs), and (iii) the total assignment cost (i.e., the sum of Euclidean distances within all pairs) is minimized. Thus, the CCA problem is to identify the assignment with the optimal overall quality; intuitively, the quality of <i>q</i>'s service to <i>p</i> in a given (<i>q</i>, <i>p</i>) pair is anti-proportional to their distance. Although max-flow algorithms are applicable to this problem, they require the complete distance-based bipartite graph between <i>Q</i> and <i>P</i>. For large spatial datasets, this graph is expensive to compute and it may be too large to fit in main memory. Motivated by this fact, we propose efficient algorithms for <i>optimal assignment</i> that employ novel edge-pruning strategies, based on the spatial properties of the problem. Additionally, we develop <i>approximate</i> (i.e., suboptimal) CCA solutions that provide a trade-off between result accuracy and computation cost, abiding by theoretical quality guarantees. A thorough experimental evaluation demonstrates the efficiency and practicality of the proposed techniques.

[1]  Ismail Hakki Toroslu,et al.  Incremental assignment problem , 2007, Inf. Sci..

[2]  Hanan Samet,et al.  Distance browsing in spatial databases , 1999, TODS.

[3]  Andrew V. Goldberg,et al.  An efficient cost scaling algorithm for the assignment problem , 1995, Math. Program..

[4]  Abraham Silberschatz,et al.  Database Systems Concepts , 1997 .

[5]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[6]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .

[7]  Yannis Manolopoulos,et al.  Closest pair queries in spatial databases , 2000, SIGMOD '00.

[8]  Man Lung Yiu,et al.  Continuous Monitoring of Exclusive Closest Pairs , 2007, SSTD.

[9]  Man Lung Yiu,et al.  Clustering objects on a spatial network , 2004, SIGMOD '04.

[10]  David B. Shmoys,et al.  Approximation algorithms for facility location problems , 2000, APPROX.

[11]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[12]  Thomas Brinkhoff,et al.  A Framework for Generating Network-Based Moving Objects , 2002, GeoInformatica.

[13]  J. Vygen Approximation Algorithms for Facility Location Problems ( Lecture Notes ) , 2005 .

[14]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[15]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[16]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[17]  Raymond Chi-Wing Wong,et al.  On Efficient Spatial Matching , 2007, VLDB.