Optimal matching between spatial datasets under capacity constraints

Consider a set of customers (e.g., WiFi receivers) and a set of service providers (e.g., wireless access points), where each provider has a capacity and the quality of service offered to its customers is anti-proportional to their distance. The Capacity Constrained Assignment (CCA) is a matching between the two sets such that (i) each customer is assigned to at most one provider, (ii) every provider serves no more customers than its capacity, (iii) the maximum possible number of customers are served, and (iv) the sum of Euclidean distances within the assigned provider-customer pairs is minimized. Although max-flow algorithms are applicable to this problem, they require the complete distance-based bipartite graph between the customer and provider sets. For large spatial datasets, this graph is expensive to compute and it may be too large to fit in main memory. Motivated by this fact, we propose efficient algorithms for optimal assignment that employ novel edge-pruning strategies, based on the spatial properties of the problem. Additionally, we develop incremental techniques that maintain an optimal assignment (in the presence of updates) with a processing cost several times lower than CCA recomputation from scratch. Finally, we present approximate (i.e., suboptimal) CCA solutions that provide a tunable trade-off between result accuracy and computation cost, abiding by theoretical quality guarantees. A thorough experimental evaluation demonstrates the efficiency and practicality of the proposed techniques.

[1]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[2]  Timos K. Sellis,et al.  Efficient Cost Models for Spatial Queries Using R-Trees , 2000, IEEE Trans. Knowl. Data Eng..

[3]  Yufei Tao,et al.  Progressive computation of the min-dist optimal-location query , 2006, VLDB.

[4]  Thomas Brinkhoff,et al.  A Framework for Generating Network-Based Moving Objects , 2002, GeoInformatica.

[5]  Hanan Samet,et al.  Distance browsing in spatial databases , 1999, TODS.

[6]  David Manlove,et al.  Strong Stability in the Hospitals/Residents Problem , 2003, STACS.

[7]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[8]  D. Bertsekas The auction algorithm: A distributed relaxation method for the assignment problem , 1988 .

[9]  Christos Faloutsos,et al.  The R+-Tree: A Dynamic Index for Multi-Dimensional Objects , 1987, VLDB.

[10]  Michel Balinski,et al.  Signature Methods for the Assignment Problem , 1985, Oper. Res..

[11]  L. Shapley,et al.  College Admissions and the Stability of Marriage , 1962 .

[12]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[13]  Kyriakos Mouratidis,et al.  Capacity constrained assignment in spatial databases , 2008, SIGMOD Conference.

[14]  Hans-Peter Kriegel,et al.  Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification , 1995, SSD.

[15]  Warren B. Powell,et al.  The Dynamic Assignment Problem , 2004, Transp. Sci..

[16]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[17]  Robert W. Irving,et al.  The Stable marriage problem - structure and algorithms , 1989, Foundations of computing series.

[18]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[19]  Theodore Bially,et al.  Space-filling curves: Their generation and their application to bandwidth reduction , 1969, IEEE Trans. Inf. Theory.

[20]  Yannis Manolopoulos,et al.  Closest pair queries in spatial databases , 2000, SIGMOD '00.

[21]  Man Lung Yiu,et al.  Computation and Monitoring of Exclusive Closest Pairs , 2008, IEEE Transactions on Knowledge and Data Engineering.

[22]  Dimitri P. Bertsekas,et al.  A new algorithm for the assignment problem , 1981, Math. Program..

[23]  Atsuyuki Okabe,et al.  Spatial Tessellations: Concepts and Applications of Voronoi Diagrams , 1992, Wiley Series in Probability and Mathematical Statistics.

[24]  James B. Orlin,et al.  QuickMatch--a very fast algorithm for the assignment problem , 1993 .

[25]  Katta G. Murty,et al.  Network programming , 1992 .

[26]  Kyriakos Mouratidis,et al.  Tree-based partition querying: a methodology for computing medoids in large spatial datasets , 2008, The VLDB Journal.

[27]  Man Lung Yiu,et al.  Clustering objects on a spatial network , 2004, SIGMOD '04.

[28]  Ulrich Derigs,et al.  A shortest augmenting path method for solving minimal perfect matching problems , 1981, Networks.

[29]  Robert E. Tarjan,et al.  Faster scaling algorithms for general graph matching problems , 1991, JACM.

[30]  Atsuyuki Okabe,et al.  Spatial Tessellations: Concepts and Applications of Voronoi Diagrams, Second Edition , 2000, Wiley Series in Probability and Mathematical Statistics.

[31]  Raymond Chi-Wing Wong,et al.  On Efficient Spatial Matching , 2007, VLDB.

[32]  M. Bernardine Dias,et al.  The Dynamic Hungarian Algorithm for the Assignment Problem with Changing Costs , 2007 .

[33]  J. Vygen Approximation Algorithms for Facility Location Problems ( Lecture Notes ) , 2005 .

[34]  Nick Roussopoulos,et al.  Nearest neighbor queries , 1995, SIGMOD '95.

[35]  Ismail Hakki Toroslu,et al.  Incremental assignment problem , 2007, Inf. Sci..

[36]  Ming S. Hung,et al.  Technical Note - A Polynomial Simplex Method for the Assignment Problem , 1983, Oper. Res..

[37]  Andrew V. Goldberg,et al.  An efficient cost scaling algorithm for the assignment problem , 1995, Math. Program..

[38]  Ravindra K. Ahuja,et al.  Network Flows , 2011 .

[39]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .