Two-sided online bipartite matching in spatial data: experiments and analysis

With the rapid development of sharing economy and mobile Internet in recent years, a wide range of applications of the T wo-sided O nline B ipartite M atching (TOBM) problem in spatial data are gaining increasing popularity. To be specific, given a group of workers and tasks that dynamically appear in a 2D space, the TOBM problem aims to find a matching with the maximum cardinality between workers and tasks satisfying the spatiotemporal constraints. Many works have studied this problem, but the settings of their problems are different from each other. Moreover, no prior works have compared the performances of the algorithms tailored for different settings under a unified definition. As a result, there lacks a guideline for practitioners to adopt appropriate algorithms for various scenarios. To fill the blank in this field, we present a comprehensive evaluation and analysis of the representative algorithms for the TOBM problem in this paper. We first give our unified definition and then provide uniform implementations for all the algorithms. Finally, based on the experimental results on both synthetic and real datasets, we discuss the strengths and weaknesses of the algorithms in terms of short-term effect and long-term effect, which can be guidance on selecting appropriate solutions or designing new methods.

[1]  Lei Chen,et al.  An Experimental Evaluation of Task Assignment in Spatial Crowdsourcing , 2018, Proc. VLDB Endow..

[2]  Panos Kalnis,et al.  Collective Travel Planning in Spatial Networks , 2016, IEEE Transactions on Knowledge and Data Engineering.

[3]  Ji-Rong Wen,et al.  Mining frequent neighborhood patterns in a large labeled graph , 2013, CIKM.

[4]  Cyrus Shahabi,et al.  GeoCrowd: enabling query answering with spatial crowdsourcing , 2012, SIGSPATIAL/GIS.

[5]  Richard M. Karp,et al.  Theoretical Improvements in Algorithmic Efficiency for Network Flow Problems , 1972, Combinatorial Optimization.

[6]  Lei Chen,et al.  Trichromatic Online Matching in Real-Time Spatial Crowdsourcing , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[7]  Jing Lv,et al.  Efficient Computation of Measurements of Correlated Patterns in Uncertain Data , 2011, ADMA.

[8]  Jieping Ye,et al.  Flexible Online Task Assignment in Real-Time Spatial Data , 2017, Proc. VLDB Endow..

[9]  Ugur Demiryurek,et al.  Maximizing the number of worker's self-selected tasks in spatial crowdsourcing , 2013, SIGSPATIAL/GIS.

[10]  Lei Chen,et al.  Conflict-Aware Event-Participant Arrangement and Its Variant for Online Setting , 2016, IEEE Transactions on Knowledge and Data Engineering.

[11]  Kian-Lee Tan,et al.  Temporal Spatial-Keyword Top-k publish/subscribe , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[12]  Yan Cui,et al.  SOPS: A System for Efficient Processing of Spatial-Keyword Publish/Subscribe , 2014, Proc. VLDB Endow..

[13]  Yan Liu,et al.  Poster: FooDNet: Optimized On Demand Take-out Food Delivery using Spatial Crowdsourcing , 2017, MobiCom.

[14]  Lei Chen,et al.  Online mobile Micro-Task Allocation in spatial crowdsourcing , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[15]  Yuhao Zhang,et al.  How to match when all vertices arrive online , 2018, STOC.

[16]  Yefim Dinitz,et al.  Dinitz' Algorithm: The Original Version and Even's Version , 2006, Essays in Memory of Shimon Even.

[17]  Panos Kalnis,et al.  Searching Trajectories by Regions of Interest , 2017, IEEE Transactions on Knowledge and Data Engineering.

[18]  Jieping Ye,et al.  The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online Platforms , 2017, KDD.

[19]  Lei Chen,et al.  Latency-Oriented Task Completion via Spatial Crowdsourcing , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[20]  Ji-Rong Wen,et al.  Discovering Neighborhood Pattern Queries by sample answers in knowledge base , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[21]  Mauro Dell'Amico,et al.  Assignment Problems , 1998, IFIP Congress: Fundamentals - Foundations of Computer Science.

[22]  Gao Cong,et al.  Distributed Publish/Subscribe Query Processing on the Spatio-Textual Data Stream , 2016, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[23]  Jian Pei,et al.  Within-Network Classification Using Radius-Constrained Neighborhood Patterns , 2014, CIKM.

[24]  Lei Chen,et al.  Top-k Team Recommendation and Its Variants in Spatial Crowdsourcing , 2017, Data Science and Engineering.

[25]  Panos Kalnis,et al.  Location-Aware Top-k Term Publish/Subscribe , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[26]  Lidan Shou,et al.  SLADE: A Smart Large-Scale Task Decomposer in Crowdsourcing , 2018, IEEE Transactions on Knowledge and Data Engineering.

[27]  Jieping Ye,et al.  Dynamic Pricing in Spatial Crowdsourcing: A Matching-Based Approach , 2018, SIGMOD Conference.

[28]  Lei Chen,et al.  Online Minimum Matching in Real-Time Spatial Data: Experiments and Analysis , 2016, Proc. VLDB Endow..

[29]  Jieping Ye,et al.  A Taxi Order Dispatch Model based On Combinatorial Optimization , 2017, KDD.

[30]  Richard M. Karp,et al.  An optimal algorithm for on-line bipartite matching , 1990, STOC '90.

[31]  Yajun Wang,et al.  Two-sided Online Bipartite Matching and Vertex Cover: Beating the Greedy Algorithm , 2015, ICALP.

[32]  Panos Kalnis,et al.  Trajectory Similarity Join in Spatial Networks , 2017, Proc. VLDB Endow..

[33]  Lei Chen,et al.  Spatial Crowdsourcing: Challenges, Techniques, and Applications , 2017, Proc. VLDB Endow..

[34]  Zimu Zhou,et al.  Dynamic task assignment in spatial crowdsourcing , 2018, SIGSPACIAL.

[35]  Panos Kalnis,et al.  Parallel Trajectory-to-Location Join , 2019, IEEE Transactions on Knowledge and Data Engineering.

[36]  Christian S. Jensen,et al.  Spatial Keyword Querying , 2012, ER.

[37]  Panos Kalnis,et al.  Parallel trajectory similarity joins in spatial networks , 2018, The VLDB Journal.

[38]  Gao Cong,et al.  Diversity-Aware Top-k Publish/Subscribe for Text Stream , 2015, SIGMOD Conference.

[39]  Aranyak Mehta,et al.  Online Matching and Ad Allocation , 2013, Found. Trends Theor. Comput. Sci..

[40]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[41]  Xiangliang Zhang,et al.  Efficient task assignment in spatial crowdsourcing with worker and task privacy protection , 2018, GeoInformatica.