SCAWG: A toolbox for generating synthetic workload for spatial crowdsourcing

With the popularity of mobile devices, Spatial Crowdsourcing (SC) is emerging as a new framework that enables human workers to perform tasks in the physical world. With spatial crowdsourcing, the goal is to outsource a set of spatiotemporal tasks (i.e., tasks with time and location) to a set of workers, which requires the workers to be physically present at the location of the tasks in order to perform them. Research efforts have focused on different aspects of SC, such as task assignment, task scheduling and protecting workers' privacy. However, one of the biggest challenges for the research community is the absence of public real-world datasets from SC applications such as Uber, TaskRabbit, etc. Therefore, we propose a synthetic workload generator for the SC applications, namely SCAWG1, to produce common datasets for experimentation, thus leading to reproducible research. SCAWG facilitates and standardizes workload generation in SC research. Specifically, SCAWG considers realistic spatiotemporal properties and behaviors for both workers and tasks. In addition, to further emulate the real-world workloads, it considers various application-specific constraints and properties of workers and tasks as well as their temporal arrival patterns. SCAWG is very flexible and thus can be customized for different spatial crowdsourcing and crowdsensing applications via a set of user-defined parameters and distributions. We developed an open-source toolbox2 to demonstrate the feasibility and applicability of SCAWG. The toolbox is designed to be generic and extensible, which would expedite experimental studies.

[1]  Loren G. Terveen,et al.  Avoiding the South Side and the Suburbs: The Geography of Mobile Crowdsourcing Markets , 2015, CSCW.

[2]  Deepak Ganesan,et al.  Labor dynamics in a mobile micro-task market , 2013, CHI.

[3]  Adrian Baddeley,et al.  Spatial Point Processes and their Applications , 2007 .

[4]  Jizhong Zhao,et al.  Reliable Diversity-Based Spatial Crowdsourcing by Moving Workers , 2014, Proc. VLDB Endow..

[5]  Daqing Zhang,et al.  CrowdRecruiter: selecting participants for piggyback crowdsensing under probabilistic coverage constraint , 2014, UbiComp.

[6]  Jie Wu,et al.  Multi-task assignment for crowdsensing in mobile social networks , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[7]  Cyrus Shahabi,et al.  A Server-Assigned Spatial Crowdsourcing Framework , 2015, ACM Trans. Spatial Algorithms Syst..

[8]  Cyrus Shahabi,et al.  A Framework for Protecting Worker Location Privacy in Spatial Crowdsourcing , 2014, Proc. VLDB Endow..

[9]  Ugur Demiryurek,et al.  Maximizing the number of worker's self-selected tasks in spatial crowdsourcing , 2013, SIGSPATIAL/GIS.

[10]  Wolfgang Weil,et al.  Spatial Point Processes and their Applications , 2007 .

[11]  Cyrus Shahabi,et al.  PrivGeoCrowd: A toolbox for studying private spatial Crowdsourcing , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[12]  Darren Gergle,et al.  On the "localness" of user-generated content , 2010, CSCW '10.

[13]  Daqing Zhang,et al.  CrowdTasker: Maximizing coverage quality in Piggyback Crowdsensing under budget constraint , 2015, 2015 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[14]  Lei Chen,et al.  GeoTruCrowd: trustworthy query answering with spatial crowdsourcing , 2013, SIGSPATIAL/GIS.

[15]  Jie Wu,et al.  QoI-Aware Multitask-Oriented Dynamic Participant Selection With Budget Constraints , 2014, IEEE Transactions on Vehicular Technology.

[16]  Jiming Chen,et al.  Toward optimal allocation of location dependent tasks in crowdsensing , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[17]  Cyrus Shahabi,et al.  GeoCrowd: enabling query answering with spatial crowdsourcing , 2012, SIGSPATIAL/GIS.

[18]  Cyrus Shahabi,et al.  Real-time task assignment in hyperlocal spatial crowdsourcing under budget constraints , 2016, 2016 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[19]  Hung Dang,et al.  Maximum Complex Task Assignment: Towards Tasks Correlation in Spatial Crowdsourcing , 2013, IIWAS '13.

[20]  Cyrus Shahabi,et al.  Effectively crowdsourcing the acquisition and analysis of visual data for disaster response , 2015, 2015 IEEE International Conference on Big Data (Big Data).