Machine Learning in Mobile Crowd Sourcing: A Behavior-Based Recruitment Model

With the advent of mobile crowd sourcing (MCS) systems and its applications, the selection of the right crowd is gaining utmost importance. The increasing variability in the context of MCS tasks makes the selection of not only the capable but also the willing workers crucial for a high task completion rate. Most of the existing MCS selection frameworks rely primarily on reputation-based feedback mechanisms to assess the level of commitment of potential workers. Such frameworks select workers having high reputation scores but without any contextual awareness of the workers, at the time of selection, or the task. This may lead to an unfair selection of workers who will not perform the task. Hence, reputation on its own only gives an approximation of workers’ behaviors since it assumes that workers always behave consistently regardless of the situational context. However, following the concept of cross-situational consistency, where people tend to show similar behavior in similar situations and behave differently in disparate ones, this work proposes a novel recruitment system in MCS based on behavioral profiling. The proposed approach uses machine learning to predict the probability of the workers performing a given task, based on their learned behavioral models. Subsequently, a group-based selection mechanism, based on the genetic algorithm, uses these behavioral models in complementation with a reputation-based model to recruit a group of workers that maximizes the quality of recruitment of the tasks. Simulations based on a real-life dataset show that considering human behavior in varying situations improves the quality of recruitment achieved by the tasks and their completion confidence when compared with a benchmark that relies solely on reputation.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Hadi Otrok,et al.  A stability-based group recruitment system for continuous mobile crowd sensing , 2018, Comput. Commun..

[3]  Jingdong Xu,et al.  Crowd Foraging: A QoS-Oriented Self-Organized Mobile Crowdsourcing Framework Over Opportunistic Networks , 2017, IEEE Journal on Selected Areas in Communications.

[4]  Giancarlo Fortino,et al.  Optimal Selection of Crowdsourcing Workers Balancing Their Utilities and Platform Profit , 2019, IEEE Internet of Things Journal.

[5]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[6]  Stefano Chessa,et al.  Mobile crowd sensing management with the ParticipAct living lab , 2017, Pervasive Mob. Comput..

[7]  Chengzhong Xu,et al.  Configuring in-memory cluster computing using random forest , 2018, Future Gener. Comput. Syst..

[8]  Hadi Otrok,et al.  Gale-Shapley Matching Game Selection—A Framework for User Satisfaction , 2019, IEEE Access.

[9]  Hadi Otrok,et al.  GRS: A Group-Based Recruitment System for Mobile Crowd Sensing , 2016, J. Netw. Comput. Appl..

[10]  Lei Wang,et al.  Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery , 2018, Trans. Inst. Meas. Control.

[11]  Audun Jøsang,et al.  AIS Electronic Library (AISeL) , 2017 .

[12]  Jiangtao Wang,et al.  GP-selector: a generic participant selection framework for mobile crowdsourcing systems , 2018, World Wide Web.

[13]  Hyun-Woo Lee,et al.  Toward a Trust Evaluation Mechanism in the Social Internet of Things , 2017, Sensors.

[14]  Marco Anisetti,et al.  A trust assurance technique for Internet of things based on human behavior compliance , 2019 .

[15]  Gyu Myoung Lee,et al.  Data centric trust evaluation and prediction framework for IOT , 2017, 2017 ITU Kaleidoscope: Challenges for a Data-Driven Society (ITU K).

[16]  Schahram Dustdar,et al.  Social Interaction Analysis for Team Collaboration , 2014, Encyclopedia of Social Network Analysis and Mining.

[17]  Badih Ghattas,et al.  A review of supervised machine learning algorithms and their applications to ecological data , 2012 .

[18]  Tagaram Soni Madhulatha,et al.  Comparison between K-Means and K-Medoids Clustering Algorithms , 2011 .

[19]  Jiangtao Wang,et al.  HyTasker: Hybrid Task Allocation in Mobile Crowd Sensing , 2018, IEEE Transactions on Mobile Computing.

[20]  Hadi Otrok,et al.  A Misbehaving-Proof Game Theoretical Selection Approach for Mobile Crowd Sourcing , 2020, IEEE Access.

[21]  MengChu Zhou,et al.  Security and trust issues in Fog computing: A survey , 2018, Future Gener. Comput. Syst..

[22]  Yang Wang,et al.  TaskMe: multi-task allocation in mobile crowd sensing , 2016, UbiComp.

[23]  R. Hogan,et al.  Explorations in behavioral consistency: properties of persons, situations, and behaviors. , 1991, Journal of personality and social psychology.

[24]  Luca Foschini,et al.  Quantifying User Reputation Scores, Data Trustworthiness, and User Incentives in Mobile Crowd-Sensing , 2017, IEEE Access.

[25]  Ahmed Eldawy,et al.  LARS: A Location-Aware Recommender System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[26]  Ahmed Eldawy,et al.  LARS*: An Efficient and Scalable Location-Aware Recommender System , 2014, IEEE Transactions on Knowledge and Data Engineering.

[27]  Sarvapali D. Ramchurn,et al.  DEVISING A TRUST MODEL FOR MULTI-AGENT INTERACTIONS USING CONFIDENCE AND REPUTATION , 2004, Appl. Artif. Intell..

[28]  Mingchu Li,et al.  Reputation-based multi-auditing algorithmic mechanism for reliable mobile crowdsensing , 2018, Pervasive Mob. Comput..

[29]  Hakim Ghazzai,et al.  A Spatial Mobile Crowdsourcing Framework for Event Reporting , 2020, IEEE Transactions on Computational Social Systems.

[30]  Paolo Bellavista,et al.  Participact for smart and connected communities: exploiting social networks with profile extension in crowdsensing systems , 2018, ICDCN Workshops.

[31]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[32]  MengChu Zhou,et al.  An embedded feature selection method for imbalanced data classification , 2019, IEEE/CAA Journal of Automatica Sinica.

[33]  Jiangtao Wang,et al.  Allocating Heterogeneous Tasks in Participatory Sensing with Diverse Participant-Side Factors , 2019, IEEE Transactions on Mobile Computing.

[34]  Franco Romerio,et al.  A parametric genetic algorithm approach to assess complementary options of large scale windsolar coupling , 2017, IEEE/CAA Journal of Automatica Sinica.

[35]  Symeon Papavassiliou,et al.  Interest-aware energy collection & resource management in machine to machine communications , 2018, Ad Hoc Networks.

[36]  Dimitrios Tzovaras,et al.  Reputation assessment mechanism for carpooling applications based on clustering user travel preferences , 2019 .

[37]  Peter I. Frazier,et al.  Distance dependent Chinese restaurant processes , 2009, ICML.

[38]  Haoyi Xiong,et al.  Multi-Task Allocation in Mobile Crowd Sensing with Individual Task Quality Assurance , 2018, IEEE Transactions on Mobile Computing.

[39]  Jiangtao Wang,et al.  Learning-Assisted Optimization in Mobile Crowd Sensing: A Survey , 2019, IEEE Transactions on Industrial Informatics.

[40]  Shengli Xie,et al.  Incentive Mechanism for Reliable Federated Learning: A Joint Optimization Approach to Combining Reputation and Contract Theory , 2019, IEEE Internet of Things Journal.

[41]  Abdulhadi Shoufan,et al.  Drone Pilot Identification by Classifying Radio-Control Signals , 2018, IEEE Transactions on Information Forensics and Security.

[42]  Xiaofeng Gao,et al.  MAB-Based Reinforced Worker Selection Framework for Budgeted Spatial Crowdsensing , 2022, IEEE Transactions on Knowledge and Data Engineering.

[43]  Yan Liu,et al.  ActiveCrowd: A Framework for Optimized Multitask Allocation in Mobile Crowdsensing Systems , 2016, IEEE Transactions on Human-Machine Systems.

[44]  Roswell H. Johnson,et al.  The Loss Ratio Method of Extrapolating Oil Well Decline Curves , 1927 .

[45]  D. Funder,et al.  Situational similarity and behavioral consistency: Subjective, objective, variable-centered, and person-centered approaches ☆ , 2004 .

[46]  MengChu Zhou,et al.  Pareto-Optimization for Scheduling of Crude Oil Operations in Refinery via Genetic Algorithm , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[47]  Maja Bott,et al.  The Role of Crowdsourcing for Better Governance in International Development , 2012 .

[48]  Mario Vanhoucke,et al.  Hybrid tabu search and a truncated branch-and-bound for the unrelated parallel machine scheduling problem , 2015, Comput. Oper. Res..

[49]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[50]  MengChu Zhou,et al.  A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[51]  Hadi Otrok,et al.  Multi-worker multi-task selection framework in mobile crowd sourcing , 2019, J. Netw. Comput. Appl..

[52]  Abdelkarim Erradi,et al.  A Deep Learning Spatiotemporal Prediction Framework for Mobile Crowdsourced Services , 2019, Mob. Networks Appl..