Joint Annotator-and-Spectrum Allocation in Wireless Networks for Crowd Labeling

The massive sensing data generated by Internet-of-Things will provide fuel for ubiquitous artificial intelligence (AI), automating the operations of our society ranging from transportation to healthcare. The implementation of ubiquitous AI, however, entails labelling of an enormous amount of data prior to the training of AI models via supervised learning. To tackle this challenge, we explore a new direction called wireless crowd labelling, which involves downloading data to many imperfect mobile annotators for repetition labelling with an aim of exploiting multicasting in wireless networks. In this cross-disciplinary area, the rate-distortion theory and the principle of repetition labelling for accuracy improvement together give rise to a new tradeoff between radio-and-annotator resources under a constraint on labelling accuracy. Building on the tradeoff and aiming at maximizing the labelling throughput, this work focuses on the joint optimization of encoding rate, annotator clustering, and sub-channel allocation, which results in an NP-hard integer programming problem. To devise an efficient solution approach, we establish an optimal sequential annotator-clustering scheme based on the order of decreasing signal-to-noise ratios, thereby allowing the optimal solution to be found by an efficient tree search. This solution can be further simplified when the channels are symmetric. Alternatively, the optimization problem can be recognized as a knapsack problem, which can be efficiently solved in pseudo-polynomial time by means of dynamic programming. In addition, the optimal polices are derived for the annotator constrained and spectrum constrained cases. Last, simulation results are presented to demonstrate the significant throughput gains based on the optimal solution compared with decoupled allocation of the two types of resources.

[1]  Pietro Perona,et al.  Online crowdsourcing: Rating annotators and obtaining cost-effective labels , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[2]  John E. Beasley Multidimensional Knapsack Problems , 2009, Encyclopedia of Optimization.

[3]  Javier R. Movellan,et al.  Whose Vote Should Count More: Optimal Integration of Labels from Labelers of Unknown Expertise , 2009, NIPS.

[4]  A. Roth,et al.  Two-sided matching , 1990 .

[5]  Zhi Ding,et al.  Federated Learning via Over-the-Air Computation , 2018, IEEE Transactions on Wireless Communications.

[6]  Alexander Schrijver,et al.  Combinatorial optimization. Polyhedra and efficiency. , 2003 .

[7]  Jinho Choi,et al.  Minimum Power Multicast Beamforming With Superposition Coding for Multiresolution Broadcast and Application to NOMA Systems , 2015, IEEE Transactions on Communications.

[8]  Alan M. Frieze Shortest path algorithms for knapsack type problems , 1976, Math. Program..

[9]  Zhihan Lv,et al.  A multicast delivery approach with minimum energy consumption for wireless multi-hop networks , 2016, Telecommun. Syst..

[10]  H. Vincent Poor,et al.  Application of Non-Orthogonal Multiple Access in LTE and 5G Networks , 2015, IEEE Communications Magazine.

[11]  Wei Yu,et al.  Content-Centric Sparse Multicast Beamforming for Cache-Enabled Cloud RAN , 2015, IEEE Transactions on Wireless Communications.

[12]  Giuseppe Piro,et al.  Multicast and Broadcast Services Over Mobile Networks: A Survey on Standardized Approaches and Scientific Outcomes , 2019, IEEE Communications Surveys & Tutorials.

[13]  Xindong Wu,et al.  Learning from crowdsourced labeled data: a survey , 2016, Artificial Intelligence Review.

[14]  Arumugam Nallanathan,et al.  Energy-Efficient Chance-Constrained Resource Allocation for Multicast Cognitive OFDM Network , 2016, IEEE Journal on Selected Areas in Communications.

[15]  Meixia Tao,et al.  Optimal dynamic multicast scheduling for cache-enabled content-centric wireless networks , 2015, 2015 IEEE International Symposium on Information Theory (ISIT).

[16]  Konstantinos Poularakis,et al.  Exploiting Caching and Multicast for 5G Wireless Networks , 2016, IEEE Transactions on Wireless Communications.

[17]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[18]  H. Vincent Poor,et al.  On the Spectral Efficiency and Security Enhancements of NOMA Assisted Multicast-Unicast Streaming , 2016, IEEE Transactions on Communications.

[19]  L. Hanzo,et al.  Non-Orthogonal Multiple Access for 5G and Beyond , 2018 .

[20]  Lajos Hanzo,et al.  Nonorthogonal Multiple Access for 5G and Beyond , 2017, Proceedings of the IEEE.

[21]  Jiaru Lin,et al.  Resource Allocation in Multicast OFDM Systems: Lower/Upper Bounds and Suboptimal Algorithm , 2011, IEEE Communications Letters.

[22]  A. S. Madhukumar,et al.  Genetic Algorithm Aided Proportional Fair Resource Allocation in Multicast OFDM Systems , 2015, IEEE Transactions on Broadcasting.

[23]  Yong Wang,et al.  Low-Latency Broadband Analog Aggregation for Federated Edge Learning , 2018, ArXiv.

[24]  Deniz Gündüz,et al.  Machine Learning at the Wireless Edge: Distributed Stochastic Gradient Descent Over-the-Air , 2019, 2019 IEEE International Symposium on Information Theory (ISIT).

[25]  Injong Rhee,et al.  Multicast Scheduling in Cellular Data Networks , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[26]  A. Benjamin,et al.  Proofs that Really Count: The Art of Combinatorial Proof , 2003 .

[27]  Pietro Perona,et al.  The Multidimensional Wisdom of Crowds , 2010, NIPS.

[28]  Kaibin Huang,et al.  Towards an Intelligent Edge: Wireless Communication Meets Machine Learning , 2018, ArXiv.

[29]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[30]  Sergey Andreev,et al.  Wirelessly Powered Crowd Sensing: Joint Power Transfer, Sensing, Compression, and Transmission , 2017, IEEE Journal on Selected Areas in Communications.

[31]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[32]  Kiseon Kim,et al.  Multicast Scheduling and Resource Allocation Algorithms for OFDMA-Based Systems: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[33]  Fan Ye,et al.  Mobile crowdsensing: current state and future challenges , 2011, IEEE Communications Magazine.

[34]  T. Cover,et al.  Rate Distortion Theory , 2001 .

[35]  Panagiotis G. Ipeirotis,et al.  Repeated labeling using multiple noisy labelers , 2012, Data Mining and Knowledge Discovery.

[36]  Emil Björnson,et al.  Max–Min Fair Transmit Precoding for Multi-Group Multicasting in Massive MIMO , 2017, IEEE Transactions on Wireless Communications.

[37]  Jean C. Walrand,et al.  Incentive mechanisms for smartphone collaboration in data acquisition and distributed computing , 2012, 2012 Proceedings IEEE INFOCOM.

[38]  Yunhao Liu,et al.  Incentives for Mobile Crowd Sensing: A Survey , 2016, IEEE Communications Surveys & Tutorials.