Communication-Efficient Hierarchical Federated Learning for IoT Heterogeneous Systems with Imbalanced Data

Federated learning (FL) is a distributed learning methodology that allows multiple nodes to cooperatively train a deep learning model without sharing their local data. It is a promising solution for telemonitoring systems that demand intensive data collection from different locations while maintaining a strict privacy constraint. Due to privacy concerns and critical communication bottlenecks, it can become impractical to send the updated FL models to a centralized server. Thus, this paper studies the potential of hierarchical FL in heterogeneous IoT systems and proposes an optimized solution for user assignment and resource allocation across multiple edge nodes. In particular, this work focuses on a generic class of machine learning models that are trained using gradient-descent-based schemes, while considering the practical constraint of data that is non-uniformly distributed across users. We evaluate the proposed system using two real-world datasets and show that it outperforms state-of-the-art FL solutions. Specifically, our numerical results show that the approach provides a 4-6% increase in classification accuracy with respect to hierarchical FL schemes that use distance-based user assignment. Furthermore, for the same model accuracy, the proposed approach can significantly accelerate FL training and reduce communication overhead, providing a 75-85% reduction in the communication rounds between edge nodes and the centralized server.
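To make the hierarchical structure concrete, the following is a minimal sketch of one two-tier aggregation pass (users to edge nodes, then edge nodes to the central server) using sample-count-weighted averaging in the style of FedAvg. It is an illustration under stated assumptions, not the optimized user assignment or resource allocation proposed in this work: the names `weighted_average`, `hierarchical_aggregate`, and the toy user-to-edge assignment are hypothetical, and models are represented as flat lists of floats.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def weighted_average(models: Sequence[Vector], weights: Sequence[float]) -> Vector:
    """Return the weighted average of equally sized parameter vectors."""
    total = float(sum(weights))
    dim = len(models[0])
    return [
        sum(w * m[i] for m, w in zip(models, weights)) / total
        for i in range(dim)
    ]


def hierarchical_aggregate(
    user_models: Dict[str, Vector],
    user_samples: Dict[str, int],
    assignment: Dict[str, List[str]],
) -> Vector:
    """One aggregation pass: users -> edge nodes -> central server.

    `assignment` maps each edge node id to the user ids it serves.
    Here it is fixed by hand (an assumption); choosing it well is the
    optimization problem the paper addresses.
    """
    edge_models: List[Vector] = []
    edge_samples: List[int] = []
    for users in assignment.values():
        if not users:
            continue  # an edge node with no users contributes nothing
        edge_models.append(
            weighted_average(
                [user_models[u] for u in users],
                [user_samples[u] for u in users],
            )
        )
        edge_samples.append(sum(user_samples[u] for u in users))
    # The server weights each edge model by the samples behind it.
    return weighted_average(edge_models, edge_samples)


# Toy data: three users with unequal (imbalanced) sample counts.
user_models = {"u1": [1.0, 0.0], "u2": [3.0, 2.0], "u3": [5.0, 4.0]}
user_samples = {"u1": 10, "u2": 30, "u3": 60}
assignment = {"edge_a": ["u1", "u2"], "edge_b": ["u3"]}

global_model = hierarchical_aggregate(user_models, user_samples, assignment)
print(global_model)
```

In this single-pass sketch the result does not depend on the assignment, because sample-weighted averaging at both tiers equals a flat weighted average over all users. The assignment starts to matter in practice when edge nodes run several local aggregation rounds between global synchronizations, since the data distribution seen by each edge node then shapes the intermediate models.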
