Multimodal Federated Learning

Federated learning is proposed as an alternative to centralized machine learning since its client-server structure provides better privacy protection and scalability in real-world applications. In many applications, such as smart homes with IoT devices, local data on clients are generated from different modalities such as sensory, visual, and audio data. Existing federated learning systems only work on local data from a single modality, which limits the scalability of the systems. In this paper, we propose a multimodal and semi-supervised federated learning framework that trains autoencoders to extract shared or correlated representations from different local data modalities on clients. In addition, we propose a multimodal FedAvg algorithm to aggregate local autoencoders trained on different data modalities. We use the learned global autoencoder for a downstream classification task with the help of auxiliary labelled data on the server. We empirically evaluate our framework on different modalities including sensory data, depth camera videos, and RGB camera videos. Our experimental results demonstrate that introducing data from multiple modalities into federated learning can improve its accuracy. In addition, we can use labelled data from only one modality for supervised learning on the server and apply the learned model to testing data from other modalities to achieve decent accuracy (e.g., approximately 70% as the best performance), especially when combining contributions from both unimodal clients and multimodal clients.

[1]  Thomas Plötz,et al.  Ensembles of Deep LSTM Learners for Activity Recognition using Wearables , 2017, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[2]  Ang Li,et al.  GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs , 2020, 2022 IEEE International Conference on Data Mining (ICDM).

[3]  Eunho Yang,et al.  Federated Semi-Supervised Learning with Inter-Client Consistency , 2020, ArXiv.

[4]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[5]  Jeff A. Bilmes,et al.  On Deep Multi-View Representation Learning , 2015, ICML.

[6]  Richard Nock,et al.  Advances and Open Problems in Federated Learning , 2021, Found. Trends Mach. Learn..

[7]  Tianjian Chen,et al.  FedMVT: Semi-supervised Vertical Federated Learning with MultiView Training , 2020, ArXiv.

[8]  Pierre Baldi,et al.  Autoencoders, Unsupervised Learning, and Deep Architectures , 2011, ICML Unsupervised and Transfer Learning.

[9]  Ruslan Salakhutdinov,et al.  Think Locally, Act Globally: Federated Learning with Local and Global Representations , 2020, ArXiv.

[10]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[11]  Andrea Cavallaro,et al.  DANA , 2020, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[12]  Nitish Srivastava,et al.  Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.

[13]  Hamed Haddadi,et al.  Privacy-preserving activity and health monitoring on databox , 2020, EdgeSys@EuroSys.

[14]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[15]  Xin Qin,et al.  FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare , 2019, IEEE Intelligent Systems.

[16]  Héctor Pomares,et al.  mHealthDroid: A Novel Framework for Agile Development of Mobile Health Applications , 2014, IWAAL.

[17]  Bogdan Kwolek,et al.  Human fall detection on embedded platform using depth maps and wireless accelerometer , 2014, Comput. Methods Programs Biomed..

[18]  Flora D. Salim,et al.  Federated Self-Supervised Learning of Multisensor Representations for Embedded Intelligence , 2020, IEEE Internet of Things Journal.

[19]  Anit Kumar Sahu,et al.  Federated Learning: Challenges, Methods, and Future Directions , 2019, IEEE Signal Processing Magazine.

[20]  Xian Wu,et al.  Federated Learning for Vision-and-Language Grounding Problems , 2020, AAAI.

[21]  Sebastian U. Stich,et al.  Ensemble Distillation for Robust Model Fusion in Federated Learning , 2020, NeurIPS.

[22]  Thomas Plötz,et al.  Deep, Convolutional, and Recurrent Models for Human Activity Recognition Using Wearables , 2016, IJCAI.

[23]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[24]  Johan Lukkien,et al.  Multi-task Self-Supervised Learning for Human Activity Detection , 2019, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[25]  Ming Liu,et al.  Federated Imitation Learning: A Novel Framework for Cloud Robotic Systems With Heterogeneous Sensor Data , 2019, IEEE Robotics and Automation Letters.

[26]  Ameet Talwalkar,et al.  Federated Multi-Task Learning , 2017, NIPS.

[27]  Rui Li,et al.  Online Federated Multitask Learning , 2020 .

[28]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[29]  Wei Zhang,et al.  Federated learning for machinery fault diagnosis with dynamic validation and self-supervision , 2021, Knowl. Based Syst..

[30]  Xukan Ran,et al.  Deep Learning With Edge Computing: A Review , 2019, Proceedings of the IEEE.

[31]  Yi Liu,et al.  RC-SSFL: Towards Robust and Communication-efficient Semi-supervised Federated Learning System , 2020, ArXiv.

[32]  Ruzena Bajcsy,et al.  Berkeley MHAD: A comprehensive Multimodal Human Action Database , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).

[33]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Fenglong Ma,et al.  FedSiam: Towards Adaptive Federated Semi-Supervised Learning , 2020, 2012.03292.

[35]  Yue Zhao,et al.  Federated Learning with Non-IID Data , 2018, ArXiv.

[36]  Tanir Ozcelebi,et al.  Towards federated unsupervised representation learning , 2020, EdgeSys@EuroSys.

[37]  VALENTIN RADU,et al.  Multimodal Deep Learning for Activity and Context Recognition , 2018, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[38]  Sébastien Gambs,et al.  IOTFLA : A Secured and Privacy-Preserving Smart Home Architecture Implementing Federated Learning , 2019, 2019 IEEE Security and Privacy Workshops (SPW).

[39]  Hamed Haddadi,et al.  Semi-supervised Federated Learning for Activity Recognition , 2020, ArXiv.

[40]  Ricardo Chavarriaga,et al.  The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition , 2013, Pattern Recognit. Lett..

[41]  Joseph E. Gonzalez,et al.  Benchmarking Semi-supervised Federated Learning , 2020, ArXiv.

[42]  Bradford J. Wood,et al.  Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan , 2020, Medical Image Analysis.

[43]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[44]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[45]  Mani B. Srivastava,et al.  Enabling Edge Devices that Learn from Each Other: Cross Modal Training for Activity Recognition , 2018, EdgeSys@MobiSys.