EFMVFL: An Efficient and Flexible Multi-party Vertical Federated Learning without a Third Party

Federated learning allows multiple participants to conduct joint modeling without disclosing their local data. Vertical federated learning (VFL) handles the situation where participants share the same ID space and different feature spaces. In most VFL frameworks, to protect the security and privacy of the participants’ local data, a third party is needed to generate homomorphic encryption key pairs and perform decryption operations. In this way, the third party is granted the right to decrypt information related to model parameters. However, it isn’t easy to find such a credible entity in the real world. Existing methods for solving this problem are either communication-intensive or unsuitable for multi-party scenarios. By combining secret sharing and homomorphic encryption, we propose a novel VFL framework without a third party called EFMVFL, which supports flexible expansion to multiple participants with low communication overhead and is applicable to generalized linear models. We give instantiations of our framework under logistic regression and Poisson regression. Theoretical analysis and experiments show that our framework is secure, more efficient, and easy to be extended to multiple participants.

[1]  Peter Rindal,et al.  ABY3: A Mixed Protocol Framework for Machine Learning , 2018, IACR Cryptol. ePrint Arch..

[2]  Huafei Zhu,et al.  Privacy-Preserving Weighted Federated Learning Within the Secret Sharing Framework , 2020, IEEE Access.

[3]  Philip S. Yu,et al.  LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy , 2021, IJCAI.

[4]  Sameer Wagh,et al.  SecureNN: 3-Party Secure Computation for Neural Network Training , 2019, Proc. Priv. Enhancing Technol..

[5]  Marcel Keller,et al.  MP-SPDZ: A Versatile Framework for Multi-Party Computation , 2020, IACR Cryptol. ePrint Arch..

[6]  Qiang Yang,et al.  SecureBoost: A Lossless Federated Learning Framework , 2019, IEEE Intelligent Systems.

[7]  Richard Nock,et al.  Private federated learning on vertically partitioned data via entity resolution and additively homomorphic encryption , 2017, ArXiv.

[8]  Jong-Seon No,et al.  Low-Complexity Deep Convolutional Neural Networks on Fully Homomorphic Encryption Using Multiplexed Convolutions , 2021, IACR Cryptol. ePrint Arch..

[9]  Yifei Zhang,et al.  Additively Homomorphical Encryption based Deep Neural Network for Asymmetrically Collaborative Machine Learning , 2020, ArXiv.

[10]  Payman Mohassel,et al.  SecureML: A System for Scalable Privacy-Preserving Machine Learning , 2017, 2017 IEEE Symposium on Security and Privacy (SP).

[11]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[12]  John Domingue,et al.  The Future of the Internet , 1999, Academia Letters.

[13]  Yonggang Zhang,et al.  Privacy-preserving two-parties logistic regression on vertically partitioned data using asynchronous gradient sharing , 2020 .

[14]  Beng Chin Ooi,et al.  Privacy preserving vertical federated learning for tree-based models , 2020, Proc. VLDB Endow..

[15]  Lidia Arroyo Prieto Acm , 2020, Encyclopedia of Cryptography and Security.

[16]  Swaroop Ramaswamy,et al.  Federated Learning for Emoji Prediction in a Mobile Keyboard , 2019, ArXiv.

[17]  Ibrahim Aliyu,et al.  A Blockchain-Based Federated Forest for SDN-Enabled In-Vehicle Network Intrusion Detection System , 2021, IEEE Access.

[18]  Andrew Chi-Chih Yao,et al.  Protocols for Secure Computations (Extended Abstract) , 1982, FOCS.

[19]  IEEE Access , 2021, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[20]  Li Wang,et al.  When Homomorphic Encryption Marries Secret Sharing: Secure Large-Scale Sparse Logistic Regression and Applications in Risk Control , 2020, KDD.

[21]  Mariana Raykova,et al.  Secure Poisson Regression , 2021, IACR Cryptol. ePrint Arch..

[22]  Song Han,et al.  Deep Leakage from Gradients , 2019, NeurIPS.

[23]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[24]  Donald Beaver,et al.  Efficient Multiparty Protocols Using Circuit Randomization , 1991, CRYPTO.

[25]  Xiaoqian Jiang,et al.  Secure Logistic Regression based on Homomorphic Encryption , 2018, IACR Cryptol. ePrint Arch..

[26]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[27]  Cong Wang,et al.  GELU-Net: A Globally Encrypted, Locally Unencrypted Deep Neural Network for Privacy-Preserved Learning , 2018, IJCAI.