论文信息 - A ug 2 02 1 NeuraCrypt is not private

A ug 2 02 1 NeuraCrypt is not private

NeuraCrypt (Yara et al. arXiv 2021) is an algorithm that converts a sensitive dataset to an encoded dataset so that (1) it is still possible to train machine learning models on the encoded data, but (2) an adversary who has access only to the encoded dataset can not learn much about the original sensitive dataset. We break NeuraCrypt’s privacy claims, by perfectly solving the authors’ public challenge, and by showing that NeuraCrypt does not satisfy the formal privacy definitions posed in the original paper. Our attack consists of a series of boosting steps that, coupled with various design flaws, turns a 1% attack advantage into a 100% complete break of the scheme.

[1] Muriel Medard,et al. NeuraCrypt: Hiding Private Health Data via Random Neural Networks for Public Training , 2021, ArXiv.

[2] Somesh Jha,et al. Is Private Learning Possible with Instance Encoding? , 2021, 2021 IEEE Symposium on Security and Privacy (SP).

[3] Kai Li,et al. InstaHide: Instance-hiding Schemes for Private Distributed Learning , 2020, ICML.

[4] Andrew M. Dai,et al. Gmail Smart Compose: Real-Time Assisted Writing , 2019, KDD.

[5] Yifan Yu,et al. CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison , 2019, AAAI.

[6] Ahmed Hosny,et al. Artificial intelligence in radiology , 2018, Nature Reviews Cancer.

[7] Sebastian Thrun,et al. Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[8] Dinggang Shen,et al. Machine Learning in Medical Imaging , 2012, Lecture Notes in Computer Science.

[9] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Eyal Kushilevitz,et al. From Differential Cryptanalysis to Ciphertext-Only Attacks , 1998, CRYPTO.

[11] Silvio Micali,et al. The knowledge complexity of interactive proof-systems , 1985, STOC '85.

[12] Silvio Micali,et al. Probabilistic Encryption , 1984, J. Comput. Syst. Sci..