Burn After Reading: Online Adaptation for Cross-domain Streaming Data

In the context of online privacy, many methods propose complex privacy and security preserving measures to protect sensitive data. In this paper we argue that: not storing any sensitive data is the best form of security. Thus we propose an online framework that “burns after reading”, i.e. each online sample is immediately deleted after it is processed. Meanwhile, we tackle the inevitable distribution shift between the labeled public data and unlabeled private data as a problem of unsupervised domain adaptation. Specifically, we propose a novel algorithm that aims at the most fundamental challenge of the online adaptation setting–the lack of diverse source-target data pairs. Therefore, we design a Cross-Domain Bootstrapping approach, called CRODOBO, to increase the combined diversity across domains. Further, to fully exploit the valuable discrepancies among the diverse combinations, we employ the training strategy of multiple learners with cosupervision. CRODOBO achieves state-of-the-art online performance on four domain adaptation benchmarks.

[1]  D. Rueckert,et al.  Medical imaging deep learning with differential privacy , 2021, Scientific Reports.

[2]  Boaz Rafaely,et al.  Frequency-domain adaptation of causal digital filters , 2000, IEEE Trans. Signal Process..

[3]  David Berthelot,et al.  MixMatch: A Holistic Approach to Semi-Supervised Learning , 2019, NeurIPS.

[4]  Behnam Neyshabur,et al.  The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers , 2021, ICLR.

[5]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[6]  Yurong Liu,et al.  A survey of deep neural network architectures and their applications , 2017, Neurocomputing.

[7]  Bohyung Han,et al.  BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Daumé,et al.  Domain Adaptation meets Active Learning , 2010, HLT-NAACL 2010.

[9]  Jure Leskovec,et al.  WILDS: A Benchmark of in-the-Wild Distribution Shifts , 2021, ICML.

[10]  Trevor Darrell,et al.  Semi-Supervised Domain Adaptation via Minimax Entropy , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[12]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Dongrui Wu,et al.  Online and Offline Domain Adaptation for Reducing BCI Calibration Effort , 2017, IEEE Transactions on Human-Machine Systems.

[14]  Stefano Ermon,et al.  A DIRT-T Approach to Unsupervised Domain Adaptation , 2018, ICLR.

[15]  Kilian Q. Weinberger,et al.  Deep Co-Training with Task Decomposition for Semi-Supervised Domain Adaptation , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[17]  Hongyue Sun,et al.  Online domain adaptation for continuous cross-subject liver viability evaluation based on irregular thermal data , 2020, IISE Transactions.

[18]  Xiaogang Wang,et al.  DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Bo Wang,et al.  Moment Matching for Multi-Source Domain Adaptation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[20]  Sanjam Garg,et al.  Formalizing Data Deletion in the Context of the Right to Be Forgotten , 2020, IACR Cryptol. ePrint Arch..

[21]  Yoshua Bengio,et al.  Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[22]  Xian-Sheng Hua,et al.  Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Chao Li,et al.  Active multi-kernel domain adaptation for hyperspectral image classification , 2017, Pattern Recognit..

[24]  Haipeng Luo,et al.  Active Online Learning with Hidden Shifting Domains , 2021, AISTATS.

[25]  Avishek Saha,et al.  Active Supervised Domain Adaptation , 2011, ECML/PKDD.

[26]  Xin Yao,et al.  Resampling-Based Ensemble Methods for Online Class Imbalance Learning , 2015, IEEE Transactions on Knowledge and Data Engineering.

[27]  H. Vincent Poor,et al.  Federated Learning With Differential Privacy: Algorithms and Performance Analysis , 2019, IEEE Transactions on Information Forensics and Security.

[28]  Vijay Ganesh,et al.  Does AI Remember? Neural Networks and the Right to be Forgotten , 2020 .

[29]  Michael I. Jordan,et al.  Conditional Adversarial Domain Adaptation , 2017, NeurIPS.

[30]  Constantinos Patsakis,et al.  Backups and the right to be forgotten in the GDPR: An uneasy relationship , 2018, Comput. Law Secur. Rev..

[31]  Barbara Caputo,et al.  Kitting in the Wild through Online Domain Adaptation , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[32]  Kate Saenko,et al.  Return of Frustratingly Easy Domain Adaptation , 2015, AAAI.

[33]  Fabio Roli,et al.  Online Domain Adaptation for Person Re-Identification with a Human in the Loop , 2021, 2020 25th International Conference on Pattern Recognition (ICPR).

[34]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[35]  Loris Nanni,et al.  Handcrafted vs. non-handcrafted features for computer vision classification , 2017, Pattern Recognit..

[36]  Chia-Mu Yu,et al.  CAFE: Catastrophic Data Leakage in Vertical Federated Learning , 2021, ArXiv.

[37]  Runhua Xu,et al.  HybridAlpha: An Efficient Approach for Privacy-Preserving Federated Learning , 2019, AISec@CCS.

[38]  Antoine Cornuéjols,et al.  Online Learning: Searching for the Best Forgetting Strategy under Concept Drift , 2013, ICONIP.

[39]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[40]  Mahmood Ali,et al.  ERP system implementation in large enterprises - a systematic literature review , 2017, J. Enterp. Inf. Manag..

[41]  Changsheng Xu,et al.  Active Universal Domain Adaptation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[42]  Kate Saenko,et al.  Adversarial Dropout Regularization , 2017, ICLR.

[43]  Haizhou Li,et al.  Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[44]  Guangquan Zhang,et al.  Learning under Concept Drift: A Review , 2019, IEEE Transactions on Knowledge and Data Engineering.

[45]  The “Right to Be Forgotten” in the GDPR: Implementation Challenges and Potential Solutions , 2021, Privacy and Data Protection Challenges in the Distributed Era.

[46]  Yifan Zhang,et al.  COVID-DA: Deep Domain Adaptation from Typical Pneumonia to COVID-19 , 2020, ArXiv.

[47]  Song Han,et al.  Deep Leakage from Gradients , 2019, NeurIPS.

[48]  Andreas Savakis,et al.  ConDA: Continual Unsupervised Domain Adaptation , 2021, ArXiv.

[49]  Namil Kim,et al.  Drop to Adapt: Learning Discriminative Features for Unsupervised Domain Adaptation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[50]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[51]  Zhenguo Li,et al.  An Embedding Learning Framework for Numerical Features in CTR Prediction , 2020, KDD.

[52]  Peter Kieseberg,et al.  Humans forget, machines remember: Artificial intelligence and the Right to Be Forgotten , 2017, Comput. Law Secur. Rev..

[53]  Kate Saenko,et al.  Active Domain Adaptation via Clustering Uncertainty-weighted Embeddings , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[54]  Jianmin Wang,et al.  Multi-Adversarial Domain Adaptation , 2018, AAAI.

[55]  Erik G. Learned-Miller,et al.  Online domain adaptation of a pre-trained cascade of classifiers , 2011, CVPR 2011.

[56]  David Berthelot,et al.  FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence , 2020, NeurIPS.

[57]  Joseph K. Liu,et al.  DeepPAR and DeepDPA: Privacy Preserving and Asynchronous Deep Learning for Industrial IoT , 2020, IEEE Transactions on Industrial Informatics.

[58]  Henry Leung,et al.  Discriminative Partial Domain Adversarial Network , 2020, ECCV.

[59]  Roberto Souto Maior de Barros,et al.  A Boosting-like Online Learning Ensemble , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[60]  Lorenzo Bruzzone,et al.  Active Learning for Domain Adaptation in the Supervised Classification of Remote Sensing Images , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[61]  Yang Zhao,et al.  Deep High-Resolution Representation Learning for Visual Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[63]  Yuanzhi Li,et al.  Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data , 2018, NeurIPS.

[64]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[65]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[66]  Tatsuya Harada,et al.  Maximum Classifier Discrepancy for Unsupervised Domain Adaptation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[67]  Daniel Rueckert,et al.  A generic framework for privacy preserving deep learning , 2018, ArXiv.

[68]  Patrick Pérez,et al.  ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[69]  Noam Tirosh Reconsidering the ‘Right to be Forgotten’ – memory rights and the right to memory in the new media era , 2017 .

[70]  Philip H.S. Torr,et al.  Gradient Matching for Domain Generalization , 2021, ArXiv.

[71]  Kate Saenko,et al.  VisDA: A Synthetic-to-Real Benchmark for Visual Domain Adaptation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[72]  Ser-Nam Lim,et al.  Curriculum Manager for Source Selection in Multi-Source Domain Adaptation , 2020, ECCV.

[73]  Gabriela Csurka,et al.  Domain Adaptation for Visual Applications: A Comprehensive Survey , 2017, ArXiv.

[74]  Jiaolong Xu,et al.  Hierarchical online domain adaptation of deformable part-based models , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[75]  Yuchen Zhang,et al.  Bridging Theory and Algorithm for Domain Adaptation , 2019, ICML.

[76]  Lee A. Bygrave,et al.  A right to be forgotten? , 2014, Commun. ACM.

[77]  Rui Zhang,et al.  A Hybrid Approach to Privacy-Preserving Federated Learning , 2018, Informatik Spektrum.

[78]  Jin Chen,et al.  Domain Adversarial Reinforcement Learning for Partial Domain Adaptation , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[79]  Dong-Hyun Lee,et al.  Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[80]  Jiashi Feng,et al.  Domain Adaptation with Auxiliary Target Domain-Oriented Classifier , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[81]  Yi Yang,et al.  Contrastive Adaptation Network for Unsupervised Domain Adaptation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[82]  Yoshua Bengio,et al.  An empirical analysis of dropout in piecewise linear networks , 2013, ICLR.

[83]  Nuno Vasconcelos,et al.  Bidirectional Learning for Domain Adaptation of Semantic Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[84]  Huajun Chen,et al.  AliCG: Fine-grained and Evolvable Conceptual Graph Construction for Semantic Search at Alibaba , 2021, KDD.

[85]  Ian Osband,et al.  Risk versus Uncertainty in Deep Learning: Bayes, Bootstrap and the Dangers of Dropout , 2016 .

[86]  Quoc V. Le,et al.  Randaugment: Practical automated data augmentation with a reduced search space , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[87]  Xiaoyun Xu,et al.  Information Leakage by Model Weights on Federated Learning , 2020 .

[88]  Kurt Keutzer,et al.  Multi-source Domain Adaptation for Semantic Segmentation , 2019, NeurIPS.

[89]  Koby Crammer,et al.  Online Methods for Multi-Domain Learning and Adaptation , 2008, EMNLP.

[90]  Mikhail F. Kanevski,et al.  SVM-Based Boosting of Active Learning Strategies for Efficient Domain Adaptation , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[91]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[92]  Jiashi Feng,et al.  Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation , 2020, ICML.

[93]  Shigeki Sugano,et al.  Tactile object recognition using deep learning and dropout , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.

[94]  Rickmer Braren,et al.  Secure, privacy-preserving and federated machine learning in medical imaging , 2020, Nature Machine Intelligence.

[95]  Apostolos Burnetas,et al.  Optimal Adaptive Policies for Markov Decision Processes , 1997, Math. Oper. Res..

[96]  Trevor Darrell,et al.  Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[97]  Xiaolong Li,et al.  Privacy-Enhanced Data Collection Based on Deep Learning for Internet of Vehicles , 2020, IEEE Transactions on Industrial Informatics.

[98]  Eleonora Vig,et al.  Online Domain Adaptation for Multi-Object Tracking , 2015, BMVC.

[99]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[100]  Alessandro Mantelero,et al.  The European Right to Be Forgotten , 2012 .

[101]  Steven C. H. Hoi,et al.  Online Deep Learning: Learning Deep Neural Networks on the Fly , 2017, IJCAI.