Paper Information - Deep Domain Generalization via Conditional Invariant Adversarial Networks

Deep Domain Generalization via Conditional Invariant Adversarial Networks

Domain generalization aims to learn a classification model from multiple source domains and generalize it to unseen target domains. A critical problem in domain generalization involves learning domain-invariant representations. Let X and Y denote the features and the labels, respectively. Under the assumption that the conditional distribution P(Y|X) remains unchanged across domains, earlier approaches to domain generalization learned the invariant representation T(X) by minimizing the discrepancy of the marginal distribution P(T(X)) across domains. However, the assumption that P(Y|X) is stable does not necessarily hold in practice. In addition, the representation learning function T is usually constrained to a simple linear transformation or a shallow network. To address these two drawbacks, we propose an end-to-end conditional invariant deep domain generalization approach that leverages deep neural networks for domain-invariant representation learning. The domain-invariance property is guaranteed through a conditional invariant adversarial network that can learn domain-invariant representations w.r.t. the joint distribution P(T(X), Y), provided that the target domain data are not severely class-imbalanced. We perform various experiments to demonstrate the effectiveness of the proposed method.
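The gap between marginal and class-conditional invariance can be made concrete with a small discrete example. The sketch below is a toy illustration only, not the paper's adversarial network: the two joint tables, the helper names (`kl`, `js`, `marginal`, `conditional`), and the choice of the Jensen-Shannon divergence as the discrepancy measure are all assumptions made for this example. It builds two hypothetical domains whose marginals P(T(X)) coincide exactly while their class-conditionals P(T(X)|Y) are maximally different, which is the situation a purely marginal alignment objective cannot detect.

```python
import math


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q) for discrete distributions (natural log)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def js(p, q):
    """Jensen-Shannon divergence between two discrete distributions."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def marginal(joint):
    """P(t) from a joint table indexed as joint[y][t]."""
    n_t = len(joint[0])
    return [sum(row[t] for row in joint) for t in range(n_t)]


def conditional(joint, y):
    """P(t | y) from a joint table indexed as joint[y][t]."""
    p_y = sum(joint[y])
    return [v / p_y for v in joint[y]]


# Hypothetical joint distributions P(T(X) = t, Y = y) for two source domains.
# In domain A the feature value agrees with the label; in domain B it is flipped.
domain_a = [[0.5, 0.0],
            [0.0, 0.5]]
domain_b = [[0.0, 0.5],
            [0.5, 0.0]]

# Discrepancy of the marginals P(T(X)): zero, so marginal alignment sees no shift.
marginal_gap = js(marginal(domain_a), marginal(domain_b))

# Discrepancy of the class-conditionals P(T(X) | Y = y), one value per class.
class_gaps = [js(conditional(domain_a, y), conditional(domain_b, y))
              for y in range(len(domain_a))]

print("marginal gap:", marginal_gap)
print("class-conditional gaps:", class_gaps)
```

Since P(T(X), Y) = P(T(X)|Y) P(Y), matching the class-conditionals across domains matches the joint only when the class prior P(Y) is also comparable across domains, which is consistent with the abstract's caveat about severe class imbalance.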
