Unsupervised Domain Adaptation for Object Detection Using Distribution Matching in Various Feature Level

As the research on deep learning has become more active, the need for a lot of data has emerged. However, there are limitations in acquiring real data such as digital forensics, so domain adaptation technology is required to overcome this problem. This paper considers distribution matching in various feature level for unsupervised domain adaptation for object detection with a single stage detector. The object detection task assumes that training and test data are drawn from the same distribution; however, in a real environment, there is a domain gap between training and test data which leads to degrading performance significantly. Therefore, we aim to learn a model to generalize well in target domain of object detection by using maximum mean discrepancy (MMD) in various feature levels. We adjust MMD based on single shot multibox detector (SSD) model which is a single stage detector that learns to localize objects with various size using a multi-layer design of bounding box regression and infers object class simultaneously. The MMD loss on high-level features between source and target domain effectively reduces the domain discrepancy to learn a domain-invariant feature in SSD model. We evaluate the approaches using Syn2real object detection dataset. Experimental results show that reducing the domain shift in high-level features improves the cross-domain robustness of object detection, and domain adaptation works better with simple MMD method than complex method as GAN.

[1]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Kate Saenko,et al.  Deep CORAL: Correlation Alignment for Deep Domain Adaptation , 2016, ECCV Workshops.

[3]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[4]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[5]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[7]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[9]  Trevor Darrell,et al.  Simultaneous Deep Transfer Across Domains and Tasks , 2015, ICCV.

[10]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[11]  Luc Van Gool,et al.  Domain Adaptive Faster R-CNN for Object Detection in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.