论文信息 - Synthetic Data Generation to Mitigate the Low/No-Shot Problem in Machine Learning

Synthetic Data Generation to Mitigate the Low/No-Shot Problem in Machine Learning

The low/no-shot problem refers to a lack of available data for training deep learning algorithms. In remote sensing, complete image data sets are rare and do not always include the targets of interest. We propose a method to rapidly generate highfidelity synthetic satellite imagery featuring targets of interest over a range of solar illuminations and platform geometries. Specifically, we used the Digital Imaging and Remote Sensing Image Generation model and a custom image simulator to produce synthetic imagery of C130 aircraft in place of real Worldview-3 imagery. Our synthetic imagery was supplemented with real Worldview-3 images to test the efficacy of training deep learning algorithms with synthetic data. We deliberately chose a challenging test case of distinguishing C130s from other aircraft, or neither. Results show a negligible improvement in automatic target classification when synthetic data is supplemented with a small amount of real imagery. However, training with synthetic data alone only achieves F1-scores in line with a random classifier, suggesting that there is still significant domain mismatch between the real and synthetic datasets.

Gary Chern | Mark Pritt | Emily E. Berkson | Jared D. VanCor | Steven Esposito

[1] John Kerekes,et al. Efficient generation of image chips for training deep learning algorithms , 2017, Defense + Security.

[2] Jan Kautz,et al. Domain Stylization: A Strong, Simple Baseline for Synthetic to Real Image Domain Adaptation , 2018, ArXiv.

[3] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Robert D. Fiete,et al. Modeling the Imaging Chain of Digital Cameras , 2010 .

[5] Timo Aila,et al. A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[7] Varun Jampani,et al. Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[8] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[9] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[10] Qiwen Fu,et al. Colorization Using ConvNet and GAN , 2017 .

[11] Xueting Li,et al. A Closed-form Solution to Photorealistic Image Stylization , 2018, ECCV.

[12] Austen Groener,et al. Globally-scalable Automated Target Recognition (GATR) , 2019, 2019 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).