论文信息 - Validation of Simulation-Based Testing: Bypassing Domain Shift with Label-to-Image Synthesis

Validation of Simulation-Based Testing: Bypassing Domain Shift with Label-to-Image Synthesis

Many machine learning applications can benefit from simulated data for systematic validation - in particular if real-life data is difficult to obtain or annotate. However, since simulations are prone to domain shift w.r.t. real-life data, it is crucial to verify the transferability of the obtained results.We propose a novel framework consisting of a generative label-to-image synthesis model together with different transferability measures to inspect to what extent we can transfer testing results of semantic segmentation models from synthetic data to equivalent real-life data. With slight modifications, our approach is extendable to, e.g., general multi-class classification tasks. Grounded on the transferability analysis, our approach additionally allows for extensive testing by incorporating controlled simulations. We validate our approach empirically on a semantic segmentation task on driving scenes. Transferability is tested using correlation analysis of IoU and a learned discriminator. Although the latter can distinguish between real-life and synthetic tests, in the former we observe surprisingly strong correlations of 0.7 for both cars and pedestrians.

[1] Foutse Khomh,et al. On Testing Machine Learning Programs , 2018, J. Syst. Softw..

[2] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.

[3] Donghwan Shin,et al. Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous Car Case Study , 2020, 2020 IEEE 13th International Conference on Software Testing, Validation and Verification (ICST).

[4] Birgit Kirsch,et al. Informed Machine Learning -- A Taxonomy and Survey of Integrating Knowledge into Learning Systems , 2019 .

[5] Tomas Pfister,et al. Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] E. James Whitehead,et al. A Modular Architecture for Procedural Generation of Towns, Intersections and Scenarios for Testing Autonomous Vehicles , 2020, 2020 IEEE Intelligent Vehicles Symposium (IV).

[7] Yang Zhao,et al. Deep High-Resolution Representation Learning for Visual Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Germán Ros,et al. CARLA: An Open Urban Driving Simulator , 2017, CoRL.

[9] Martin Schels,et al. A Survey on Methods for the Safety Assurance of Machine Learning Based Systems , 2020 .

[10] Hanno Gottschalk,et al. Prediction Error Meta Classification in Semantic Segmentation: Detection via Aggregated Dispersion Measures of Softmax Probabilities , 2018, 2020 International Joint Conference on Neural Networks (IJCNN).

[11] Mark Harman,et al. Machine Learning Testing: Survey, Landscapes and Horizons , 2019, IEEE Transactions on Software Engineering.

[12] Sanja Fidler,et al. Meta-Sim: Learning to Generate Synthetic Datasets , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[13] Wouter M. Kouw,et al. A Review of Domain Adaptation without Target Labels , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Jan Kautz,et al. Video-to-Video Synthesis , 2018, NeurIPS.

[15] Jingdong Wang,et al. OCNet: Object Context Network for Scene Parsing , 2018, ArXiv.

[16] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Mei Wang,et al. Deep Visual Domain Adaptation: A Survey , 2018, Neurocomputing.

[18] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20] Nenghai Yu,et al. Coherent Online Video Style Transfer , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21] Andreas Geiger,et al. Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Rafet Sifa,et al. Combining Machine Learning and Simulation to a Hybrid Modelling Approach: Current and Future Directions , 2020, IDA.

[23] Lukas Rummelhard,et al. Validation of Perception and Decision-Making Systems for Autonomous Driving via Statistical Model Checking , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[24] Xilin Chen,et al. Interlaced Sparse Self-Attention for Semantic Segmentation , 2019, ArXiv.

[25] Sebastian Wagner,et al. Towards Cross-Verification and Use of Simulation in the Assessment of Automated Driving , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[26] Eike Möhlmann,et al. Fundamental Considerations around Scenario-Based Testing for Automated Driving , 2020, 2020 IEEE Intelligent Vehicles Symposium (IV).

[27] Xilin Chen,et al. Object-Contextual Representations for Semantic Segmentation , 2019, ECCV.

[28] Lukas Hartjen,et al. Application of Evolutionary Algorithms and Criticality Metrics for the Verification and Validation of Automated Driving Systems at Urban Intersections , 2020, 2020 IEEE Intelligent Vehicles Symposium (IV).

[29] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[30] Sanja Fidler,et al. Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation , 2020, ECCV.

[31] Qiao Wang,et al. VirtualWorlds as Proxy for Multi-object Tracking Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Xilin Chen,et al. SegFix: Model-Agnostic Boundary Refinement for Segmentation , 2020, ECCV.