论文信息 - Building Synthetic Simulated Environments for Configuring and Training Multi-camera Systems for Surveillance Applications

Building Synthetic Simulated Environments for Configuring and Training Multi-camera Systems for Surveillance Applications

Synthetic simulated environments are gaining popularity in the Deep Learning Era, as they can alleviate the effort and cost of two critical tasks to build multi-camera systems for surveillance applications: setting up the camera system to cover the use cases and generating the labeled dataset to train the required Deep Neural Networks (DNNs). However, there are no simulated environments ready to solve them for all kind of scenarios and use cases. Typically, ‘ad hoc’ environments are built, which cannot be easily applied to other contexts. In this work we present a methodology to build synthetic simulated environments with sufficient generality to be usable in different contexts, with little effort. Our methodology tackles the challenges of the appropriate parameterization of scene configurations, the strategies to generate randomly a wide and balanced range of situations of interest for training DNNs with synthetic data, and the quick image capturing from virtual cameras considering the rendering bottlenecks. We show a practical implementation example for the detection of incorrectly placed luggage in aircraft cabins, including the qualitative and quantitative analysis of the data generation process and its influence in a DNN training, and the required modifications to adapt it to other surveillance contexts.

[1] Sergey I. Nikolenko. Synthetic Data for Deep Learning , 2019, ArXiv.

[2] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[3] Ming-Syan Chen,et al. VIVID: Virtual Environment for Visual Deep Learning , 2018, ACM Multimedia.

[4] Rick Salay,et al. ProcSy: Procedural Synthetic Dataset Generation Towards Influence Factor Studies Of Semantic Segmentation Networks , 2019, CVPR Workshops.

[5] K.-K. Maninis,et al. Video Object Segmentation without Temporal Information , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Roman Seidel,et al. Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor Dataset for Deep Transfer Learning , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7] Oihana Otaegui,et al. Web-based Video-Assisted Point Cloud Annotation for ADAS validation , 2019, Web3D.

[8] Lars Petersson,et al. Effective Use of Synthetic Data for Urban Scene Semantic Segmentation , 2018, ECCV.

[9] Yuling Xi,et al. Visual question answering model based on visual relationship detection , 2020, Signal Process. Image Commun..

[10] Krzysztof Czarnecki,et al. Precise Synthetic Image and LiDAR (PreSIL) Dataset for Autonomous Vehicle Perception , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[11] Vishal M. Patel,et al. Domain Adaptation for Visual Understanding , 2020, Domain Adaptation for Visual Understanding.

[12] Harm de Vries,et al. RMSProp and equilibrated adaptive learning rates for non-convex optimization. , 2015 .

[13] Zhuowen Tu,et al. Deeply Supervised Salient Object Detection with Short Connections , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Paul Bourke,et al. Blender and Immersive Gaming in a Hemispherical Dome , 2010, CGAMES 2010.

[15] Ashish Kapoor,et al. AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles , 2017, FSR.

[16] Viktor Seib,et al. Mixing Real and Synthetic Data to Enhance Neural Network Training - A Review of Current Approaches , 2020, ArXiv.

[17] Hristo Bojinov,et al. Object Detection Using Deep CNNs Trained on Synthetic Images , 2017, ArXiv.

[18] Matthew E. Taylor,et al. A survey and critique of multiagent deep reinforcement learning , 2019, Autonomous Agents and Multi-Agent Systems.

[19] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[20] Nanning Zheng,et al. View Adaptive Neural Networks for High Performance Skeleton-Based Human Action Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21] Taghi M. Khoshgoftaar,et al. A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[22] Michael J. Black,et al. SMPL: A Skinned Multi-Person Linear Model , 2023 .