Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding

This work addresses the problem of semantic scene understanding under dense fog. Although considerable progress has been made in semantic scene understanding, it is mainly related to clear-weather scenes. Extending recognition methods to adverse weather conditions such as fog is crucial for outdoor applications. In this paper, we propose a novel method, named Curriculum Model Adaptation (CMAda), which gradually adapts a semantic segmentation model from light synthetic fog to dense real fog in multiple steps, using both synthetic and real foggy data. In addition, we present three other main stand-alone contributions: (1) a novel method to add synthetic fog to real, clear-weather scenes using semantic input; (2) a new fog density estimator; (3) the Foggy Zurich dataset comprising 3808 real foggy images, with pixel-level semantic annotations for 16 images with dense fog. Our experiments show that (1) our fog simulation slightly outperforms a state-of-the-art competing simulation with respect to the task of semantic foggy scene understanding (SFSU); (2) CMAda improves the performance of state-of-the-art models for SFSU significantly by leveraging unlabeled real foggy data. The datasets and code will be made publicly available.

[1]  Alan Conrad Bovik,et al.  Referenceless Prediction of Perceptual Fog Density and Perceptual Image Defogging , 2015, IEEE Transactions on Image Processing.

[2]  Antonio M. López,et al.  The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Vishal M. Patel,et al.  Joint Transmission Map Estimation and Dehazing Using Deep Networks , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Luc Van Gool,et al.  Learning Driving Models with a Surround-View Camera System and a Route Planner , 2018, ArXiv.

[5]  Taesung Park,et al.  CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.

[6]  Luc Van Gool,et al.  Metric imitation by manifold transfer for efficient vision applications , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yong Xu,et al.  Review of Video and Image Defogging Algorithms and Related Studies on Image Restoration and Enhancement , 2016, IEEE Access.

[8]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[9]  Heidrun Belzner,et al.  Image based fog detection in vehicles , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[10]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Frédo Durand,et al.  A Fast Approximation of the Bilateral Filter Using a Signal Processing Approach , 2006, International Journal of Computer Vision.

[12]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[13]  Michael F. Cohen,et al.  Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[14]  Shree K. Nayar,et al.  Contrast Restoration of Weather Degraded Images , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[16]  Eric Dumont,et al.  Nighttime Visibility Analysis and Estimation Method in the Presence of Dense Fog , 2015, IEEE Transactions on Intelligent Transportation Systems.

[17]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[18]  Shai Avidan,et al.  Non-local Image Dehazing , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Jian Sun,et al.  Guided Image Filtering , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Li Xu,et al.  Mutual-Structure for Joint Filtering , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21]  F. Durand,et al.  Flash photography enhancement via intrinsic relighting , 2004, ACM Trans. Graph..

[22]  Sergiu Nedevschi,et al.  Exponential Contrast Restoration in Fog Conditions for Driving Assistance , 2015, IEEE Transactions on Intelligent Transportation Systems.

[23]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Xiao Lu,et al.  Learning deep transmission network for single image dehazing , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[25]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[26]  Razvan-Catalin Miclea,et al.  Visibility Detection in Foggy Environment , 2015, 2015 20th International Conference on Control Systems and Computer Science.

[27]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Luc Van Gool,et al.  Semantic Foggy Scene Understanding with Synthetic Data , 2017, International Journal of Computer Vision.

[29]  Swami Sankaranarayanan,et al.  Learning from Synthetic Data: Addressing Domain Shift for Semantic Segmentation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30]  Ketan Tang,et al.  Investigating Haze-Relevant Features in a Learning Framework for Image Dehazing , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Mario Fritz,et al.  Sequential Bayesian Model Update under Structured Scene Prior for Semantic Road Scenes Labeling , 2013, 2013 IEEE International Conference on Computer Vision.

[32]  Luc Van Gool,et al.  Dark Model Adaptation: Semantic Image Segmentation from Daytime to Nighttime , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[33]  Mohan M. Trivedi,et al.  Vision for Looking at Traffic Lights: Issues, Survey, and Perspectives , 2016, IEEE Transactions on Intelligent Transportation Systems.

[34]  Ian D. Reid,et al.  RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Xrin –Xe,et al.  DEPARTMENT OF COMMERCE National Oceanic and Atmospheric Administration , 2017 .

[36]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[37]  Slobodan Ilic,et al.  Classification of images in fog and fog-free scenes for use in vehicles , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[38]  Jean-Philippe Tarel,et al.  Improved visibility of road scene images under heterogeneous fog , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[39]  Raanan Fattal,et al.  Dehazing Using Color-Lines , 2014, ACM Trans. Graph..

[40]  Peter Kontschieder,et al.  The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41]  Ko Nishino,et al.  Bayesian Defogging , 2012, International Journal of Computer Vision.

[42]  Rich Caruana,et al.  Model compression , 2006, KDD '06.

[43]  L. M. Bergasa,et al.  Fog detection system based on computer vision techniques , 2009, 2009 12th International IEEE Conference on Intelligent Transportation Systems.

[44]  Ian D. Reid,et al.  RefineNet : MultiPath Refinement Networks with Identity Mappings for High-Resolution Semantic Segmentation , 2016 .

[45]  Nicolas Hautière,et al.  Towards night fog detection through use of in-vehicle multipurpose cameras , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[46]  Sehoon Ha,et al.  Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques , 2014, ACM Trans. Graph..

[47]  Luc Van Gool,et al.  Domain Adaptive Faster R-CNN for Object Detection in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[48]  Jean-Philippe Tarel,et al.  Vision Enhancement in Homogeneous and Heterogeneous Fog , 2012, IEEE Intelligent Transportation Systems Magazine.

[49]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Jean-Philippe Tarel,et al.  Automatic fog detection and estimation of visibility distance through use of an onboard camera , 2006, Machine Vision and Applications.

[51]  Mohinder Malhotra Single Image Haze Removal Using Dark Channel Prior , 2016 .

[52]  Jason Jeongsuk Yoon,et al.  Fast fog detection for camera based Advanced Driver Assistance Systems , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[53]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Michael S. Brown,et al.  Haze Visibility Enhancement: A Survey and Quantitative Benchmarking , 2016, Comput. Vis. Image Underst..

[55]  Jitendra Malik,et al.  Cross Modal Distillation for Supervision Transfer , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Luc Van Gool,et al.  End-to-End Learning of Driving Models with Surround-View Cameras and Route Planners , 2018, ECCV.

[57]  Robby T. Tan,et al.  Visibility in bad weather from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Roberto Cipolla,et al.  Segmentation and Recognition Using Structure from Motion Point Clouds , 2008, ECCV.

[59]  Yuan-Kai Wang,et al.  Single Image Defogging by Multiscale Depth Fusion , 2014, IEEE Transactions on Image Processing.

[60]  Shree K. Nayar,et al.  Vision and the Atmosphere , 2002, International Journal of Computer Vision.

[61]  Raanan Fattal,et al.  Single image dehazing , 2008, ACM Trans. Graph..

[62]  Ronen Lerner,et al.  Recent progress in road and lane detection: a survey , 2012, Machine Vision and Applications.