Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding

This work addresses the problem of semantic scene understanding under fog. Although marked progress has been made in semantic scene understanding, it is mainly concentrated on clear-weather scenes. Extending semantic segmentation methods to adverse weather conditions such as fog is crucial for outdoor applications. In this paper, we propose a novel method, named Curriculum Model Adaptation (CMAda), which gradually adapts a semantic segmentation model from light synthetic fog to dense real fog in multiple steps, using both labeled synthetic foggy data and unlabeled real foggy data. The method is based on the fact that the results of semantic segmentation in moderately adverse conditions (light fog) can be bootstrapped to solve the same problem in highly adverse conditions (dense fog). CMAda is extensible to other adverse conditions and provides a new paradigm for learning with synthetic data and unlabeled real data. In addition, we present four other main stand-alone contributions: (1) a novel method to add synthetic fog to real, clear-weather scenes using semantic input; (2) a new fog density estimator; (3) a novel fog densification method for real foggy scenes without known depth; and (4) the Foggy Zurich dataset comprising 3808 real foggy images, with pixel-level semantic annotations for 40 images with dense fog. Our experiments show that (1) our fog simulation and fog density estimator outperform their state-of-the-art counterparts with respect to the task of semantic foggy scene understanding (SFSU); (2) CMAda improves the performance of state-of-the-art models for SFSU significantly, benefiting both from our synthetic and real foggy data. The foggy datasets and code are publicly available.

[1]  Robby T. Tan,et al.  Visibility in bad weather from a single image , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Raanan Fattal,et al.  Dehazing Using Color-Lines , 2014, ACM Trans. Graph..

[3]  Roberto Cipolla,et al.  Segmentation and Recognition Using Structure from Motion Point Clouds , 2008, ECCV.

[4]  Eric Dumont,et al.  Nighttime Visibility Analysis and Estimation Method in the Presence of Dense Fog , 2015, IEEE Transactions on Intelligent Transportation Systems.

[5]  Shai Avidan,et al.  Non-local Image Dehazing , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Luc Van Gool,et al.  Domain Adaptive Faster R-CNN for Object Detection in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Martial Hebert,et al.  Watch and learn: Semi-supervised learning of object detectors from videos , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Mario Fritz,et al.  Sequential Bayesian Model Update under Structured Scene Prior for Semantic Road Scenes Labeling , 2013, 2013 IEEE International Conference on Computer Vision.

[9]  Mohan M. Trivedi,et al.  Vision for Looking at Traffic Lights: Issues, Survey, and Perspectives , 2016, IEEE Transactions on Intelligent Transportation Systems.

[10]  Ko Nishino,et al.  Bayesian Defogging , 2012, International Journal of Computer Vision.

[11]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[12]  Jean-Philippe Tarel,et al.  Vision Enhancement in Homogeneous and Heterogeneous Fog , 2012, IEEE Intelligent Transportation Systems Magazine.

[13]  Baining Guo,et al.  Context-aware textures , 2007, TOGS.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Razvan-Catalin Miclea,et al.  Visibility Detection in Foggy Environment , 2015, 2015 20th International Conference on Control Systems and Computer Science.

[16]  Ian D. Reid,et al.  RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[20]  Luc Van Gool,et al.  Real-time 3D Traffic Cone Detection for Autonomous Driving , 2019, 2019 IEEE Intelligent Vehicles Symposium (IV).

[21]  Swami Sankaranarayanan,et al.  Learning from Synthetic Data: Addressing Domain Shift for Semantic Segmentation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Ming-Hsuan Yang,et al.  Learning to Adapt Structured Output Space for Semantic Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Jean-Philippe Tarel,et al.  Automatic fog detection and estimation of visibility distance through use of an onboard camera , 2006, Machine Vision and Applications.

[24]  Tomas Pfister,et al.  Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[26]  Heidrun Belzner,et al.  Image based fog detection in vehicles , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[27]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Frédo Durand,et al.  A Fast Approximation of the Bilateral Filter Using a Signal Processing Approach , 2006, International Journal of Computer Vision.

[29]  Luc Van Gool,et al.  Ensemble Projection for Semi-supervised Image Classification , 2013, 2013 IEEE International Conference on Computer Vision.

[30]  Luc Van Gool,et al.  Metric imitation by manifold transfer for efficient vision applications , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Trevor Darrell,et al.  LSDA: Large Scale Detection through Adaptation , 2014, NIPS.

[32]  Yong Xu,et al.  Review of Video and Image Defogging Algorithms and Related Studies on Image Restoration and Enhancement , 2016, IEEE Access.

[33]  Shree K. Nayar,et al.  Contrast Restoration of Weather Degraded Images , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  F. Durand,et al.  Flash photography enhancement via intrinsic relighting , 2004, ACM Trans. Graph..

[35]  Alan Conrad Bovik,et al.  Referenceless Prediction of Perceptual Fog Density and Perceptual Image Defogging , 2015, IEEE Transactions on Image Processing.

[36]  Antonio M. López,et al.  The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[38]  Alex Bewley,et al.  Incremental Adversarial Domain Adaptation for Continually Changing Environments , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Taesung Park,et al.  CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.

[40]  Jason Jeongsuk Yoon,et al.  Fast fog detection for camera based Advanced Driver Assistance Systems , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[41]  Xiaogang Wang,et al.  Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Michael S. Brown,et al.  Haze Visibility Enhancement: A Survey and Quantitative Benchmarking , 2016, Comput. Vis. Image Underst..

[44]  Philip David,et al.  Domain Adaptation for Semantic Segmentation of Urban Scenes , 2017 .

[45]  G. C. S.,et al.  Beiträge zur Physik der freien Atmosphäre , 1905, Nature.

[46]  Luc Van Gool,et al.  End-to-End Learning of Driving Models with Surround-View Cameras and Route Planners , 2018, ECCV.

[47]  Shree K. Nayar,et al.  Vision and Rain , 2007, International Journal of Computer Vision.

[48]  Michael F. Cohen,et al.  Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[49]  Jian Sun,et al.  Guided Image Filtering , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  L. M. Bergasa,et al.  Fog detection system based on computer vision techniques , 2009, 2009 12th International IEEE Conference on Intelligent Transportation Systems.

[51]  Ian D. Reid,et al.  RefineNet : MultiPath Refinement Networks with Identity Mappings for High-Resolution Semantic Segmentation , 2016 .

[52]  Nicolas Hautière,et al.  Towards night fog detection through use of in-vehicle multipurpose cameras , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[53]  Bobby Bodenheimer,et al.  Synthesis and evaluation of linear motion transitions , 2008, TOGS.

[54]  Li Xu,et al.  Mutual-Structure for Joint Filtering , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[55]  Jean-Philippe Tarel,et al.  Improved visibility of road scene images under heterogeneous fog , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[56]  Peter Kontschieder,et al.  The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[57]  Luc Van Gool,et al.  Semantic Foggy Scene Understanding with Synthetic Data , 2017, International Journal of Computer Vision.

[58]  Mohinder Malhotra Single Image Haze Removal Using Dark Channel Prior , 2016 .

[59]  Yuan-Kai Wang,et al.  Single Image Defogging by Multiscale Depth Fusion , 2014, IEEE Transactions on Image Processing.

[60]  Shree K. Nayar,et al.  Vision and the Atmosphere , 2002, International Journal of Computer Vision.

[61]  Sergiu Nedevschi,et al.  Exponential Contrast Restoration in Fog Conditions for Driving Assistance , 2015, IEEE Transactions on Intelligent Transportation Systems.

[62]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[63]  Xiao Lu,et al.  Learning deep transmission network for single image dehazing , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[64]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, ACM Trans. Graph..

[65]  Kaiming He,et al.  Data Distillation: Towards Omni-Supervised Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[66]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[67]  Xiaochun Cao,et al.  Single Image Dehazing via Multi-scale Convolutional Neural Networks , 2016, ECCV.

[68]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[69]  Vishal M. Patel,et al.  Joint Transmission Map Estimation and Dehazing Using Deep Networks , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[70]  Luc Van Gool,et al.  Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding , 2018, ECCV.

[71]  Ketan Tang,et al.  Investigating Haze-Relevant Features in a Learning Framework for Image Dehazing , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[72]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[73]  Luc Van Gool,et al.  Dark Model Adaptation: Semantic Image Segmentation from Daytime to Nighttime , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[74]  Raanan Fattal,et al.  Single image dehazing , 2008, ACM Trans. Graph..

[75]  Ronen Lerner,et al.  Recent progress in road and lane detection: a survey , 2012, Machine Vision and Applications.

[76]  Jitendra Malik,et al.  Cross Modal Distillation for Supervision Transfer , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[77]  Slobodan Ilic,et al.  Classification of images in fog and fog-free scenes for use in vehicles , 2013, 2013 IEEE Intelligent Vehicles Symposium (IV).

[78]  Rich Caruana,et al.  Model compression , 2006, KDD '06.