Spatial Feedback Learning to Improve Semantic Segmentation in Hot Weather

High-temperature weather conditions induce geometrical distortions in images which can adversely affect the performance of a computer vision model performing downstream tasks such as semantic segmentation. The performance of such models has been shown to improve by adding a restoration network before a semantic segmentation network. The restoration network removes the geometrical distortions from the images and shows improved segmentation results. However, this approach suffers from a major architectural drawback that is the restoration network does not learn directly from the errors of the segmentation network. In other words, the restoration network is not task aware. In this work, we propose a semantic feedback learning approach, which improves the task of semantic segmentation giving a feedback response into the restoration network. This response works as an attend and fix mechanism by focusing on those areas of an image where restoration needs improvement. Also, we proposed loss functions: Iterative Focal Loss (iFL) and Class-Balanced Iterative Focal Loss (CB-iFL), which are specifically designed to improve the performance of the feedback network. These losses focus more on those samples that are continuously miss-classified over successive iterations. Our approach gives a gain of 17.41 mIoU over the standard segmentation model, including the additional gain of 1.9 mIoU with CB-iFL on the Cityscapes dataset.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  D. Fried Probability of getting a lucky short-exposure image through turbulence* , 1978 .

[3]  Vincent Lepetit,et al.  Training a Feedback Loop for Hand Pose Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[4]  Eduardo Romera,et al.  ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation , 2018, IEEE Transactions on Intelligent Transportation Systems.

[5]  Yoav Y. Schechner,et al.  Turbulence-induced 2D correlated image distortion , 2017, 2017 IEEE International Conference on Computational Photography (ICCP).

[6]  Zhou He-qin A Novel Physics-based Method for Restoration of Foggy Day Images , 2008 .

[7]  Raoul de Charette,et al.  Physics-Based Rendering for Improving Robustness to Rain , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[9]  Hyojin Kim,et al.  Learning to Focus and Track Extreme Climate Events , 2019, BMVC.

[10]  Danping Zou,et al.  Simultaneous video defogging and stereo reconstruction , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[12]  I. Dror,et al.  RESTORATION OF ATMOSPHERICALLY BLURRED IMAGES ACCORDING TO WEATHER-PREDICTED ATMOSPHERIC MODULATION TRANSFER FUNCTIONS , 1997 .

[13]  Jenq-Neng Hwang,et al.  DesnowNet: Context-Aware Deep Network for Snow Removal , 2017, IEEE Transactions on Image Processing.

[14]  Lin Sun,et al.  Feedback Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Lihi Zelnik-Manor,et al.  Adversarial Feedback Loop , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Yang Song,et al.  Class-Balanced Loss Based on Effective Number of Samples , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  C. V. Jawahar,et al.  Munich to Dubai: How far is it for Semantic Segmentationƒ , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Xiang Zhu,et al.  Removing Atmospheric Turbulence via Space-Invariant Deconvolution , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shunli Zhang,et al.  Residual Multiscale Based Single Image Deraining , 2019, BMVC.

[21]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  C. V. Jawahar,et al.  IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments , 2018, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[24]  Larry S. Davis,et al.  ACE: Adapting to Changing Environments for Semantic Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[25]  Angelica I. Avilés-Rivero,et al.  RainFlow: Optical Flow Under Rain Streaks and Rain Veiling Effect , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[26]  Andrew Zisserman,et al.  Recurrent Human Pose Estimation , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[27]  Roberto Ragazzoni,et al.  Adaptive-optics corrections available for the whole sky , 2000, Nature.

[28]  Wei Wu,et al.  Feedback Network for Image Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[30]  Jitendra Malik,et al.  Human Pose Estimation with Iterative Error Feedback , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Jian Yang,et al.  Importance-Aware Semantic Segmentation for Autonomous Driving System , 2017, IJCAI.

[32]  Yuan Xie,et al.  Distortion-driven Turbulence Effect Removal using Variational Model , 2014, ArXiv.

[33]  Shree K. Nayar,et al.  Vision in bad weather , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[34]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[35]  Delu Zeng,et al.  Removing Rain from Single Images via a Deep Detail Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).