RiWNet: A moving object instance segmentation Network being Robust in adverse Weather conditions

Segmenting each moving object instance in a scene is essential for many applications. But like many other computer vision tasks, this task performs well in optimal weather, but then adverse weather tends to fail. To be robust in weather conditions, the usual way is to train network in data of given weather pattern or to fuse multiple sensors. We focus on a new possibility, that is, to improve its resilience to weather interference through the network’s structural design. First, we propose a novel FPN structure called RiWFPN with a progressive topdown interaction and attention refinement module. RiWFPN can directly replace other FPN structures to improve the robustness of the network in non-optimal weather conditions. Then we extend SOLOV2 to capture temporal information in video to learn motion information, and propose a moving object instance segmentation network with RiWFPN called RiWNet. Finally, in order to verify the effect of moving instance segmentation in different weather disturbances, we propose a VKTTI-moving dataset which is a moving instance segmentation dataset based on the VKTTI dataset, taking into account different weather scenes such as rain, fog, sunset, morning as well as overcast. The experiment proves how RiWFPN improves the network’s resilience to adverse weather effects compared to other FPN structures. We compare RiWNet to several other state-of-theart methods in some challenging datasets, and RiWNet shows better performance especially under adverse weather conditions.

[1]  Tao Kong,et al.  SOLOv2: Dynamic and Fast Instance Segmentation , 2020, NeurIPS.

[2]  Bin Luo,et al.  Permutation Preference Based Alternate Sampling and Clustering for Motion Segmentation , 2018, IEEE Signal Processing Letters.

[3]  Martin Jägersand,et al.  MODNet: Motion and Appearance based Moving Object Detection Network for Autonomous Driving , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[4]  Felix Heide,et al.  Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Luc Van Gool,et al.  Night-to-Day Image Translation for Retrieval-based Localization , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[6]  Chi-Wing Fu,et al.  Depth-Attentional Features for Single-Image Rain Removal , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Erik G. Learned-Miller,et al.  A Detailed Rubric for Motion Segmentation , 2016, ArXiv.

[8]  Yun Zhang,et al.  DymSLAM: 4D Dynamic Scene Reconstruction Based on Geometrical Motion Segmentation , 2020, IEEE Robotics and Automation Letters.

[9]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Soon Ki Jung,et al.  Unsupervised Moving Object Detection in Complex Scenes Using Adversarial Regularizations , 2021, IEEE Transactions on Multimedia.

[11]  Felix Heide,et al.  Pixel-Accurate Depth Evaluation in Realistic Driving Scenarios , 2019, 2019 International Conference on 3D Vision (3DV).

[12]  Enric Gil Esteller,et al.  Microsoft Word-Worsening Perception_v2_MASTER.docx , 2021 .

[13]  Naila Murray,et al.  Virtual KITTI 2 , 2020, ArXiv.

[14]  Jitendra Malik,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis , 2022 .

[15]  Ardhendu Behera,et al.  Unsupervised Monocular Depth Estimation for Night-time Images using Adversarial Domain Feature Adaptation , 2020, ECCV.

[16]  Lourdes Agapito,et al.  MaskFusion: Real-Time Recognition, Tracking and Reconstruction of Multiple Moving Objects , 2018, 2018 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[17]  Wilhelm Stork,et al.  CNN-Based Lidar Point Cloud De-Noising in Adverse Weather , 2020, IEEE Robotics and Automation Letters.

[18]  David Suter,et al.  Motion Segmentation of RGB-D Sequences: Combining Semantic and Motion Information Using Statistical Inference , 2020, IEEE Transactions on Image Processing.

[19]  Xun Xu,et al.  3D Rigid Motion Segmentation with Mixed and Unknown Number of Models , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Klaus Dietmayer,et al.  Robust Semantic Segmentation in Adverse Weather Conditions by means of Sensor Data Fusion , 2019, 2019 22th International Conference on Information Fusion (FUSION).

[21]  Wei Wang,et al.  DV-LOAM: Direct Visual LiDAR Odometry and Mapping , 2021, Remote. Sens..

[22]  V. Khanaa,et al.  An Advanced Moving Object Detection Algorithm for Automatic Traffic Monitoring In Real-World Limited Bandwidth Networks , 2015 .

[23]  Horst Bischof,et al.  Robustness of Object Detectors in Degrading Weather Conditions , 2021, 2021 IEEE International Intelligent Transportation Systems Conference (ITSC).

[24]  Quoc V. Le,et al.  NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Ning Xu,et al.  YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark , 2018, ArXiv.

[26]  Shuicheng Yan,et al.  Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[27]  Klaus Dietmayer,et al.  Robust Semantic Segmentation in Adverse Weather Conditions by means of Fast Video-Sequence Segmentation , 2020, 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC).

[28]  Xi Zhao,et al.  Motion Segmentation Based on Model Selection in Permutation Space for RGB Sensors , 2019, Sensors.

[29]  Fei Wu,et al.  FcaNet: Frequency Channel Attention Networks , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Thomas S. Huang,et al.  Non-Local Recurrent Network for Image Restoration , 2018, NeurIPS.

[31]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[32]  Zaïd Harchaoui,et al.  Object Discovery in Videos as Foreground Motion Clustering , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  S. Kolski,et al.  Detection, prediction, and avoidance of dynamic obstacles in urban environments , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[35]  Nicu Sebe,et al.  Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[36]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[38]  Victor Vaquero,et al.  FuseMODNet: Real-Time Camera and LiDAR Based Moving Object Detection for Robust Low-Light Autonomous Driving , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[39]  Hayder Radha,et al.  Multiscale Domain Adaptive YOLO for Cross-Domain Object Detection , 2021, ICIP 2021.

[40]  Ding Liu,et al.  Pyramid Attention Networks for Image Restoration , 2020, ArXiv.

[41]  Dong Liu,et al.  High-Resolution Representations for Labeling Pixels and Regions , 2019, ArXiv.

[42]  Michal Irani,et al.  Separating Signal from Noise Using Patch Recurrence across Scales , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Alan Yuille,et al.  DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution , 2020, ArXiv.

[44]  L. Gool,et al.  Semantic Understanding of Foggy Scenes with Purely Synthetic Data , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[45]  Jun Liu,et al.  Object Detection based on OcSaFPN in Aerial Images with Noise , 2020, ArXiv.

[46]  Markus Lienkamp,et al.  A Deep Learning-based Radar and Camera Sensor Fusion Architecture for Object Detection , 2019, 2019 Sensor Data Fusion: Trends, Solutions, Applications (SDF).

[47]  Liangliang Cao,et al.  Automatic Adaptation of Object Detectors to New Domains Using Self-Training , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Yun Zhang,et al.  Quantized Residual Preference Based Linkage Clustering for Model Selection and Inlier Segmentation in Geometric Multi-Model Fitting , 2020, Sensors.

[49]  Dong Liu,et al.  Deep High-Resolution Representation Learning for Human Pose Estimation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Jun Liu,et al.  U2-ONet: A Two-Level Nested Octave U-Structure Network with a Multi-Scale Attention Mechanism for Moving Object Segmentation , 2020, Remote. Sens..

[51]  Maxim Likhachev,et al.  Motion planning in urban environments: Part I , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[52]  Ling Shao,et al.  Submodular Trajectories for Better Motion Segmentation in Videos , 2018, IEEE Transactions on Image Processing.

[53]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[55]  Ling Shao,et al.  See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  In So Kweon,et al.  VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[57]  Shu Liu,et al.  Path Aggregation Network for Instance Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[58]  Kai Chen,et al.  Feature Pyramid Grids , 2020, ArXiv.

[59]  Qinmu Peng,et al.  Automatic Video Object Segmentation Based on Visual and Motion Saliency , 2019, IEEE Transactions on Multimedia.

[60]  Deva Ramanan,et al.  Towards Segmenting Anything That Moves , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[61]  Erik G. Learned-Miller,et al.  The Best of Both Worlds: Combining CNNs and Geometric Constraints for Hierarchical Motion Segmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[62]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.