On Robustness of Robotic and Autonomous Systems Perception

We propose an evaluation framework that emulates poor image exposure conditions, low-range image sensors, lossy compression, as well as noise types which are common in robot vision. We present a rigorous evaluation of the robustness of several high-level image recognition models and investigate their performance under distinct image distortions. On one hand, F1 score shows that the majority of CNN models are slightly affected by mild exposure, strong compression, and Poisson Noise. On the other hand, there is a large decrease in precision and accuracy in extreme misexposure, impulse noise, or signal-dependent noise. Using the proposed framework, we obtain a detailed evaluation of a variety of traditional image distortions, typically found in robotics and automated systems pipelines, provides insights and guidance for further development. We propose a pipeline-based approach to mitigate the adverse effects of image distortions by including an image pre-processing step which intends to estimate the proper exposure and reduce noise artifacts. Moreover, we explore the impacts of the image distortions on the segmentation task, a task that plays a primary role in autonomous navigation, obstacle avoidance, object picking and other robotics tasks.

[1]  Ruigang Yang,et al.  The ApolloScape Open Dataset for Autonomous Driving and Its Application , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Rodrigo Zelir Azzolin,et al.  Automated seam tracking system based on passive monocular vision for automated linear robotic welding process , 2017, INDIN.

[3]  Shingo Mabu,et al.  A Visual System of Citrus Picking Robot Using Convolutional Neural Networks , 2018, 2018 5th International Conference on Systems and Informatics (ICSAI).

[4]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Paulo Drews,et al.  A Low Cost System to Optimize Pesticide Application Based on Mobile Technologies and Computer Vision , 2018, 2018 Latin American Robotic Symposium, 2018 Brazilian Symposium on Robotics (SBR) and 2018 Workshop on Robotics in Education (WRE).

[6]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Andreas Geiger,et al.  MOTS: Multi-Object Tracking and Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Pascal Frossard,et al.  Analysis of classifiers’ robustness to adversarial perturbations , 2015, Machine Learning.

[9]  Benjamin Recht,et al.  Do ImageNet Classifiers Generalize to ImageNet? , 2019, ICML.

[10]  Aitor Saenz-Aguirre,et al.  Semantic Segmentation to Develop an Indoor Navigation System for an Autonomous Mobile Robot , 2020 .

[11]  Alexander G. Schwing,et al.  MaskRNN: Instance Level Video Object Segmentation , 2018, NIPS.

[12]  Javier Ruiz-del-Solar,et al.  RoboCup@Home: Analysis and results of evolving competitions for domestic and service robots , 2015, Artif. Intell..

[13]  Silvia Silva da Costa Botelho,et al.  CNN-Based Luminance And Color Correction For ILL-Exposed Images , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[14]  Yanhui Duan,et al.  Grasp Pose Detection with Affordance-based Task Constraint Learning in Single-view Point Clouds , 2020, J. Intell. Robotic Syst..

[15]  Paulo Drews,et al.  Vision-Based Obstacle Avoidance Using Deep Learning , 2016, 2016 XIII Latin American Robotics Symposium and IV Brazilian Robotics Symposium (LARS/SBR).

[16]  Jian Sun,et al.  ExFuse: Enhancing Feature Fusion for Semantic Segmentation , 2018, ECCV.

[17]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[18]  N. Kasabov,et al.  Speckle Noise Removal in Image-based Detection of Refractive Index Changes in Porous Silicon Microarrays , 2019, Scientific Reports.

[19]  Sekou A.K. Diane,et al.  Multi-Aspect Environment Mapping with a Group of Mobile Robots , 2019, 2019 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus).

[20]  Chun-Hsu Ko,et al.  Development of a Comfort-Based Motion Guidance System for a Robot Walking Helper , 2020, J. Intell. Robotic Syst..

[21]  Lei Zhang,et al.  Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[22]  Yuyu Tian,et al.  Detection and segmentation of overlapped fruits based on optimized mask R-CNN application in apple harvesting robot , 2020, Comput. Electron. Agric..

[23]  Hossam E. Abd El Munim,et al.  LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation , 2019, 2019 Digital Image Computing: Techniques and Applications (DICTA).

[24]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Bin Wang,et al.  Siamese-ResNet: Implementing Loop Closure Detection based on Siamese Network , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[26]  Santwana Sagnika,et al.  A Comparative Study on Approaches to Speckle Noise Reduction in Images , 2015, 2015 International Conference on Computational Intelligence and Networks.

[27]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Wei Xie,et al.  Convolutional neural network based obstacle detection for unmanned surface vehicle. , 2019, Mathematical biosciences and engineering : MBE.

[29]  Qi Tian,et al.  The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking , 2018, ECCV.

[30]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[31]  Nicolas Dobigeon,et al.  Bayesian Image Restoration under Poisson Noise and Log-concave Prior , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Chongruo Wu,et al.  ResNeSt: Split-Attention Networks , 2020, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[33]  Koichi Ito,et al.  Recent advances in biometrie security: A case study of liveness detection in face recognition , 2017, 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).

[34]  Mario Fernando Montenegro Campos,et al.  Real-time monocular obstacle avoidance using Underwater Dark Channel Prior , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[35]  Alberto J. Rosales-Silva,et al.  High-Precision Visual-Tracking using the IMM Algorithm and Discrete GPI Observers (IMM-DGPIO) , 2020, Journal of Intelligent & Robotic Systems.

[36]  Keigo Hirakawa,et al.  Improved Denoising via Poisson Mixture Modeling of Image Sensor Noise , 2017, IEEE Transactions on Image Processing.

[37]  Silvia Silva da Costa Botelho,et al.  Seam tracking and welding bead geometry analysis for autonomous welding robot , 2017, 2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR).

[38]  Emmanuelle Gouillart,et al.  scikit-image: image processing in Python , 2014, PeerJ.

[39]  Cordelia Schmid,et al.  AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40]  Lin Li,et al.  Altitude Information Acquisition of UAV Based on Monocular Vision and MEMS , 2020, J. Intell. Robotic Syst..

[41]  Michael S. Brown,et al.  Noise Flow: Noise Modeling With Conditional Normalizing Flows , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[42]  Puneet Kohli,et al.  Enabling Pedestrian Safety using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash , 2018, Lecture Notes in Networks and Systems.

[43]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[44]  Andrew Zisserman,et al.  The AVA-Kinetics Localized Human Actions Video Dataset , 2020, ArXiv.

[45]  Jordi Pont-Tuset,et al.  The Open Images Dataset V4 , 2018, International Journal of Computer Vision.

[46]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[47]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Kaiming He,et al.  Mask R-CNN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[49]  Lucas Ricardo Vieira Messias,et al.  Can Exposure, Noise and Compression Affect Image Recognition? An Assessment of the Impacts on State-of-the-Art ConvNets , 2019, 2019 Latin American Robotics Symposium (LARS), 2019 Brazilian Symposium on Robotics (SBR) and 2019 Workshop on Robotics in Education (WRE).

[50]  Jianxiong Xiao,et al.  DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[51]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[52]  Hyoungkwan Kim,et al.  Image retrieval using BIM and features from pretrained VGG network for indoor localization , 2018, Building and Environment.

[53]  Joan Bruna,et al.  Intriguing properties of neural networks , 2013, ICLR.

[54]  Michael Lindenbaum,et al.  Increasing CNN Robustness to Occlusions by Reducing Filter Support , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[55]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Hui Zhou,et al.  Robust Multi-Modality Multi-Object Tracking , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[57]  Jahid Ali,et al.  A Comparative Study of Various Types of Image Noise and Efficient Noise Removal Techniques , 2013 .

[58]  Neil D. B. Bruce,et al.  On the Robustness of Deep Learning Models to Universal Adversarial Attack , 2018, 2018 15th Conference on Computer and Robot Vision (CRV).

[59]  Priyanka Kokil,et al.  Additive White Gaussian Noise Level Estimation for Natural Images Using Linear Scale-Space Features , 2020, Circuits Syst. Signal Process..

[60]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[61]  Jie Hu,et al.  A Novel Camera Fusion Method Based on Switching Scheme and Occlusion-Aware Object Detection for Real-Time Robotic Grasping , 2020, Journal of Intelligent & Robotic Systems.

[62]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[63]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[64]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[65]  Lasitha Piyathilaka,et al.  Human Activity Recognition for Domestic Robots , 2013, FSR.

[66]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[67]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[68]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[69]  Martin Molina,et al.  A Collaborative Approach for Surface Inspection Using Aerial Robots and Computer Vision , 2018, Sensors.