Improving Underwater Obstacle Detection using Semantic Image Segmentation

This paper presents two novel approaches for improving image-based underwater obstacle detection by combining sparse stereo point clouds with monocular semantic image segmentation. Generating accurate image-based obstacle maps in cluttered underwater environments, such as coral reefs, is essential for robust robotic path planning and navigation. However, these maps are degraded by factors including poor visibility, variable lighting and dynamic objects (e.g. fish), which may lead to falsely identified free space or to transient objects that trajectory planners react to undesirably. We propose combining feature-based stereo matching with learning-based segmentation to produce a more robust obstacle map. The two approaches are direct binary learning of the presence or absence of underwater obstacles, and multiclass learning that classifies their distance in the scene (near, mid and far). We also show an enhancement to the binary map that includes depth information from sparse stereo matching to produce 3D obstacle maps of the scene. Performance is evaluated using field data collected in cluttered and, at times, visually degraded coral reef environments. The results show improved image-wide obstacle detection, rejection of transient objects (such as fish), and range estimation compared to using feature-based sparse or dense stereo point clouds alone.
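As a rough illustration of the idea of combining a binary obstacle mask with sparse stereo depth, the following minimal sketch keeps only the sparse stereo points that fall on pixels labelled as obstacle, then bins their ranges into near, mid and far. It is not the method of the paper: the function names, the `(u, v)` pixel layout and the range thresholds `near_max` and `mid_max` are all hypothetical, chosen only for illustration.

```python
import numpy as np


def filter_sparse_points(mask, pixels, depths):
    """Keep sparse stereo points whose pixel lies on a predicted obstacle.

    mask   : (H, W) boolean array, True where segmentation predicts an obstacle.
    pixels : (N, 2) array of (u, v) = (column, row) image coordinates.
    depths : (N,) array of stereo ranges in metres.
    All names and conventions here are assumptions, not taken from the paper.
    """
    cols = pixels[:, 0].astype(int)
    rows = pixels[:, 1].astype(int)
    h, w = mask.shape
    # Points outside the image cannot be checked against the mask, so drop them.
    in_image = (rows >= 0) & (rows < h) & (cols >= 0) & (cols < w)
    keep = np.zeros(len(pixels), dtype=bool)
    keep[in_image] = mask[rows[in_image], cols[in_image]]
    return pixels[keep], depths[keep]


def range_class(depths, near_max=1.0, mid_max=3.0):
    """Bin ranges into 0 = near, 1 = mid, 2 = far (hypothetical thresholds)."""
    return np.digitize(depths, [near_max, mid_max])


if __name__ == "__main__":
    mask = np.zeros((4, 4), dtype=bool)
    mask[1, 2] = True  # a single obstacle pixel at row 1, column 2
    pixels = np.array([[2, 1], [0, 0], [10, 10]])
    depths = np.array([0.5, 2.0, 4.0])
    kept_pixels, kept_depths = filter_sparse_points(mask, pixels, depths)
    print(kept_pixels, kept_depths, range_class(depths))
```

Under these assumptions, a stereo feature matched on a fish would be discarded whenever the segmentation labels its pixel as non-obstacle, which is one simple way the two sources of information could complement each other.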
