MODS - A USV-oriented object detection and obstacle segmentation benchmark

Small unmanned surface vehicles (USVs) are coastal water devices with a broad range of applications such as environmental control and surveillance. A crucial capability for autonomous operation is obstacle detection for timely reaction and collision avoidance, which has recently been explored in the context of camera-based visual scene interpretation. Owing to curated datasets, substantial advances in scene interpretation have been made in the related field of unmanned ground vehicles. However, current maritime datasets do not adequately capture the complexity of real-world USV scenes, and the evaluation protocols are not standardised, which makes cross-paper comparison of methods difficult and hinders progress. To address these issues, we introduce a new obstacle detection benchmark, MODS, which considers two major perception tasks: maritime object detection and the more general maritime obstacle segmentation. We present a new, diverse maritime evaluation dataset containing approximately 81k stereo images synchronized with an on-board IMU, with over 60k objects annotated. We propose a new obstacle segmentation performance evaluation protocol that reflects the detection accuracy in a way meaningful for practical USV navigation. Seventeen recent state-of-the-art object detection and obstacle segmentation methods are evaluated using the proposed protocol, creating a benchmark to facilitate development of the field.
