An instance segmentation framework for in-situ plankton taxa assessment

In this paper, we propose a deep learning instance segmentation framework for particle extraction of microscopic images that aims at calculating planktonic species distribution and concentration in-situ. The framework comprises three essential functional tasks on in-situ time-series images collected from an autonomous underwater vehicle: 1) manual labeling of the captured images, 2) object localization, segmentation, and identification, and 3) class distribution and planktonic organisms concentration calculation. Our proposed framework is based on the mask R-CNN architecture provided by the Detectron2 library developed by Facebook Artificial Intelligence Research (FAIR) for instance segmentation. Due to its modular design, we compare the performance of different networks by alternating the backbone sub-network in order to choose the most suitable architecture for the task of instance and semantic segmentation. We compile a custom annotated dataset from planktonic time-series images and train the different models over this dataset to perform the instance semantic segmentation. Evaluation results of the proposed framework, utilizing the best performing deep learning architecture along with the new annotated dataset, show better performance in terms of speed and accuracy of both in-situ segmentation and classification compared to traditional segmentation methods. In addition, we observe a significant improvement in the object classification quality when we train the model over our newly annotated dataset instead of training it over the dataset generated from the traditional methods. The inferred data from our novel instance segmentation framework, which provides the particle class distribution and concentration, can then be used to assist in constructing a dynamic probability density map of planktonic communities dispersion and abundance.

[1]  D. Zhang,et al.  Robust mean-shift tracking with corrected background-weighted histogram , 2012 .

[2]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Conny Aerts,et al.  Correcting for background changes in CoRoT exoplanet data , 2008 .

[4]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[5]  Zezhi Chen,et al.  Self-adaptive Gaussian mixture model for urban traffic monitoring system , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[6]  Annette Stahl,et al.  Automatic in-situ instance and semantic segmentation of planktonic organisms using Mask R-CNN , 2020, Global Oceans 2020: Singapore – U.S. Gulf Coast.

[7]  Qingrong Zhang,et al.  An Image Segmentation Algorithm in Image Processing Based on Threshold Segmentation , 2007, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System.

[8]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[9]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[11]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Robert J. Olson,et al.  Automated taxonomic classification of phytoplankton sampled with imaging‐in‐flow cytometry , 2007 .

[14]  Arne Johannes Holmin,et al.  StoX: An open source software for marine survey analyses , 2019, Methods in Ecology and Evolution.

[15]  Alan M. McIvor,et al.  Background Subtraction Techniques , 2000 .

[16]  Ying-Tung Hsiao,et al.  A contour based image segmentation algorithm using morphological edge detection , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[17]  Astha Baxi,et al.  A Review on Otsu Image Segmentation Algorithm , 2013 .

[18]  K. Rajan,et al.  Advancing Ocean Observation with an AI-Driven Mobile Robotic Explorer , 2020 .

[19]  Pierre F. J. Lermusiaux,et al.  Uncertainty estimation and prediction for interdisciplinary ocean dynamics , 2006, J. Comput. Phys..

[20]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[21]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Emmanuelle Gouillart,et al.  scikit-image: image processing in Python , 2014, PeerJ.

[23]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[24]  Heidi M. Sosik,et al.  2014 labeled IFCB images , 2010 .

[25]  Zhenhua Guo,et al.  A Semi-Automated Image Analysis Procedure for In Situ Plankton Imaging Systems , 2015, PloS one.

[26]  P. C. Reida,et al.  The Continuous Plankton Recorder : concepts and history , from Plankton Indicator to undulating recorders , 2003 .

[27]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[29]  E J Davies,et al.  The use of wide-band transmittance imaging to size and classify suspended particulate matter in seawater. , 2017, Marine pollution bulletin.

[30]  Josef Kittler,et al.  Automatic watershed segmentation of randomly textured color images , 1997, IEEE Trans. Image Process..