Gear-Induced Concept Drift in Marine Images and Its Effect on Deep Learning Classification

In marine research, image data sets from the same area but collected at different times allow seafloor fauna communities to be monitored over time. However, ongoing technological developments have led to the use of different imaging systems and deployment strategies. Thus, instances of the same class exhibit slightly shifted visual features in images taken at slightly different locations or with different gear. These shifts are referred to as concept drift in the domains computational image analysis and machine learning as this phenomenon poses particular challenges for these fields. In this paper, we analyse four different data sets from an area in the Peru Basin and show how changes in imaging parameters affect the classification of 12 megafauna morphotypes with a 34-layer ResNet. Images were captured using the ocean floor observation system, a traditional sled-based system, or an autonomous underwater vehicle, which is used as an imaging platform capable of surveying larger regions. ResNet applied on separate individual data sets, i.e., without concept drift, showed that changing object distance was less important than the amount of training data. The results for the image data acquired with the ocean floor observation system showed higher performance values than data collected with the autonomous underwater vehicle. The results from this concept drift studies indicate that collecting image data from many dives with slightly different gear may result in training data well-suited for learning taxonomic classification tasks and that data volume can compensate for light concept drift.

[1]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[2]  Mykola Pechenizkiy,et al.  An Overview of Concept Drift Applications , 2016 .

[3]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[4]  Simone Palazzo,et al.  Quantitative performance analysis of object detection algorithms on underwater video footage , 2012, MAED '12.

[5]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[6]  Autun Purser,et al.  Seafloor images from the Peru Basin Disturbance and Colonization (DISCOL) area collected during SO242/2 , 2018 .

[7]  Phil F. Culverhouse,et al.  Automated identification of benthic epifauna with computer vision , 2019, Marine Ecology Progress Series.

[8]  Brett Hosking,et al.  On the impact of Citizen Science-derived data quality on deep learning based classification in marine images , 2019, PloS one.

[9]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[10]  Min Chen,et al.  Selection-based resampling ensemble algorithm for nonstationary imbalanced stream data learning , 2019, Knowl. Based Syst..

[11]  Autun Purser,et al.  Seabed photographs taken along OFOS profile SO242/2_233-1 during SONNE cruise SO242/2 , 2018 .

[12]  J. Gutt,et al.  Semi-Automated Image Analysis for the Assessment of Megafaunal Densities at the Arctic Deep-Sea Observatory HAUSGARTEN , 2012, PloS one.

[13]  J. C. Schlimmer,et al.  Incremental learning from noisy data , 2004, Machine Learning.

[14]  A. Purser,et al.  Megafauna community assessment of polymetallic-nodule fields with cameras: platform and methodology comparison , 2019, Biogeosciences.

[15]  Autun Purser,et al.  Seabed photographs taken along OFOS profile SO242/2_174-1 during SONNE cruise SO 242/2 , 2018 .

[16]  Autun Purser,et al.  Ocean Floor Observation and Bathymetry System (OFOBS): A New Towed Camera/Sonar System for Deep-Sea Habitat Surveys , 2019, IEEE Journal of Oceanic Engineering.

[17]  B. Bett,et al.  Autonomous Underwater Vehicles (AUVs): Their past, present and future contributions to the advancement of marine geoscience , 2014 .

[18]  David J. Kriegman,et al.  Automated annotation of coral reef survey images , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Yu Sun,et al.  Concept Drift Adaptation by Exploiting Historical Knowledge , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[20]  James G. Bellingham,et al.  AUTONOMOUS UNDERWATER VEHICLES (AUVs) , 2001 .

[21]  Roberto Danovaro,et al.  New High-Tech Flexible Networks for the Monitoring of Deep-Sea Ecosystems. , 2019, Environmental science & technology.

[22]  Roberto Souto Maior de Barros,et al.  A large-scale comparison of concept drift detectors , 2018, Inf. Sci..

[23]  Tim W. Nattkemper,et al.  Strategies for Tackling the Class Imbalance Problem in Marine Image Classification , 2018, CVAUI/IWCF/MIPPSNA@ICPR.

[24]  Carlos Eduardo Cancino Chacón,et al.  Strategies for Conceptual Change in Convolutional Neural Networks , 2017, ArXiv.

[25]  Liming Chen,et al.  Discriminative Transfer Learning Using Similarities and Dissimilarities , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[26]  Gerhard Widmer,et al.  Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.

[27]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[28]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[29]  Pål Buhl-Mortensen,et al.  Computer vision enables short- and long-term analysis of Lophelia pertusa polyp behaviour and colour from an underwater observatory , 2019, Scientific Reports.

[30]  Olav Rune Godø,et al.  The LoVe Ocean Observatory is in Operation , 2014 .

[31]  Cyril Rakovski,et al.  Optimal Multi-Stage Arrhythmia Classification Approach , 2020, Scientific Reports.

[32]  Tim W Nattkemper,et al.  MAIA—A machine learning assisted image annotation method for environmental monitoring and exploration , 2018, PloS one.

[33]  Adam F. Gobi,et al.  Towards generalized benthic species recognition and quantification using computer vision , 2010, OCEANS'10 IEEE SYDNEY.

[34]  O. Pizarro,et al.  Automated species detection: An experimental approach to kelp detection from sea-oor AUV images , 2012 .

[35]  Dominique Pelletier,et al.  Underwater video techniques for observing coastal marine biodiversity: A review of sixty years of publications (1952–2012) , 2014 .

[36]  Hui Yang,et al.  Dynamical Casimir–Polder force on a two-level atom with superposition state in a cavity comprising a dielectric , 2020, Scientific Reports.

[37]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[38]  João Gama,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[39]  Tim W. Nattkemper,et al.  BIIGLE 2.0 - Browsing and Annotating Large Marine Image Collections , 2017, Front. Mar. Sci..

[40]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[41]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[42]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[43]  Autun Purser,et al.  Seabed photographs taken along OFOS profile SO242/2_155-1 during SONNE cruise SO242/2 , 2018 .

[44]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[45]  Kevin Köser,et al.  An acquisition, curation and management workflow for sustainable, terabyte-scale marine image analysis , 2018, Scientific Data.

[46]  Robert B. Fisher,et al.  Deep Image Representations for Coral Image Classification , 2019, IEEE Journal of Oceanic Engineering.

[47]  Brendan J. Godley,et al.  Camera technology for monitoring marine biodiversity and human impact , 2016 .

[48]  Matthew Dunbabin,et al.  Automated Counting of the Northern Pacific Sea Star in the Derwent Using Shape Recognition , 2007, 9th Biennial Conference of the Australian Pattern Recognition Society on Digital Image Computing Techniques and Applications (DICTA 2007).