Automated classification of fauna in seabed photographs: The impact of training and validation dataset size, with considerations for the class imbalance

Abstract Machine learning is rapidly developing as a tool for gathering data from imagery and may be useful in identifying (classifying) visible specimens in large numbers of seabed photographs. Application of an automated classification workflow requires manually identified specimens to be supplied for training and validating the model. These training and validation datasets are generally generated by partitioning the available manual identified specimens; typical ratios of training to validation dataset sizes are 75:25 or 80:20. However, this approach does not facilitate the desired scalability, which would require models to successfully classify specimens in hundreds of thousands to millions of images after training on a relatively small subset of manually identified specimens. A second problem is related to the ‘class imbalance’, where natural community structure means that fewer specimens of rare morphotypes are available for model training. We investigated the impact of independent variation of the training and validation dataset sizes on the performance of a convolutional neural network classifier on benthic invertebrates visible in a very large set of seabed photographs captured by an autonomous underwater vehicle at the Porcupine Abyssal Plain Sustained Observatory. We tested the impact of increasing training dataset size on specimen classification in a single validation dataset, and then tested the impact of increasing validation set size, evaluating ecological metrics in addition to computer vision metrics. Computer vision metrics (recall, precision, F1-score) indicated that classification improved with increasing training dataset size. In terms of ecological metrics, the number of morphotypes recorded increased, while diversity decreased with increasing training dataset size. Variation and bias in diversity metrics decreased with increasing training dataset size. Multivariate dispersion in apparent community composition was reduced, and bias from expert-derived data declined with increasing training dataset size. In contrast, classification success and resulting ecological metrics did not differ significantly with varying validation dataset sizes. Thus, the selection of an appropriate training dataset size is key to ensuring robust automated classifications of benthic invertebrates in seabed photographs, in terms of ecological results, and validation may be conducted on a comparatively small dataset with confidence that similar results will be obtained in a larger production dataset. In addition, our results suggest that automated classification of less common morphotypes may be feasible, providing that the overall training dataset size is sufficiently large. Thus, tactics for reducing class imbalance in the training dataset may produce improvements in the resulting ecological metrics.

[1]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[2]  Qiao Hu,et al.  Automatic plankton image recognition with co-occurrence matrices and Support Vector Machine , 2005 .

[3]  Phil Culverhouse,et al.  Time to automate identification , 2010, Nature.

[4]  Roberto Danovaro,et al.  New High-Tech Flexible Networks for the Monitoring of Deep-Sea Ecosystems. , 2019, Environmental science & technology.

[5]  Jennifer M. Durden,et al.  First network analysis of interspecific associations of abyssal benthic megafauna reveals potential vulnerability of abyssal hill community , 2020, Progress in Oceanography.

[6]  A. Magurran,et al.  Measuring Biological Diversity , 2004 .

[7]  Alexandra Branzan Albu,et al.  Expert, Crowd, Students or Algorithm: who holds the key to deep‐sea imagery ‘big data’ processing? , 2017 .

[8]  Jennifer M. Durden,et al.  Response of deep-sea deposit-feeders to detrital inputs: A comparison of two abyssal time-series sites , 2020, Deep Sea Research Part II: Topical Studies in Oceanography.

[9]  Saul Greenberg,et al.  Three critical factors affecting automated image species recognition performance for camera traps , 2020, Ecology and evolution.

[10]  H. Ruhl,et al.  Differences in the carbon flows in the benthic food webs of abyssal hill and plain habitats , 2017 .

[11]  Jennifer M. Durden,et al.  Comparison of image annotation data generated by multiple investigators for benthic ecology , 2016 .

[12]  Jennifer M. Durden,et al.  A new method for ecological surveying of the abyss using autonomous underwater vehicle photography , 2014 .

[13]  Tim W. Nattkemper,et al.  COATL - a learning architecture for online real-time detection and classification assistance for environmental data , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[14]  Jennifer M. Durden,et al.  Monitoring mosaic biotopes in a marine conservation zone by autonomous underwater vehicle , 2019, Conservation biology : the journal of the Society for Conservation Biology.

[15]  Roberto Danovaro,et al.  Ecological variables for developing a global deep-ocean monitoring and conservation strategy , 2020, Nature Ecology & Evolution.

[16]  Tim W. Nattkemper,et al.  Gear-Induced Concept Drift in Marine Images and Its Effect on Deep Learning Classification , 2020, Frontiers in Marine Science.

[17]  Marti J. Anderson,et al.  Distance‐Based Tests for Homogeneity of Multivariate Dispersions , 2006, Biometrics.

[18]  Tim W. Nattkemper,et al.  Strategies for Tackling the Class Imbalance Problem in Marine Image Classification , 2018, CVAUI/IWCF/MIPPSNA@ICPR.

[19]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[20]  Phil F. Culverhouse,et al.  Fish identification from videos captured in uncontrolled underwater environments , 2016 .

[21]  Michael F. Vardaro,et al.  Global Observing Needs in the Deep Ocean , 2019, Front. Mar. Sci..

[22]  Simone Marini,et al.  Tracking Fish Abundance by Underwater Image Recognition , 2018, Scientific Reports.

[23]  K. R. Clarke,et al.  Non‐parametric multivariate analyses of changes in community structure , 1993 .

[24]  Changshui Zhang,et al.  DeepFish: Accurate underwater live fish recognition with a deep architecture , 2016, Neurocomputing.

[25]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Brian J. Bett,et al.  Effectiveness of a deep-sea cold-water coral Marine Protected Area, following eight years of fisheries closure , 2016 .

[27]  Phil F. Culverhouse,et al.  Automated identification of benthic epifauna with computer vision , 2019, Marine Ecology Progress Series.

[28]  Rebecca L. Selden,et al.  Climate-Driven Shifts in Marine Species Ranges: Scaling from Organisms to Communities. , 2020, Annual review of marine science.

[29]  Jennifer M. Durden,et al.  Subtle variation in abyssal terrain induces significant change in benthic megafaunal abundance, diversity, and community structure , 2020 .

[30]  Ajmal Mian,et al.  Fish species classification in unconstrained underwater environments based on deep learning , 2016 .

[31]  Russell B. Wynn,et al.  Assessing the Repeatability of Automated Seafloor Classification Algorithms, with Application in Marine Protected Area Monitoring , 2020, Remote. Sens..

[32]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[33]  Elizabeth L. Sander,et al.  Rarefaction and extrapolation with Hill numbers: a framework for sampling and estimation in species diversity studies , 2014 .

[34]  Jörg Ontrup,et al.  Use of machine-learning algorithms for the automated detection of cold-water coral habitats: a pilot study , 2009 .

[35]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[36]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[37]  B. Bett,et al.  Biological effects 26 years after simulated deep-sea mining , 2019, Scientific Reports.

[38]  Jennifer M. Durden,et al.  Abyssal hills - hidden source of increased habitat heterogeneity, benthic megafaunal biomass and diversity in the deep sea , 2015 .

[39]  Jennifer M. Durden,et al.  Megafaunal variation in the abyssal landscape of the Clarion Clipperton Zone , 2019, Progress in oceanography.

[40]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Jonas Osterloff,et al.  A computer vision approach for monitoring the spatial and temporal shrimp distribution at the LoVe observatory , 2016 .

[42]  Anne Jordt,et al.  Perspectives in visual imaging for marine biology and ecology: from acquisition to understanding , 2016 .

[43]  Jennifer M. Durden,et al.  Enduring science: Three decades of observing the Northeast Atlantic from the Porcupine Abyssal Plain Sustained Observatory (PAP-SO) , 2021, Progress in Oceanography.

[44]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[45]  David S.M. Billett,et al.  Long-term change in the megabenthos of the Porcupine Abyssal Plain (NE Atlantic) , 2001 .

[46]  Chris Roelfsema,et al.  Towards Automated Annotation of Benthic Survey Images: Variability of Human Experts and Operational Modes of Automation , 2015, PloS one.

[47]  Paolo Menesatti,et al.  Challenges to the assessment of benthic populations and biodiversity as a result of rhythmic behaviour: Video solutions from cabled observatories , 2012 .

[48]  J. Gutt,et al.  Semi-Automated Image Analysis for the Assessment of Megafaunal Densities at the Arctic Deep-Sea Observatory HAUSGARTEN , 2012, PloS one.

[49]  Brian J. Bett,et al.  Temporal variability in phytodetritus and megabenthic activity at the seabed in the deep northeast Atlantic , 2001 .

[50]  Megafauna , 2021, Encyclopedic Dictionary of Archaeology.

[51]  Elizabeth Cook,et al.  Changing coasts: marine aliens and artificial structures , 2012 .

[52]  David S.M. Billett,et al.  Long-term change in the abyssal NE Atlantic: The ‘Amperima Event’ revisited , 2010 .

[53]  Jaime S. Davies,et al.  Marine Litter Distribution and Density in European Seas, from the Shelves to Deep Basins , 2014, PloS one.