An investigation on deep learning approaches for diatoms classification

Diatoms are one of the largest groups of microalgae present in marine, freshwater and transitional environments and their reactivity to environmental changes makes them suitable to be employed as biomarkers for monitoring tasks. Anyway, their presence in a large number of species makes it arduous to perform diatoms taxonomy during monitoring tasks considering that, to date, analysis is conducted by marine biologists on the basis of their own experience and, hence, in a subjective way. Hence, the need for automatic and objective methodologies for the identification and classification of diatoms samples rises. Research efforts in the field of Computer Vision led to a plethora of highly effective deep learning strategies surpassing human capabilities for image classification, as showed in the recent Imagenet challenge editions where they were initially introduced. Despite the very promising results of the proposed solutions, the difficulty arises to determine which technique is most suitable among them for real tasks and in particular for diatoms classification. This work proposes an end-to-end pipeline for automatic recognition of diatoms, acquired by means of holographic microscopy in water samples, employing deep learning techniques. In particular the most recently introduced Convolution Neural Networks (CNNs) architectures have been deeply investigated and compared in order to highlight the pros and cons of each of them. Moreover, in order to feed the CNNs training stages with a suitable amount of labeled data, a strategy to build a synthetic dataset, starting from a single image per class available from commercial glass slides specifically prepared for taxonomy purposes, is introduced. Besides, models ensembling strategies, in order to improve the single model scores, have been exploited. Finally, the proposed approach has been validated employing a dataset built up of holographic images of diatoms sampled in natural water bodies.

[1]  Saso Dzeroski,et al.  Hierarchical classification of diatom images using ensembles of predictive clustering trees , 2012, Ecol. Informatics.

[2]  Patrick Mäder,et al.  Combining high-throughput imaging flow cytometry and deep learning for efficient species and life-cycle stage identification of phytoplankton , 2018, BMC ecology.

[3]  Michael Unser,et al.  Deep Convolutional Neural Network for Inverse Problems in Imaging , 2016, IEEE Transactions on Image Processing.

[4]  A. Ozcan,et al.  Deep learning enables cross-modality super-resolution in fluorescence microscopy , 2018, Nature Methods.

[5]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[6]  Jürgen Schmidhuber,et al.  Highway Networks , 2015, ArXiv.

[7]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[8]  Gabriel Cristóbal,et al.  Automated Diatom Classification (Part A): Handcrafted Feature Approaches , 2017 .

[9]  Hai Su,et al.  Deep Learning in Microscopy Image Analysis: A Survey , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Vittorio Bianco,et al.  Learning Diatoms Classification from a Dry Test Slide by Holographic Microscopy , 2020, Sensors.

[12]  Kaiming He,et al.  Designing Network Design Spaces , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[14]  Swapna A. Jaywant,et al.  A Comprehensive Review of Microfluidic Water Quality Monitoring Sensors , 2019, Sensors.

[15]  Janice L. Pappas,et al.  Legendre shape descriptors and shape group determination of specimens in the Cymbella cistula species complex , 2003 .

[16]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[17]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Hyun-seok Min,et al.  Quantitative Phase Imaging and Artificial Intelligence: A Review , 2018, IEEE Journal of Selected Topics in Quantum Electronics.

[19]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Horst Bunke,et al.  Diatom identification: a double challenge called ADIAC , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[22]  Gabriel Cristóbal,et al.  Automated Diatom Classification (Part B): A Deep Learning Approach , 2017 .

[23]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[24]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[25]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[27]  William Graf,et al.  Deep learning for cellular image analysis , 2019, Nature Methods.

[28]  Rita Strack,et al.  Deep learning in imaging , 2018, Nature Methods.