Improving plankton image classification using context metadata

Advances in both hardware and software are enabling rapid proliferation of in situ plankton imaging methods, requiring more effective machine learning approaches to image classification. Deep Learning methods, such as convolutional neural networks (CNNs), show marked improvement over traditional feature-based supervised machine learning algorithms, but require careful optimization of hyperparameters and adequate training sets. Here, we document some best practices in applying CNNs to zooplankton andmarine snow images and note where our results differ from contemporary Deep Learning findings in other domains. We boost the performance of CNN classifiers by incorporating metadata of different types and illustrate how to assimilate metadata beyond simple concatenation. We utilize both geotemporal (e.g., sample depth, location, time of day) and hydrographic (e.g., temperature, salinity, chlorophyll a) metadata and show that either type by itself, or both combined, can substantially reduce error rates. Incorporation of contextmetadata also boosts performance of the feature-based classifiers we evaluated: RandomForest, Extremely Randomized Trees, Gradient Boosted Classifier, Support Vector Machines, and Multilayer Perceptron. For our assessments, we use an original data set of 350,000 in situ images (roughly 50%marine snow and 50% nonsnow sorted into 26 categories) from a novel in situ Zooglider.We document asymptotically increasing performance with more computationally intensive techniques, such as substantially deeper networks and artificially augmented data sets. Our best model achieves 92.3% accuracy with our 27-class data set. We provide guidance for further refinements that are likely to provide additional gains in classifier accuracy. The burgeoning number of digital imaging methods available to aquatic ecologists, both in situ (Davis et al. 1992; Samson et al. 2001; Benfield et al. 2003; Watson 2004; Olson and Sosik 2007; Cowen and Guigland 2008; Picheral et al. 2010; Schulz et al. 2010; Thompson et al. 2012; Briseño-Avena et al. 2015; Ohman et al. 2018) and in the laboratory (Sieracki et al. 1998; Gorsky et al. 2010), is generating rapidly expanding libraries of digital images useful in a variety of scientific applications. However, the accumulation of large numbers of images increases the need for much more efficient machine learning methods in order to automate the processes of image classification, data extraction, and analysis. Until recently, most automated image classification has employed methods we refer to as “feature-based,” in that they operate on a set of descriptive geometric features calculated from the digital images, such as area, shape, aspect ratio, fractal dimension, textures, and grayscale histograms (e.g. Peura and Iivarinen 1997). The feature-based algorithms then derive a mapping from the calculated values to labels corresponding to the type of organism. Ideally, thismapping will extrapolate to future images. Some of the feature-based algorithms that have been applied to classification of plankton images with varying degrees of success include Random Forest (Grosjean et al. 2004; Gorsky et al. 2010), support vector machines (SVMs) (Hu and Davis 2005; Sosik and Olson 2007; Ellen et al. 2015), andmultilayer perceptron (MLP) (Wilkins et al. 1996), among others. Since 2012, “Deep Learning” algorithms (Krizhevsky et al. 2012; LeCun and Ranzato 2013; LeCun et al. 2015) have outperformed feature-based classifiers in a variety of fields, including natural language processing (Socher et al. 2013), time series analysis (Graves et al. 2013), variational autoencoders (algorithms that learn to generate or alter existing data, such as image correction; Kingma and Welling 2013), plankton image analysis (Orenstein et al. 2015; Dai et al. 2016; Dieleman et al. 2016b; Graff and Ellen 2016; Wang et al. 2016; Zheng et al. 2017; Orenstein and Beijbom 2017; Luo et al. 2018), et al. Multiple algorithms have been characterized as examples of Deep Learning, the commonality being the use of repetitive layers of algorithmic structure that operate on the prior layers rather than the *Correspondence: jellen@ucsd.edu Present address: Department of Computer Science, University of California, Irvine, Irvine, California This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Antonio Criminisi,et al.  Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning , 2012, Found. Trends Comput. Graph. Vis..

[3]  Guangrong Ji,et al.  ZooplanktoNet: Deep convolutional network for zooplankton classification , 2016, OCEANS 2016 - Shanghai.

[4]  Jessica Y. Luo,et al.  A Tale of Two Crowds: Public Engagement in Plankton Classification , 2017, Front. Mar. Sci..

[5]  Scott M. Gallager,et al.  Semi‐automated image analysis for the identification of bivalve larvae from a Cape Cod estuary , 2012 .

[6]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[7]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[8]  Matthew J. Hausknecht,et al.  Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[10]  Scott Samson,et al.  A system for high-resolution zooplankton imaging , 2001 .

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  D. Hubel,et al.  Shape and arrangement of columns in cat's striate cortex , 1963, The Journal of physiology.

[13]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[14]  R. Cowen,et al.  In situ ichthyoplankton imaging system (ISIIS): system design and preliminary results , 2008 .

[15]  D. Hubel Single unit activity in striate cortex of unrestrained cats , 1959, The Journal of physiology.

[16]  Amin Sarafraz,et al.  Automated plankton image analysis using convolutional neural networks , 2018, Limnology and Oceanography: Methods.

[17]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  J. Wallace,et al.  A Pacific Interdecadal Climate Oscillation with Impacts on Salmon Production , 1997 .

[19]  Nan Wang,et al.  Automatic plankton image classification combining multiple view features via multiple kernel learning , 2017, BMC Bioinformatics.

[20]  Masakazu Matsugu,et al.  Subject independent facial expression recognition with robust face detection using a convolutional neural network , 2003, Neural Networks.

[21]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[22]  R. Feely,et al.  Autonomous Ocean Measurements in the California Current Ecosystem , 2013 .

[23]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[24]  R. Davis,et al.  Zooglider: An autonomous vehicle for optical and acoustic sensing of zooplankton , 2018, Limnology and Oceanography: Methods.

[25]  J. R. Pomerantz,et al.  A century of Gestalt psychology in visual perception: II. Conceptual and theoretical foundations. , 2012, Psychological bulletin.

[26]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[27]  O. Zielinski,et al.  Imaging of plankton specimens with the lightframe on-sight keyspecies investigation (LOKI) system , 2010 .

[28]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[29]  M. Wertheimer Untersuchungen zur Lehre von der Gestalt. II , 1923 .

[30]  Charles S. Yentsch,et al.  An imaging-in-flow system for automated analysis of marine microplankton , 1998 .

[31]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[32]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[33]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[34]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[35]  Thomas Pock,et al.  Convolutional Networks for Shape from Light Field , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[37]  Gerhard Widmer,et al.  Learning in the presence of concept drift and hidden contexts , 2004, Machine Learning.

[38]  Philippe Grosjean,et al.  Enumeration, measurement, and identification of net zooplankton samples using the ZOOSCAN digital imaging system , 2004 .

[39]  Sander Dieleman,et al.  Rotation-invariant convolutional neural networks for galaxy morphology prediction , 2015, ArXiv.

[40]  Deng Cai,et al.  Deep Rotation Equivariant Network , 2017, Neurocomputing.

[41]  Eduard H. Hovy,et al.  Recursive Deep Models for Discourse Parsing , 2014, EMNLP.

[42]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[43]  Anne-Laure Boulesteix,et al.  Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics , 2012, WIREs Data Mining Knowl. Discov..

[44]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[45]  G. Gorsky,et al.  The Underwater Vision Profiler 5: An advanced instrument for high spatial resolution studies of particle size spectra and zooplankton , 2010 .

[46]  A Graff Casey,et al.  Correlating Filter Diversity with Convolutional Neural Network Accuracy , 2016 .

[47]  M. Trevorrow,et al.  Measurement of Zooplankton Distributions with a High-Resolution Digital Camera System , 2003 .

[48]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[49]  Guangrong Ji,et al.  Multi features combination for automated zooplankton classification , 2016, OCEANS 2016 - Shanghai.

[50]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[51]  R. Olson,et al.  A submersible imaging‐in‐flow instrument to analyze nano‐and microplankton: Imaging FlowCytobot , 2007 .

[52]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[53]  Oscar Beijbom,et al.  Transfer Learning and Deep Feature Extraction for Planktonic Image Data Sets , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[54]  Marc Picheral,et al.  Digital zooplankton image analysis using the ZooScan integrated system , 2010 .

[55]  Fei-Fei Li,et al.  Improving Image Classification with Location Context , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[56]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[57]  Leslie N. Smith,et al.  A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay , 2018, ArXiv.

[58]  C. Davis The Video Plankton Recorder (VPR) : Design and initial results , 1992 .

[59]  Jules S. Jaffe,et al.  ZOOPS- O2: A broadband echosounder with coordinated stereo optical imaging for observing plankton in situ , 2015 .

[60]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[61]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[62]  Robert J. Olson,et al.  Automated taxonomic classification of phytoplankton sampled with imaging‐in‐flow cytometry , 2007 .

[63]  Nishio Takayuki,et al.  Deep Learning Tutorial , 2018 .

[64]  S. Palmer,et al.  A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. , 2012, Psychological bulletin.

[65]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[66]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[67]  Yoshua Bengio,et al.  Practical Recommendations for Gradient-Based Training of Deep Architectures , 2012, Neural Networks: Tricks of the Trade.

[68]  Lynne Boddy,et al.  A comparison of some neural and non-neural methods for identification of phytoplankton from flow cytometry data , 1996, Comput. Appl. Biosci..

[69]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[70]  Qiao Hu,et al.  Automatic plankton image recognition with co-occurrence matrices and Support Vector Machine , 2005 .

[71]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[72]  J. Díez,et al.  Validation methods for plankton image classification systems , 2017 .

[73]  Michael P. Rogers Python Tutorial , 2009 .

[74]  Sergey Ioffe,et al.  Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning , 2016, AAAI.

[75]  Xiu Li,et al.  Deep residual networks for plankton classification , 2016, OCEANS 2016 MTS/IEEE Monterey.

[76]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[77]  Heidi M. Sosik,et al.  WHOI-Plankton- A Large Scale Fine Grained Visual Recognition Benchmark Dataset for Plankton Classification , 2015, ArXiv.

[78]  Md. Moniruzzaman,et al.  Deep Learning on Underwater Marine Object Detection: A Survey , 2017, ACIVS.

[79]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[80]  M. Ohman,et al.  CCE IV: El Niño-related zooplankton variability in the southern California Current System , 2018, Deep Sea Research Part I: Oceanographic Research Papers.

[81]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[82]  Hongyu Li,et al.  Quantifying California current plankton samples with efficient machine learning techniques , 2015, OCEANS 2015 - MTS/IEEE Washington.

[83]  Davide Castelvecchi,et al.  Can we open the black box of AI? , 2016, Nature.

[84]  James F. Blinn Jim Blinn's corner: dirty pixels , 1998 .