Fish species classification in underwater video monitoring using Convolutional Neural Networks

This report presents a case study for automatic fish species classification in underwater video monitoring of fish passes. Although the presented approach is based on the FishCam monitoring system, it can be used with any video-based monitoring system. The presented classification scheme in this study, is based on Convolutional Neural Networks that do not require the calculation of any hand-engineered image features. Instead, these networks use the raw video image as input. Additionally, this study investigates, if the classification accuracy can be increased by adding additional meta-information (date of migration and fish length) to the network. The approach is tested on a subset of 10 fish species (8099 individuals) occurring in Austrian river. On an independent test set, the presented approach achieves a classification accuracy of 93 %.

[1]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[2]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3]  Shahrul Azman Mohd Noah,et al.  Fish Recognition Based on Robust Features Extraction from Size and Shape Measurements Using Neural Network , 2010 .

[4]  Robert B. Fisher,et al.  Automatic fish classification for underwater species behavior understanding , 2010, ARTEMIS '10.

[5]  R. Kastner,et al.  Automated Techniques for Detection and Recognition of Fishes using Computer Vision Algorithms , 2012 .

[6]  Dah-Jye Lee,et al.  Contour matching for a fish recognition and migration-monitoring system , 2004, SPIE Optics East.

[7]  Andrew Rova,et al.  One Fish, Two Fish, Butterfish, Trumpeter: Recognizing Fish in Underwater Video , 2007, MVA.

[8]  King-Sun Fu,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[10]  Abdel Karim Baareh A Hybrid Memetic Algorithm (Genetic Algorithm and Tabu Local Search) with Back-Propagation Classifier for Fish Recognition , 2013 .

[11]  Jonathan Tompson,et al.  Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation , 2014, NIPS.

[12]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[13]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.