Fish Keypoints Detection for Ecology Monitoring Based on Underwater Visual Intelligence

This paper introduces a fishery ecology monitoring system for cultivation pools, and proposes a new stereo keypoint detection method followed by curve fitting analysis to estimate the fish posture and length. The system, which can be employed for aquaculture monitoring, is featured by its exploitation of underwater visual intelligence and deep neural-network architecture. As input, stereo image pairs are obtained by underwater binocular camera. A deep neural-network under Faster R-CNN architecture is built to detect fish from the stereo image inputs. Another network under Stacked Hourglass architecture is constructed to detect specific keypoints of each fish. For ecology monitoring, detected keypoints are used in the estimation of the fishes' posture and length. Unlike other size estimation methods which also apply a binocular camera, our method naturally bypasses the pixel-wise matching difficulty in global stereo matching algorithms. Experiment shows that our system is applicable for online fish ecology monitoring, with efficient and accurate estimation performance.

[1]  Xing Mei,et al.  On building an accurate stereo matching system on graphics hardware , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[2]  Kai Chen,et al.  MMDetection: Open MMLab Detection Toolbox and Benchmark , 2019, ArXiv.

[3]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Jean-Thierry Lapresté,et al.  Dry camera calibration for underwater applications , 2003, Machine Vision and Applications.

[6]  Francesca Antonucci,et al.  Automated sorting for size, sex and skeletal anomalies of cultured seabass using external shape analysis , 2013 .

[7]  Aníbal Ollero,et al.  Computer vision and robotics techniques in fish farms , 2003, Robotica.

[8]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Yuning Jiang,et al.  Acquisition of Localization Confidence for Accurate Object Detection , 2018, ECCV.

[10]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Heiko Hirschmüller,et al.  Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Tal Hassner,et al.  Facial Landmark Detection with Tweaked Convolutional Neural Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Pietro Perona,et al.  Benchmarking and Error Diagnosis in Multi-instance Pose Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[15]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[17]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Larry S. Davis,et al.  Soft-NMS — Improving Object Detection with One Line of Code , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).