Applying convolutional networks to underwater tracking without training

Underwater target tracking is one of the most important tasks in recent upsurge of smart aquaculture, especially in the application of AI-driven cage culture. In [1] CNT was shown quite effective in the task of land-based object tracking, it is fast as no huge amount of data is required for training. However, an initial bounding box must be selected manually in CNT for tracking a single object, not to mention the fact that applications of CNT and other convolutional networks in underwater operations are rarely reported. In this paper we present an improved version of CNT (called Fast-CNT2) which is capable of performing underwater multi-target tracking. Firstly, GMM is applied to the input video for extracting regions each containing a moving target (e.g. fish); then a respective region containing a fish is identified and bounded with a box; finally, multi-target tracking is implemented with Fast-CNT2. Experimental results show that our method can successfully track multiple fish.

[1]  Xiaogang Wang,et al.  STCT: Sequentially Training Convolutional Networks for Visual Tracking , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[3]  Qingshan Liu,et al.  Robust Visual Tracking via Convolutional Networks Without Training , 2015, IEEE Transactions on Image Processing.

[4]  Jung-Hua Wang,et al.  Fast Visual Tracking Based on Convolutional Networks , 2018, Sensors.

[5]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[6]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Harry Gifford Hierarchical k-Means for Unsupervised Learning , 2013 .

[8]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[9]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.