One Fish, Two Fish, Butterfish, Trumpeter: Recognizing Fish in Underwater Video

This paper presents a deformable template object recognition method for classifying fish species in underwater video. This method can be a component of a system that automatically identifies fish by species, improving upon previous works which only detect and track fish and those that rely on significant inter-species shape variations or special equipment. Our method works with video shot by a standard uncalibrated camera in a natural setting rather than the calibrated stereo cameras and man-made imaging environments described in other publications. We use deformable template matching which employs an efficient combination of shape contexts and large-scale spatial structure preservation. Experimental results demonstrate the improvement of deformable template matching over raw SVM texturebased classification.

[1]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[2]  Yali Amit,et al.  Joint Induction of Shape Features and Tree Classifiers , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[4]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[5]  Lawrence M. Dill,et al.  FOOD AVAILABILITY AND TIGER SHARK PREDATION RISK INFLUENCE BOTTLENOSE DOLPHIN HABITAT USE , 2002 .

[6]  Christof Koch,et al.  Detection and tracking of objects in underwater video , 2004, CVPR 2004.

[7]  Björn Stenger,et al.  Shape context and chamfer matching in cluttered scenes , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8]  François Michaud,et al.  Intelligent system for automated fish sorting and counting , 2000, Proceedings. 2000 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2000) (Cat. No.00CH37113).

[9]  Jitendra Malik,et al.  Efficient shape matching using shape contexts , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[11]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[12]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[13]  Mario Fernando Montenegro Campos,et al.  Particle Filter-Based Predictive Tracking for Robust Fish Counting , 2005, XVIII Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI'05).

[14]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[16]  Nigel J. B. McFarlane,et al.  Estimating Dimensions of Free-Swimming Fish Using 3D Point Distribution Models , 2000, Comput. Vis. Image Underst..

[17]  Dah-Jye Lee,et al.  An automated fish species classification and migration monitoring system , 2003, IECON'03. 29th Annual Conference of the IEEE Industrial Electronics Society (IEEE Cat. No.03CH37468).

[18]  Daniel P. Huttenlocher,et al.  Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[19]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[20]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[21]  N. J. C. Strachan,et al.  Recognition of fish species by colour and shape , 1993, Image Vis. Comput..

[22]  Jitendra Malik,et al.  Shape matching and object recognition using low distortion correspondences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).