Categorization of Underwater Habitats Using Dynamic Video Textures

In this paper, we deal with the problem of categorizing different underwater habitat types. Previous works on solving this categorization problem are mostly based on the analysis of underwater images. In our work, we design a system capable of categorizing underwater habitats based on underwater video content analysis since the temporally correlated information may make contribution to the categorization task. However, the task is very challenging since the underwater scene in the video is continuously varying because of the changing scene and surface conditions, lighting, and the viewpoint. To that end, we investigate the utility of two approaches to underwater video classification: the common spatio-temporal interest points (STIPs) and the video texture dynamic systems, where we model the underwater footage using dynamic textures and construct a categorization framework using the approach of the Bag-of-Systems(BoSs). We also introduce a new underwater video data set, which is composed of more than 100 hours of annotated video sequences. Our results indicate that, for the underwater habitat identification, the dynamic texture approach has multiple benefits over the traditional STIP-based video modeling.

[1]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[2]  Stefan B. Williams,et al.  Active learning using a Variational Dirichlet Process model for pre-clustering and classification of underwater stereo imagery , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[4]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[5]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[6]  Dani Lischinski,et al.  Texture Mixing and Texture Movie Synthesis Using Statistical Learning , 2001, IEEE Trans. Vis. Comput. Graph..

[7]  Nuno Vasconcelos,et al.  Classifying Video with Kernel Dynamic Textures , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  David G. Stork,et al.  Pattern Classification , 1973 .

[9]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[10]  Stefan B. Williams,et al.  Towards autonomous habitat classification using Gaussian Mixture Models , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Luc Van Gool,et al.  An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector , 2008, ECCV.

[12]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[13]  Martin Szummer,et al.  Temporal texture modeling , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[14]  Geoffrey A. Hollinger,et al.  Active Classification: Theory and Application to Underwater Inspection , 2011, ISRR.

[15]  N. Nandhakumar,et al.  Multisensor integration for underwater scene classification , 1991, Conference Proceedings 1991 IEEE International Conference on Systems, Man, and Cybernetics.