A semantic event-detection approach and its application to detecting hunts in wildlife vide

We propose a three-level video-event detection methodology and apply it to animal-hunt detection in wildlife documentaries. The first level extracts color, texture, and motion features, and detects shot boundaries and moving object blobs. The mid-level employs a neural network to determine the object class of the moving object blobs. This level also generates shot descriptors that combine features from the first level and inferences from the mid-level. The shot descriptors are then used by the domain-specific inference process at the third level to detect video segments that match the user defined event model. The proposed approach has been applied to the detection of hunts in wildlife documentaries. Our method can be applied to different events by adapting the classifier at the intermediate level and by specifying a new event model at the highest level. Event-based video indexing, summarization, and browsing are among the applications of the proposed approach.

[1]  Yücel Altunbasak,et al.  Content-based video retrieval and compression: a unified solution , 1997, Proceedings of International Conference on Image Processing.

[2]  Shih-Fu Chang,et al.  Spatio-temporal video search using the object based video representation , 1997, Proceedings of International Conference on Image Processing.

[3]  Sanjeev R. Kulkarni,et al.  Automated analysis and annotation of basketball video , 1997, Electronic Imaging.

[4]  Stephen W. Smoliar,et al.  Content-based video browsing tools , 1995, Electronic Imaging.

[5]  Andrew Lippman,et al.  Models for Automatic Classiication of Video Sequences , 1997 .

[6]  Shih-Fu Chang,et al.  A fully automated content-based video search engine supporting spatiotemporal queries , 1998, IEEE Trans. Circuits Syst. Video Technol..

[7]  Alex Pentland,et al.  Fractal-Based Description of Natural Scenes , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Azriel Rosenfeld,et al.  A Comparative Study of Texture Measures for Terrain Classification , 1975, IEEE Transactions on Systems, Man, and Cybernetics.

[9]  Martin Szummer,et al.  Temporal texture modeling , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[10]  Richard W. Conners,et al.  A Theoretical Comparison of Texture Algorithms , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Andrew Heybey,et al.  I/Browse: the Bellcore video library tool kit , 1996, Electronic Imaging.

[12]  Marco Ceccarelli,et al.  Visual search in a SMASH system , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[13]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[14]  Stephen S. Intille Tracking using a local closed-world assumption : tracking in the football domain , 1994 .

[15]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..

[16]  S. P. Mudur,et al.  Three-dimensional computer vision: a geometric viewpoint , 1993 .

[17]  Forouzan Golshani,et al.  Motion recovery for video content classification , 1995, TOIS.

[18]  Remi Depommier,et al.  Content-based browsing of video sequences , 1994, MULTIMEDIA '94.

[19]  M. Ibrahim Sezan,et al.  A robust real-time face tracking algorithm , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[20]  Yoshinao Aoki,et al.  Indexing of baseball telecast for content-based video retrieval , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[21]  R. Wilcox Introduction to Robust Estimation and Hypothesis Testing , 1997 .

[22]  Brendan J. Frey,et al.  Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[23]  Giridharan Iyengar,et al.  Models for automatic classification of video sequences , 1997, Electronic Imaging.

[24]  Nirupam Sarkar,et al.  Improved fractal geometry based texture segmentation technique , 1993 .

[25]  James M. Keller,et al.  Texture description and segmentation through fractal geometry , 1989, Comput. Vis. Graph. Image Process..

[26]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[27]  Alberto Del Bimbo,et al.  A Spatial Logic for Symbolic Description of Image Contents , 1994, J. Vis. Lang. Comput..

[28]  M. Smith,et al.  Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[29]  Boon-Lock Yeo,et al.  Analysis And Presentation Of Soccer Highlights From Digital Video , 1995 .

[30]  N. Haering,et al.  Locating deciduous trees , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[31]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[32]  Niels da Vitoria Lobo,et al.  Features and Classification Methods to Locate Deciduous Trees in Images , 1999, Comput. Vis. Image Underst..

[33]  Niels Haering,et al.  A framework for the design of event detectors , 1999 .

[34]  Nuno Vasconcelos,et al.  A Bayesian framework for semantic content characterization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[35]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[36]  Joseph Naor,et al.  Multiple Resolution Texture Analysis and Classification , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  B. S. Manjunath,et al.  Content-based search of video using color, texture, and motion , 1997, Proceedings of International Conference on Image Processing.