Nonparametric motion characterization using causal probabilistic models for video indexing and retrieval

This paper describes an original approach for content-based video indexing and retrieval. We aim at providing a global interpretation of the dynamic content of video shots without any prior motion segmentation and without any use of dense optic flow fields. To this end, we exploit the spatio-temporal distribution, within a shot, of appropriate local motion-related measurements derived from the spatio-temporal derivatives of the intensity function. These distributions are then represented by causal Gibbs models. To be independent of camera movement, the motion-related measurements are computed in the image sequence generated by compensating the estimated dominant image motion in the original sequence. The statistical modeling framework considered makes the exact computation of the conditional likelihood of a video shot belonging to a given motion or more generally to an activity class feasible. This property allows us to develop a general statistical framework for video indexing and retrieval with query-by-example. We build a hierarchical structure of the processed video database according to motion content similarity. This results in a binary tree where each node is associated to an estimated causal Gibbs model. We consider a similarity measure inspired from Kullback-Leibler divergence. Then, retrieval with query-by-example is performed through this binary tree using the maximum a posteriori (MAP) criterion. We have obtained promising results on a set of various real image sequences.

[1]  K. Pahlavan,et al.  Texture Modeling by Multiple Pairwise Pixel Interactions , 1996 .

[2]  Patrick Bouthemy,et al.  A unified approach to shot change detection and camera motion characterization , 1999, IEEE Trans. Circuits Syst. Video Technol..

[3]  Nuno Vasconcelos,et al.  A probabilistic architecture for content-based image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[4]  Patrick Bouthemy,et al.  Motion characterization from temporal cooccurrences of local motion-based measures for video indexing , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[5]  Wei Xiong,et al.  Query by video clip , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[6]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[7]  Chahab Nastar,et al.  Efficient query refinement for image retrieval , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[8]  Michal Irani,et al.  Detecting and Tracking Multiple Moving Objects Using Temporal Integration , 1992, ECCV.

[9]  Edoardo Ardizzone,et al.  Automatic Video Database Indexing and Retrieval , 2004, Multimedia Tools and Applications.

[10]  Randal C. Nelson,et al.  Qualitative recognition of motion using temporal texture , 1992, CVGIP Image Underst..

[11]  M. Basseville Distance measures for signal processing and pattern recognition , 1989 .

[12]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Alberto Del Bimbo,et al.  Visual information retrieval , 1999 .

[14]  Patrick Bouthemy,et al.  Scene Segmentation and Image Feature Extraction for Video Indexing and Retrieval , 1999, VISUAL.

[15]  Thierry Pun,et al.  Correspondence analysis and hierarchical indexing for content-based image retrieval , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[16]  Patrick Pérez,et al.  Statistical motion-based video indexing and retrieval , 2000, RIAO.

[17]  Haluk Derin,et al.  Video Data Compression for Multimedia Computing , 1997 .

[18]  Stephen W. Smoliar,et al.  An integrated system for content-based video retrieval and browsing , 1997, Pattern Recognit..

[19]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[20]  S. Suzuki,et al.  Feature extraction of temporal texture based on spatiotemporal motion trajectory , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[21]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[22]  Song-Chun Zhu,et al.  Filters, Random Fields and Maximum Entropy (FRAME): Towards a Unified Theory for Texture Modeling , 1998, International Journal of Computer Vision.

[23]  C. Stiller,et al.  Estimating motion in image sequences , 1999, IEEE Signal Process. Mag..

[24]  Patrick Bouthemy,et al.  Motion-Based Feature Extraction and Ascendant Hierarchical Classification for Video Indexing and Retrieval , 1999, VISUAL.

[25]  J. Odobez,et al.  Separation of Moving Regions from Background in an Image Sequence Acquired with a Mobil Camera , 1997 .

[26]  Patrick Bouthemy,et al.  Determining a Structured Spatio-Temporal Representation of Video Content for Efficient Visualization and Indexing , 1998, ECCV.

[27]  Martin Szummer,et al.  Temporal texture modeling , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[28]  Patrick Bouthemy,et al.  Multimodal Estimation of Discontinuous Optical Flow using Markov Random Fields , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Patrick Bouthemy,et al.  Non-Parametric Motion Activity Analysis for Statistical Retrieval with Partial Query , 2004, Journal of Mathematical Imaging and Vision.

[30]  Patrick Bouthemy,et al.  Moving object detection in color image sequences using region-level graph labeling , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[31]  R. Brunelli,et al.  A Survey on the Automatic Indexing of Video Data, , 1999, J. Vis. Commun. Image Represent..

[32]  Paul A. Viola,et al.  Texture recognition using a non-parametric multi-scale statistical model , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[33]  Georgy L. Gimel'farb,et al.  Texture Modelling by Multiple Pairwise Pixel , 2008 .

[34]  John C. Dalton,et al.  Hierarchical browsing and search of large image databases , 2000, IEEE Trans. Image Process..

[35]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Gérard Govaert,et al.  Clustering in Pattern Recognition , 1981 .

[37]  Nuno Vasconcelos,et al.  Statistical models of video structure for content analysis and characterization , 2000, IEEE Trans. Image Process..

[38]  Rangasami L. Kashyap,et al.  Models for motion-based video indexing and retrieval , 2000, IEEE Trans. Image Process..

[39]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Michal Irani,et al.  Video indexing based on mosaic representations , 1998, Proc. IEEE.

[41]  Haim Schweitzer,et al.  Organizing image databases as visual-content search trees , 1999, Image Vis. Comput..

[42]  Boon-Lock Yeo,et al.  Extracting story units from long programs for video browsing and navigation , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[43]  Dragutin Petkovic,et al.  Content-based representation and retrieval of visual media: A state-of-the-art review , 1996, Multimedia Tools and Applications.

[44]  Charles A. Bouman,et al.  ViBE: a new paradigm for video database browsing and search , 1998, Proceedings. IEEE Workshop on Content-Based Access of Image and Video Libraries (Cat. No.98EX173).

[45]  Ronan Fablet Modelisation statistique non parametrique et reconnaissance du mouvement dans des sequences d'images ; application a l'indexation video , 2001 .

[46]  Patrick Bouthemy,et al.  Computation and analysis of image motion: A synopsis of current problems and methods , 1996, International Journal of Computer Vision.

[47]  A. Murat Tekalp,et al.  Effective content representation for video , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[48]  L. Younes Estimation and annealing for Gibbsian fields , 1988 .

[49]  Jean-Marc Odobez,et al.  Robust Multiresolution Estimation of Parametric Motion Models , 1995, J. Vis. Commun. Image Represent..

[50]  Jonathan D. Courtney Automatic video indexing via object motion analysis , 1997, Pattern Recognit..