Multiple Moving Object Detection for Fast Video Content Description in Compressed Domain

Indexing deals with the automatic extraction of information with the objective of automatically describing and organizing the content. Thinking of a video stream, different types of information can be considered semantically important. Since we can assume that the most relevant one is linked to the presence of moving foreground objects, their number, their shape, and their appearance can constitute a good mean for content description. For this reason, we propose to combine both motion information and region-based color segmentation to extract moving objects from an MPEG2 compressed video stream starting only considering low-resolution data. This approach, which we refer to as "rough indexing," consists in processing P-frame motion information first, and then in performing I-frame color segmentation. Next, since many details can be lost due to the low-resolution data, to improve the object detection results, a novel spatiotemporal filtering has been developed which is constituted by a quadric surface modeling the object trace along time. This method enables to effectively correct possible former detection errors without heavily increasing the computational effort.

[1]  King Ngi Ngan,et al.  Video segmentation for content-based coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[2]  John R. Smith,et al.  MPEG-7 multimedia description schemes , 2001, IEEE Trans. Circuits Syst. Video Technol..

[3]  Boon-Lock Yeo,et al.  On the extraction of DC sequence from MPEG compressed video , 1995, Proceedings., International Conference on Image Processing.

[4]  Yo-Sung Ho,et al.  A VOP generation tool: automatic segmentation of moving objects in image sequences based on spatio-temporal information , 1999, IEEE Trans. Circuits Syst. Video Technol..

[5]  R. Venkatesh Babu,et al.  Content-based video retrieval using motion descriptors extracted from compressed domain , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[6]  Sergio A. Velastin,et al.  Pedestrian Detection using MPEG-2 Motion Vectors , 2003 .

[7]  Michael G. Strintzis,et al.  Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Jenny Benois-Pineau,et al.  Retrieval of objects in video by similarity based on graph matching , 2007, Pattern Recognit. Lett..

[9]  Patrick Bouthemy,et al.  A unified approach to shot change detection and camera motion characterization , 1999, IEEE Trans. Circuits Syst. Video Technol..

[10]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[11]  Nozha Boujemaa,et al.  Region-based retrieval: coarse segmentation with fine color signature , 2002, Proceedings. International Conference on Image Processing.

[12]  Fatih Porikli Real-time video object segmentation for MPEG-encoded video sequences , 2004, IS&T/SPIE Electronic Imaging.

[13]  Shih-Fu Chang,et al.  An integrated approach for content-based video object segmentation and retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[14]  Faouzi Kossentini,et al.  Retrieval of video objects by compressed domain shape features , 2000, ICECS 2000. 7th IEEE International Conference on Electronics, Circuits and Systems (Cat. No.00EX445).

[15]  Emile A. Hendriks,et al.  Temporal stabilization of video object segmentation for 3D-TV applications , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[16]  S. Aign,et al.  Overview of the MPEG-4 Standard and Error Resilience Investigations , 1998 .

[17]  Jenny Benois-Pineau,et al.  Suivi et indexation des objets dans des séquences vidéo avec la mise à jour par confirmation rétrograde , 2001 .

[18]  Ferran Marqués,et al.  Region-based representations of image and video: segmentation tools for multimedia services , 1999, IEEE Trans. Circuits Syst. Video Technol..

[19]  Alberto Signoroni,et al.  Interactive segmentation of biomedical images and volumes using connected operators , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[20]  Sanjeev R. Kulkarni,et al.  Rapid estimation of camera motion from compressed video with application to video annotation , 2000, IEEE Trans. Circuits Syst. Video Technol..

[21]  Jenny Benois-Pineau,et al.  Extraction of foreground objects from an MPEG2 video stream in rough-indexing framework , 2003, IS&T/SPIE Electronic Imaging.

[22]  R. A. Vingerhoeds Modelisation et identification en traitement du signal: M. Najim , 1990, Autom..

[23]  Yousry S. El Gamal,et al.  Compressed video indexing based on object motion , 2000, Visual Communications and Image Processing.

[24]  R. Venkatesh Babu,et al.  Video object segmentation: a compressed domain approach , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  O. Sukmarg,et al.  Fast object detection and segmentation in MPEG compressed domain , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[26]  Dominique Barba,et al.  Spatio-temporal segmentation of image sequences for object-oriented low bit-rate image coding , 1996, Signal Process. Image Commun..

[27]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Jenny Benois-Pineau,et al.  A New Method for Region-Based Depth Ordering in a Video Sequence: Application to Frame Interpolation , 2002, J. Vis. Commun. Image Represent..