Video Object Relevance Metrics for Overall Segmentation Quality Evaluation

Video object segmentation is a task that humans perform efficiently and effectively but that remains difficult for a computer. Since video segmentation plays an important role in many emerging applications, such as those enabled by the MPEG-4 and MPEG-7 standards, the ability to assess segmentation quality in view of the application targets is a relevant task for which no standard, or even consensual, solution is available. This paper considers the evaluation of the overall quality of segmentation partitions, highlighting one of its major components: the contextual relevance of the segmented objects. Video object relevance metrics are presented that take into account the behaviour of the human visual system and visual attention mechanisms. In particular, contextual relevance evaluation takes into account the context in which an object is found, exploiting, for instance, its contrast to its neighbours or its position in the image. Most of the relevance metrics proposed in this paper can also be used in contexts other than segmentation quality evaluation, such as object-based rate control algorithms, description creation, or image and video quality evaluation.
