Extension of the forward-backward motion compensation scheme for MPEG coded sequences: a sub-space approach

A different predictive view for the efficient coding of a given video. is proposed in this paper. Each image in a sequence can be seen as a vector in a hyperspace and the whole video as a curve, each point of the curve representing a given frame in the video. The whole video can be reconstructed from its video-samples: any image in the hyperspace can be obtained by means of a reconstruction algorithm, in analogy with the reconstruction of an analog signal from its samples. For this aim an appropriate interpolating kernel function should be used due to the multi-dimensional nature of the problem. To obtain the predicted image only the description of its position in the vector hyperspace is needed, i.e. by the vector of distances of the image at hand and the selected video samples (key-frames). A video key-frames codebook is used to synthesize video; it should allow the reconstruction of a good quality prediction of each image to be transmitted with poor requirements on the side motion information. Coding information reduces to the position in the hyperspace of the image to be predicted, the construction of the predicted image to the problem of the efficient multidimensional interpolation of key-frames. The focus of this paper is on the analysis phase of a given video sequence. Preliminary results are presented.

[1]  Narendra Ahuja,et al.  Segmentation and Factorization-Based Motion and Structure Estimation for Long Image Sequences , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Ferran Marqués,et al.  Region-based representations of image and video: segmentation tools for multimedia services , 1999, IEEE Trans. Circuits Syst. Video Technol..

[3]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[4]  Vincenzo Di Lecce,et al.  An Evaluation of the Effectiveness of Image Features for Image Retrieval , 1999, J. Vis. Commun. Image Represent..

[5]  Frederic Dufaux,et al.  Motion estimation techniques for digital TV: a review and a new contribution , 1995, Proc. IEEE.

[6]  Cataldo Guaragnella,et al.  Object oriented motion estimation by sliced-block matching algorithm , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[7]  Ioannis Patras,et al.  Video Segmentation by MAP Labeling of Watershed Segments , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  M. Minenna,et al.  A new non competitive unsupervised neural network for clustering , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[9]  Takeo Kanade,et al.  Shape and motion without depth , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[10]  Rajesh N. Davé,et al.  Robust clustering methods: a unified view , 1997, IEEE Trans. Fuzzy Syst..

[11]  Graeme A. Jones,et al.  Foreground-background segmentation by cellular neural networks , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12]  Toshikazu Kato,et al.  Query by Visual Example - Content based Image Retrieval , 1992, EDBT.

[13]  Thomas Wiegand,et al.  Using multiple global motion models for improved block-based video coding , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[14]  Augusto Celentano,et al.  Similarity evaluation in image retrieval using the Hough transform , 1996 .

[15]  Cataldo Guaragnella,et al.  Motion estimation and modeling for video sequences , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[16]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.