Computer Vision in Next Generation Image and Video Coding

In this paper we will explore the various ways in which Computer Vision techniques can aid and sometimes even replace the classical algorithms used in many still image and video coding standards. Because compression factor is one of the most important performance criteria of image/video coding we will examine the effectiveness of these techniques in achieving low and very-low bit rates. Motivation for the recent demand of very low-bit rate algorithms stems in part from the introduction of advanced multimedia applications. The desktop computer, for instance, is fast becoming much more than a number crunching machine and will likely become the main channel for interpersonal communications as well as our interface to what is being called the information superhighway. The video inputs to such systems will not only need to capture and code images and video but also obtain some measure of understanding of the scenes before them. Classical or first generation coding algorithms, while indispensable in many fields of communications, fall short of this task. Only a fusion of such algorithms with the work currently being done in computer vision, image understanding, and artificial intelligence will produce a viable system. Since, traditionally, these communities have been rather disparate, it is necessary to begin at a basic level and explore the ways in which techniques from each field can be integrated to create a system that is truly more than the sum of its parts. With that end in mind, this paper will address some of the common aspects of image and video coding with a concentration on one of the more important research areas — the estimation of 2D and 3D motion to use in efficiently compressing video sequences using 3D models.

[1]  Kiyoharu Aizawa,et al.  Model-based image coding advanced video coding techniques for very low bit-rate applications , 1995, Proc. IEEE.

[2]  Chi-Fa Chen,et al.  Hybrid coders with motion compensation , 1992, Multidimens. Syst. Signal Process..

[3]  Thomas Sikora,et al.  Digital video coding standards and their role in video communications , 1995, Proc. IEEE.

[4]  Arun N. Netravali,et al.  Digital Pictures: Representation and Compression , 1988 .

[5]  Hsueh-Ming Hang,et al.  Image and video coding standards , 1993, AT&T Technical Journal.

[6]  Pertti Roivainen,et al.  3-D Motion Estimation in Model-Based Facial Image Coding , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Wei Li,et al.  New trends in very low bitrate video coding , 1995, Proc. IEEE.

[8]  Bernd Girod,et al.  The Efficiency of Motion-Compensating Prediction for Hybrid Coding of Video Sequences , 1987, IEEE J. Sel. Areas Commun..

[9]  Andy C. Downton,et al.  A switched model-based coder for video signals , 1994, IEEE Trans. Circuits Syst. Video Technol..

[10]  T. Murakami,et al.  3-D motion estimation of human head for model-based image coding , 1993 .

[11]  Thomas W. Ryan,et al.  Wavelet-domain texture modeling for image compression , 1994, Proceedings of 1st International Conference on Image Processing.

[12]  D. E. Pearson,et al.  Developments in model-based video coding , 1995, Proc. IEEE.

[13]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[14]  Thomas S. Huang,et al.  Head Pose Computation for Very Low Bit-rate Video Coding , 1995, CAIP.

[15]  J. Kowalczuk,et al.  VLSI for digital television , 1995 .

[16]  D. W. Lin,et al.  Video on phone lines: technology and applications , 1995 .

[17]  Ronald W. Schafer,et al.  A non-expansive pyramidal morphological image coder , 1994, Proceedings of 1st International Conference on Image Processing.