Image sequence coding at very low bit rates: a review

This paper presents a review of promising techniques for very low bit-rate, below 64 kb/s, image sequence coding. Image sequence coding at such low rates will be a crucial technique in forthcoming visual services, e.g., visual information transmission and storage. A typical application is to transmit moving videophone scenes through the existing analog telephone lines or via a mobile channel. Two types of potential coding techniques are addressed: waveform-based image sequence coding and model-based image sequence coding.

[1]  Gary J. Sullivan,et al.  Motion compensation for video compression using control grid interpolation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[2]  M.L. Liou,et al.  Visual telephony as an ISDN application , 1990, IEEE Communications Magazine.

[3]  Thomas S. Huang PCM picture transmission , 1965, IEEE Spectrum.

[4]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[5]  Kenji Mase An Application of Optical Flow - Extraction of Facial Expression - , 1990, MVA.

[6]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7]  Hiroshi Harashima,et al.  Model-based/waveform hybrid coding for videotelephone images , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Jens-Rainer Ohm Temporal domain sub-band video coding with motion compensation , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Pertti Roivainen,et al.  3-D Motion Estimation in Model-Based Facial Image Coding , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Lajos Hanzo,et al.  Transmission of subband-coded images via mobile channels , 1993, IEEE Trans. Circuits Syst. Video Technol..

[11]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[12]  Arun N. Netravali,et al.  Digital Pictures: Representation and Compression , 1988 .

[13]  N. D. Duffy,et al.  A Texture Mapping Approach to 3‐D Facial Image Synthesis , 1988, Comput. Graph. Forum.

[14]  Michael Hötter,et al.  Object-oriented analysis-synthesis coding based on moving two-dimensional objects , 1990, Signal Process. Image Commun..

[15]  Demetri Terzopoulos,et al.  Analysis of facial images using physical and anatomical models , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[16]  Daniel Cohen-Or,et al.  Volume graphics , 1993, Computer.

[17]  Ahmad Fadzil M. Hani,et al.  Video subband VQ coding at 64 kbit/s using short-kernel filter banks with an improved motion estimation technique , 1991, Signal Process. Image Commun..

[18]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .

[19]  John W. Woods,et al.  Subband Image Coding , 1990 .

[20]  Jake K. Aggarwal,et al.  TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 2008 .

[21]  R.H.J.M. Plompen,et al.  Motion video coding for visual telephony , 1990 .

[22]  Alex Pentland,et al.  Recovery of Nonrigid Motion and Structure , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Stefano Tubaro,et al.  Use of pan/zoom information in image segmentation and interpolation , 1993 .

[24]  Nikil Jayant,et al.  Signal Compression: Technology Targets and Research Directions , 1992, IEEE J. Sel. Areas Commun..

[25]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[26]  Dimitris N. Metaxas,et al.  Dynamic 3D Models with Local and Global Deformations: Deformable Superquadrics , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Yao Wang,et al.  Active mesh-a feature seeking and tracking image sequence representation scheme , 1994, IEEE Trans. Image Process..

[28]  H. Busch,et al.  Subdividing non-rigid 3D objects into quasi-rigid parts , 1989 .

[29]  A. Jacquin Fractal image coding: a review , 1993, Proc. IEEE.

[30]  H. Brusewitz,et al.  Motion compensation with triangles , 1990 .

[31]  William Kenneth Pratt,et al.  Image transmission techniques , 1979 .

[32]  Nariman Farvardin,et al.  Three-dimensional subband coding of video , 1995, IEEE Trans. Image Process..

[33]  Reinhard Koch,et al.  Dynamic 3-D Scene Analysis Through Synthesis Feedback Control , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  S. Sabri,et al.  Video conferencing systems , 1985, Proceedings of the IEEE.

[35]  Thomas S. Huang,et al.  Image Sequence Processing and Dynamic Scene Analysis , 1983, NATO ASI Series.

[36]  Parke,et al.  Parameterized Models for Facial Animation , 1982, IEEE Computer Graphics and Applications.

[37]  Peter Strobach,et al.  Space-variant regular decomposition quadtrees in adaptive interframe coding , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[38]  R. J. Clarke,et al.  Motion estimation and compensation for image sequence coding , 1992, Signal Process. Image Commun..

[39]  Eric Dubois,et al.  Noise Reduction in Image Sequences Using Motion-Compensated Temporal Filtering , 1984, IEEE Trans. Commun..

[40]  Michael Mills,et al.  Blockmatching motion estimation algorithms-new results , 1990 .

[41]  Don E. Pearson,et al.  Texture mapping in model-based image coding , 1990, Signal Process. Image Commun..

[42]  Hiroshi Harashima,et al.  Iterative motion estimation method using triangular patches for motion compensation , 1991, Other Conferences.

[43]  F Kappei,et al.  Modelling Of A Natural 3-D Scene Consisting Of Moving Objects From A Sequence Of Monocular Tv Images , 1988, Other Conferences.

[44]  Haibo Li Segmentation of the facial area for videophone applications , 1992 .

[45]  Eric L. W. Grimson,et al.  From Images to Surfaces: A Computational Study of the Human Early Visual System , 1981 .

[46]  Denis Laurendeau,et al.  3-D Sensing for industrial computer vision , 1988 .

[47]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch , 1981 .

[48]  Haibo Li,et al.  A new motion-compensated technique for video compression , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[49]  Alex Pentland,et al.  Modal Descriptions for Recognition and Tracking , 1992, MVA.

[50]  D. Pearson Model-based image coding , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[51]  Kiyoharu Aizawa,et al.  Human facial motion modeling, analysis, and synthesis for video compression , 1991, Other Conferences.

[52]  Murat Kunt,et al.  Image Sequence Coding Using Motion Compensated Subband Decomposition , 1993 .

[53]  Alex Pentland,et al.  Closed-Form Solutions for Physically Based Shape Modeling and Recognition , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[54]  Haibo Li,et al.  Fractal-based image sequence compression scheme , 1993 .

[55]  W. F. Schreiber Picture coding , 1967 .

[56]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[57]  Haibo Li,et al.  Two-view facial movement estimation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[58]  A.N. Netravali,et al.  Picture coding: A review , 1980, Proceedings of the IEEE.

[59]  Alex Pentland,et al.  Perceptual Organization and the Representation of Natural Form , 1986, Artif. Intell..

[60]  Guner S. Robinson,et al.  A survey ol digital picture coding , 1974, Computer.

[61]  D. Legall,et al.  MPEG : A video compression standard for multimedia applications , 1991 .

[62]  M. Ibrahim Sezan,et al.  Motion-adaptive weighted averaging for temporal filtering of noisy image sequences , 1992, Electronic Imaging.

[63]  P. W. Jones,et al.  Digital Image Compression Techniques , 1991 .

[64]  Jörn Ostermann Object-based analysis-synthesis coding (OBASC) based on the source model of moving flexible 3-D objects , 1994, IEEE Trans. Image Process..

[65]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[66]  F. Glazer,et al.  Scene Matching by Hierarchical Correlation , 1983 .

[67]  Robert F. Sproull,et al.  Principles in interactive computer graphics , 1973 .

[68]  Wesley E. Snyder,et al.  Motion estimation optimization , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[69]  A. Habibi Hybrid Coding of Pictorial Data , 1974, IEEE Trans. Commun..

[70]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[71]  C. D. Kuglin,et al.  Video-Rate Image Correlation Processor , 1977, Optics & Photonics.

[72]  R. Forchheimer,et al.  Recursive estimation of facial expression and movement , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[73]  Mohammad Ghanbari,et al.  Generalized block-matching motion estimation , 1992, Other Conferences.

[74]  J. Salz,et al.  Algorithms for estimation of three-dimensional motion , 1985, AT&T Technical Journal.

[75]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[76]  Anil K. Jain,et al.  Image data compression: A review , 1981, Proceedings of the IEEE.

[77]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[78]  Barr,et al.  Superquadrics and Angle-Preserving Transformations , 1981, IEEE Computer Graphics and Applications.

[79]  Gary J. Sullivan,et al.  Multi-hypothesis motion compensation for low bit-rate video coding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[80]  H. Harashima,et al.  Analysis and synthesis of facial expressions in knowledge-based coding of facial image sequences , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[81]  Olivier D. Faugeras,et al.  Representing Stereo Data with the Delaunay Triangulation , 1990, Artif. Intell..

[82]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[83]  M. Hötter,et al.  Image segmentation based on object oriented mapping parameter estimation , 1988 .

[84]  Allen Gersho,et al.  Joint motion compensated prediction and interpolation of video sequences , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[85]  Bernd Girod,et al.  The Efficiency of Motion-Compensating Prediction for Hybrid Coding of Video Sequences , 1987, IEEE J. Sel. Areas Commun..

[86]  Henri Nicolas,et al.  Region-based motion estimation using deterministic relaxation schemes for image sequence coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[87]  O. J. Tretiak,et al.  Design considerations in PCM transmission of low-resolution monochrome still pictures , 1967 .

[88]  Bernd Girod,et al.  Motion-compensating prediction with fractional-pel accuracy , 1993, IEEE Trans. Commun..

[89]  Masahide Kaneko,et al.  Coding of facial image sequence based on a 3-D model of the head and motion detection , 1991, J. Vis. Commun. Image Represent..

[90]  Christoph Stiller Motion estimation for coding of moving video at 8 kbit/s with Gibbs-modeled vectorfield smoothing , 1990, Other Conferences.

[91]  Norbert Diehl,et al.  Object-oriented motion estimation and segmentation in image sequences , 1991, Signal Process. Image Commun..

[92]  J. Roese,et al.  Interframe Cosine Transform Image Coding , 1977, IEEE Trans. Commun..

[93]  Mehmet K. Özkan,et al.  Temporally adaptive filtering of noisy image sequences using a robust motion estimation algorithm , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[94]  Petros Maragos,et al.  Motion displacement estimation using an affine model for image matching , 1991 .

[95]  Robert Forchheimer,et al.  Image coding-from waveforms in animation , 1989, IEEE Trans. Acoust. Speech Signal Process..

[96]  Reinhard Koch,et al.  Automatic Reconstruction of Buildings from Stereoscopic Image Sequences , 1993, Comput. Graph. Forum.

[97]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[98]  Reginald L. Lagendijk,et al.  Motion compensated frame rate conversion of motion pictures , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[99]  Lyman P. Hurd,et al.  Fractal image compression , 1993 .

[100]  Joern Ostermann,et al.  Modeling of 3-D moving objects for an analysis-synthesis coder , 1990, Other Conferences.

[101]  B.P. Yuhas,et al.  Integration of acoustic and visual speech signals using neural networks , 1989, IEEE Communications Magazine.

[102]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[103]  R. L. Baker,et al.  Global zoom/pan estimation and compensation for video compression , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[104]  Wolfgang Guse,et al.  Coding of moving video at 1 mbit/s: movies on CD , 1990, Other Conferences.

[105]  J. D. Robbins,et al.  Motion-compensated television coding: Part I , 1979, The Bell System Technical Journal.

[106]  Arnaud E. Jacquin,et al.  Image coding based on a fractal theory of iterated contractive image transformations , 1992, IEEE Trans. Image Process..

[107]  D. Boekee,et al.  A pel-recursive Wiener-based displacement estimation algorithm , 1987 .

[108]  M. Bierling,et al.  Displacement Estimation By Hierarchical Blockmatching , 1988, Other Conferences.

[109]  Michael Hoetter Differential estimation of the global motion parameters zoom and pan , 1989 .

[110]  Josef Kittler,et al.  A differential method for simultaneous estimation of rotation, change of scale and translation , 1990, Signal Process. Image Commun..

[111]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[112]  Vicki Bruce,et al.  COMPUTER RECOGNITION OF FACES , 1989 .

[113]  Staffan Efucsson Fixed and Adaptive Predictors for Hybrid Predictive/Transform Coding , 1985 .

[114]  Thomas Engelhardt,et al.  Coding of arbitrarily shaped image segments based on a generalized orthogonal transform , 1989, Signal Process. Image Commun..

[115]  G. Knowles,et al.  Video compression using 3D wavelet transforms , 1990 .

[116]  P. Pirsch,et al.  Advances in picture coding , 1985, Proceedings of the IEEE.

[117]  Kiyoharu Aizawa,et al.  An intelligent facial image coding driven by speech and phoneme , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[118]  Bill Welsh,et al.  Model-based image coding , 1990 .