Knowledge based image sequence compression

In this thesis, most commonly encountered video compression techniques and international coding standards are studied. The study leads to the idea of a reconfigurable codec which can adapt itself to the specific requirements of diverse applications so as to achieve improved performance. Firstly, we propose a multiple layer affine motion compensated codec which acts as a basic building block of the reconfigurable multiple tool video codec. A detailed investigation of the properties of the proposed codec is carried out. The experimental results reveal that the gain in coding efficiency from improved motion prediction and segmentation is proportional to the spatial complexity of the sequence being encoded. Secondly, a framework for the reconfigurable multiple tool video codec is developed and its key parts are discussed in detail. Two important concepts virtual codec and virtual tool are introduced. A prototype of the proposed reconfigurable multiple tool video codec is implemented. The codec structure and the constituent tools of the codec included in the prototype are extensively tested and evaluated to prove the concept. The results confirm that different applications require different codec configurations to achieve optimum performance. Thirdly, a knowledge based tool selection system for the reconfigurable codec is proposed and developed. Human knowledge as well as sequence properties are taken into account in the tool selection procedure. It is shown that the proposed tool selection mechanism gives promising results. Finally, concluding remarks are offered and future research directions are suggested.

[1]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[2]  M. Goldberg,et al.  Adptive vector quantization for image sequence coding , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  T Koga,et al.  MOTION COMPENSATED INTER-FRAME CODING FOR VIDEO CONFERENCING , 1981 .

[4]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[5]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Kiyoharu Aizawa,et al.  Adaptive discrete cosine transform image coding using gain/Shape vector quantizers , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Jorma Rissanen,et al.  Generalized Kraft Inequality and Arithmetic Coding , 1976, IBM J. Res. Dev..

[8]  P. Pirsch,et al.  Advances in picture coding , 1985, Proceedings of the IEEE.

[9]  Martin Vetterli,et al.  Fast 2-D discrete cosine transform , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  C. Sparrow The Fractal Geometry of Nature , 1984 .

[11]  Robert M. Gray,et al.  A low complexity multiresolution approach to image compression using pruned nested tree-structured vector quantization , 1994, Proceedings of 1st International Conference on Image Processing.

[12]  Josef Kittler,et al.  Robust motion estimation and multistage vector quantisation for sequence compression , 1994, Proceedings of 1st International Conference on Image Processing.

[13]  Josef Kittler,et al.  Variable block size video coding with motion prediction and motion segmentation , 1995, Electronic Imaging.

[14]  D. M. Monro,et al.  Fractal approximation of image blocks , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Josef Kittler,et al.  Video coding using affine motion compensated prediction , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[16]  Touradj Ebrahimi,et al.  Dynamic coding for visual communications , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[17]  Marcel Breeuwer,et al.  An introduction to source coding , 1993 .

[18]  Josef Kittler,et al.  A low bit rate hierarchical video codec , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[19]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[20]  R. Srinivasan,et al.  Predictive Coding Based on Efficient Motion Estimation , 1985, IEEE Trans. Commun..

[21]  William G. Wee,et al.  Fractal-based image compression: a fast algorithm using wavelet transform , 1994, Other Conferences.

[22]  John W. Woods,et al.  Subband coding of images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[23]  Josef Kittler,et al.  A hybrid codec for very low bit rate video coding , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[24]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[25]  Tor A. Ramstad,et al.  An inner product space approach to image coding by contractive transformations , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[26]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[27]  Anthony G. Constantinides,et al.  Low bit rate video coding using variable block size model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[28]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[29]  Frank Rubin,et al.  Arithmetic stream coding using fixed precision registers , 1979, IEEE Trans. Inf. Theory.

[30]  Martin Vetterli,et al.  A theory of multirate filter banks , 1987, IEEE Trans. Acoust. Speech Signal Process..

[31]  Donald M. Monro,et al.  Generalized fractal transforms: complexity issues , 1993, [Proceedings] DCC `93: Data Compression Conference.

[32]  Bhaskar Ramamurthi,et al.  Image vector quantization with a perceptually-based cell classifier , 1984, ICASSP.

[33]  P. P. Vaidyanathan,et al.  Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial , 1990, Proc. IEEE.

[34]  Glen G. Langdon,et al.  An Overview of the Basic Principles of the Q-Coder Adaptive Binary Arithmetic Coder , 1988, IBM J. Res. Dev..

[35]  David Malah,et al.  Change detection and texture analysis for image sequence coding , 1994, Signal Process. Image Commun..

[36]  Jan Biemond,et al.  Sub-band coding of images using predictive vector quantization , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[37]  G.S. Moschytz,et al.  Practical fast 1-D DCT algorithms with 11 multiplications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[38]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[39]  Y. Ninomiya,et al.  A Motion-Compensated Interframe Coding Scheme for Television Pictures , 1982, IEEE Trans. Commun..