In recent years, real-time video communication over the internet has been widely utilized for applications like video
conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding
algorithms that can support various levels of quality in order to adapt to the network end-to-end bandwidth and
transmitter/receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet
Transform is proposed. The algorithm allows for multiple levels of detail, without re-encoding the video frames, by just
dropping the encoded information referring to higher resolution than needed. Compression is achieved by means of lossy
and lossless methods, as well as variable bit rate encoding schemes. Furthermore, due to the transformation utilized, it
does not suffer from blocking artifacts that occur with many widely adopted compression algorithms. Another highly
advantageous characteristic of the algorithm is the suppression of noise induced by low-quality sensors usually
encountered in web-cameras, due to the manipulation of the transform coefficients at the compression stage. The
proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is
enhanced by utilizing the vast computational capabilities of modern GPUs, providing satisfactory encoding and decoding
times at relatively low cost. These characteristics make this method suitable for applications like video-conferencing that
demand real-time performance, along with the highest visual quality possible for each user. Through the presented
performance and quality evaluation of the algorithm, experimental results show that the proposed algorithm achieves
better or comparable visual quality relative to other compression and encoding methods tested, while maintaining a
satisfactory compression ratio. Especially at low bitrates, it provides more human-eye friendly images compared to
algorithms utilizing block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of
artificial block artifacts.
[1]
Edward H. Adelson,et al.
The Laplacian Pyramid as a Compact Image Code
,
1983,
IEEE Trans. Commun..
[2]
Mark J. T. Smith,et al.
A filter bank for the directional decomposition of images: theory and design
,
1992,
IEEE Trans. Signal Process..
[3]
Dimitris Maroulis,et al.
Transform Feature Extraction Scheme for Ultrasound Thyroid Texture Classification
,
2010
.
[4]
Zhao Yifan,et al.
Contourlet-Based Feature Extraction on Texture Images
,
2008,
2008 International Conference on Computer Science and Software Engineering.
[5]
Jerome M. Shapiro,et al.
Embedded image coding using zerotrees of wavelet coefficients
,
1993,
IEEE Trans. Signal Process..
[6]
Minh N. Do,et al.
Ieee Transactions on Image Processing the Contourlet Transform: an Efficient Directional Multiresolution Image Representation
,
2022
.
[7]
I. Daubechies,et al.
Biorthogonal bases of compactly supported wavelets
,
1992
.
[8]
Ian Lewis,et al.
Proceedings of the SPIE
,
2012
.
[9]
Zhe Liu,et al.
Image denoising using Contourlet and two-dimensional Principle Component Analysis
,
2010,
2010 International Conference on Image Analysis and Signal Processing.
[10]
Zhe Liu.
Minimum Distance Texture Classification of SAR Images in Contourlet Domain
,
2008,
2008 International Conference on Computer Science and Software Engineering.
[11]
Nasser Kehtarnavaz,et al.
Real-Time Image and Video Processing 2012
,
2012
.
[12]
M. Vetterli.
Multi-dimensional sub-band coding: Some theory and algorithms
,
1984
.
[13]
Rico Malvar,et al.
YCoCg-R: A Color Space with RGB Reversibility and Low Dynamic Range
,
2003
.