Analysis of coding tools and improvement of text readability for screen content

Current video coding standards perform well for video sequences captured by a real camera. The aperture of the camera's optical system smooths the content and attenuates higher frequencies. New application scenarios, enabled by the growing number of high bit rate internet gateways, however, make it necessary to take a closer look at the efficiency of such standards in handling artificial content. Remote desktop applications for example often include text parts. As a consequence, these content types contain sharp edges or high frequencies, which are considered less important in natural video and are therefore treated less carefully. The frequent result is an increased occurrence of artefacts or the loss of information that is actually important to the user. This paper gives an analysis of such artificially created video sequences, evaluates the performance of current coding tools for this type of content and proposes a simple, yet effective way to maintain readability of text within video material using only well considered encoder control and without the need of large additional modules.

[1]  Pengwei Hao,et al.  Compound image compression for real-time computer screen image transmission , 2005, IEEE Transactions on Image Processing.

[2]  Tristan Richardson RealVNC The RFB Protocol , 1998 .

[3]  Stefanos D. Kollias,et al.  Low bit-rate coding of image sequences using adaptive regions of interest , 1998, IEEE Trans. Circuits Syst. Video Technol..

[4]  Feng Wu,et al.  Enable Efficient Compound Image Compression in H.264/AVC Intra Coding , 2007, 2007 IEEE International Conference on Image Processing.

[5]  Jörn Ostermann,et al.  Low bit rate ROI based video coding for HDTV aerial surveillance video sequences , 2011, CVPR 2011 WORKSHOPS.

[6]  Jian Wang,et al.  Skin-tone Macroblock detection for video coding , 2011, 2011 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[7]  Guangming Shi,et al.  Compress compound images in H.264/MPEG-4 AVC by fully exploiting spatial correlation , 2009, 2009 IEEE International Symposium on Circuits and Systems.