Efficient Video Coding Considering a Video as a 3D Data Cube

All existing video coding standards consider a video as a temporal (along T-axis) collection of two dimensional pictures (formed by XY axes) and compress them by exploiting spatial and temporal redundancy in the pictures. A recent optimal compression plane (OCP) determination technique shows that better compression can be achieved by relaxing the physical meaning of axes by exploring information redundancy in a fuller extent where a video is considered as a 3D data cube. Spatial and temporal dimensions are determined based on the statistical redundancy along each axis. Treating a video as a 3D data cube revolutionizes the traditional video features such as background, motion, object, zooming, panning, etc. In this paper we apply dynamic background modeling to the OCP plane to exploit the newly generated background in the video for further improving the coding performance. The experimental results reveal that the proposed approach outperforms the existing state-of-the-art OCP technique as well as the H.264 video coding standard.

[1]  T. Yorozu,et al.  Electron Spectroscopy Studies on Magneto-Optical Media and Plastic Substrate Interface , 1987, IEEE Translation Journal on Magnetics in Japan.

[2]  H. E. Chandler,et al.  Technical writer's handbook , 1982, IEEE Transactions on Professional Communication.

[3]  Wen Gao,et al.  Dual Frame Motion Compensation With Optimal Long-Term Reference Frame Selection and Bit Allocation , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[5]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Manoranjan Paul,et al.  Video Coding Focusing on Block Partitioning and Occlusion , 2010, IEEE Transactions on Image Processing.

[7]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[8]  B. Noble,et al.  On certain integrals of Lipschitz-Hankel type involving products of bessel functions , 1955, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences.

[9]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[10]  Bu-Sung Lee,et al.  Video coding using the most common frame in scene , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  J. Maxwell A Treatise on Electricity and Magnetism , 1873, Nature.

[12]  Pamela C. Cosman,et al.  Selection of Long-Term Reference Frames in Dual-Frame Video Coding Using Simulated Annealing , 2008, IEEE Signal Processing Letters.

[13]  Ying Li,et al.  Lossless Video Sequence Compression Using Adaptive Prediction , 2007, IEEE Transactions on Image Processing.

[14]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .