Efficient video decoding on GPUs by point based rendering

To accelerate computation intensive video decoding tasks, we present a novel framework to offload most decoding operations to current GPUs. Our method is based on rendering graphics points and suitable for block-based video standards. By representing video blocks as graphics points, we achieve great flexibility and high parallelism to utilize the GPU's pipelined stream processing architecture. The computational resources within texture units and blending units are also exploited to facilitate computations. We propose a high performance implementation of IDCT on GPUs, which efficiently excludes most zero-value coefficients to save the bandwidth and the computations. Compared with the existing quad-based representation, our point based implementation of MC greatly reduces data transfer and redundancy. We have demonstrated the efficiency of our proposed framework by a MPEG-2 decoder. Our results indicate a significant improvement over prior CPU and GPU solutions.

[1]  Rüdiger Westermann,et al.  A particle system for interactive visualization of 3D flows , 2005, IEEE Transactions on Visualization and Computer Graphics.

[2]  Wesley De Neve,et al.  GPU-assisted decoding of video samples represented in the YCoCg-R color space , 2005, MULTIMEDIA '05.

[3]  Jens H. Krüger,et al.  A Survey of General‐Purpose Computation on Graphics Hardware , 2007, Eurographics.

[4]  Pedro V. Sander,et al.  Applications of Explicit Early-Z Culling , 2004 .

[5]  Anil Kokaram,et al.  Fast image interpolation for motion estimation using graphics hardware , 2004, IS&T/SPIE Electronic Imaging.

[6]  William J. Dally,et al.  Media processing applications on the Imagine stream processor , 2002, Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors.

[7]  Rüdiger Westermann,et al.  Linear algebra operators for GPU implementation of numerical algorithms , 2003, SIGGRAPH Courses.

[8]  A. Hirvonen,et al.  H.263 video decoding on programmable graphics hardware , 2005, Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005..

[9]  Rüdiger Westermann,et al.  UberFlow: a GPU-based particle engine , 2004, SIGGRAPH '04.

[10]  Harry Shum,et al.  Accelerate Video Decoding With Generic GPU , 2005, IEEE Trans. Circuits Syst. Video Technol..

[11]  Wayne Luk,et al.  Have GPUs made FPGAs redundant in the field of video processing? , 2005, Proceedings. 2005 IEEE International Conference on Field-Programmable Technology, 2005..

[12]  Leif Kobbelt,et al.  A survey of point-based techniques in computer graphics , 2004, Comput. Graph..

[13]  Huifang Chen,et al.  Techniques for efficient DCT/IDCT implementation on generic GPU , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[14]  Pat Hanrahan,et al.  Understanding the efficiency of GPU algorithms for matrix-matrix multiplication , 2004, Graphics Hardware.

[15]  Jooheung Lee,et al.  High performance array processor for video decoding , 2005, IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design (ISVLSI'05).