GPU Accelerated H.264 Video Compression for Broadcast
暂无分享,去创建一个
[1] James Demmel,et al. Benchmarking GPUs to tune dense linear algebra , 2008, HiPC 2008.
[2] David H. Bailey. A High-Performance FFT Algorithm for Vector Supercomputers , 1987, PPSC.
[3] Franz Franchetti,et al. Discrete fourier transform on multicore , 2009, IEEE Signal Processing Magazine.
[4] R. W. Johnson,et al. A methodology for designing, modifying, and implementing Fourier transform algorithms on various architectures , 1990 .
[5] Zhiyi Yang,et al. Parallel Image Processing Based on CUDA , 2008, 2008 International Conference on Computer Science and Software Engineering.
[6] Anjul Patney,et al. Efficient computation of sum-products on GPUs through software-managed cache , 2008, ICS '08.
[7] Naga K. Govindaraju,et al. High performance discrete Fourier transforms on graphics processors , 2008, HiPC 2008.
[8] Samuel Williams,et al. Roofline: an insightful visual performance model for multicore architectures , 2009, CACM.
[9] Steven G. Johnson,et al. The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.
[10] James Demmel,et al. LU, QR and Cholesky Factorizations using Vector Capabilities of GPUs , 2008 .
[11] Pradeep Dubey,et al. Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort , 2010, SIGMOD Conference.