Tips, tricks and troubles: optimizing for cell and GPU
暂无分享,去创建一个
Carsten Griwodz | Pål Halvorsen | Håkon Kvale Stensland | Håvard Espeland | C. Griwodz | P. Halvorsen | H. Stensland | H. Espeland
[1] Borko Furht,et al. Exploring NVIDIA-CUDA for video coding , 2010, MMSys '10.
[2] Michael Kistler,et al. Exploring the Viability of the Cell Broadband Engine for Bioinformatics Applications , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.
[3] Fabrizio Petrini,et al. Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.
[4] Alexander Ottesen,et al. Efficient parallelisation techniques for applications running on GPUs using the CUDA framework , 2009 .
[5] Michael Boyer. Automated Dynamic Analysis of CUDA Programs , 2008 .
[6] Nagarajan Ranganathan,et al. JAGUAR: a fully pipelined VLSI architecture for JPEG image compression standard , 1995, Proc. IEEE.
[7] Y. Arai,et al. A Fast DCT-SQ Scheme for Images , 1988 .
[8] Anthony Skjellum,et al. Accelerating Reed-Solomon coding in RAID systems with GPUs , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.
[9] Rob van Nieuwpoort,et al. Evaluating multi-core platforms for HPC data-intensive kernels , 2009, CF '09.