Efficient Parallel Scan Algorithms for GPUs
暂无分享,去创建一个
[1] Guy E. Blelloch,et al. Vector Models for Data-Parallel Computing , 1990 .
[2] Erik Lindholm,et al. NVIDIA Tesla: A Unified Graphics and Computing Architecture , 2008, IEEE Micro.
[3] Ahmed Sameh,et al. The Illiac IV system , 1972 .
[4] Kenneth E. Iverson,et al. A programming language , 1899, AIEE-IRE '62 (Spring).
[5] Naga K. Govindaraju,et al. Fast scan algorithms on graphics processors , 2008, ICS '08.
[6] Guy E. Blelloch,et al. Implementation of a portable nested data-parallel language , 1993, PPOPP '93.
[7] Anselmo Lastra,et al. Fast Summed‐Area Table Generation and its Applications , 2005, Comput. Graph. Forum.
[8] Yao Zhang,et al. Scan primitives for GPU computing , 2007, GH '07.
[9] Sanjeev Saxena,et al. On Parallel Prefix Computation , 1994, Parallel Process. Lett..
[10] John D. Owens,et al. A Work-Efficient Step-Efficient Prefix Sum Algorithm , 2006 .
[11] Guy E. Blelloch,et al. Scan primitives for vector computers , 1990, Proceedings SUPERCOMPUTING '90.
[12] Mark J. Harris,et al. Parallel Prefix Sum (Scan) with CUDA , 2011 .
[13] Guy E. Blelloch,et al. Scans as Primitive Parallel Operations , 1989, ICPP.
[14] W. Daniel Hillis,et al. Data parallel algorithms , 1986, CACM.
[15] Kevin Skadron,et al. Scalable parallel programming , 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).