论文信息 - Latency Considerations of Depth-first GPU Ray Tracing

Latency Considerations of Depth-first GPU Ray Tracing

Despite the potential divergence of depth-first ray tracing [AL09], it is nevertheless the most efficient approach on massively parallel graphics processors. Due to the use of specialized caching strategies that were originally developed for texture access, it has been shown to be compute rather than bandwidth limited. Especially with recents developments however, not only the raw bandwidth, but also the latency for both memory access and read after write register dependencies can become a limiting factor. In this paper we will analyze the memory and instruction dependency latencies of depth first ray tracing. We will show that ray tracing is in fact latency limited on current GPUs and propose three simple strategies to better hide the latencies. This way, we come significantly closer to the maximum performance of the GPU.

Michael Guthe

[1] Kun Zhou,et al. Real-time KD-tree construction on graphics hardware , 2008, SIGGRAPH 2008.

[2] Andreas Dietrich,et al. Spatial splits in bounding volume hierarchies , 2009, High Performance Graphics.

[3] Alexander Keller,et al. Instant ray tracing: the bounding interval hierarchy , 2006, EGSR '06.

[4] Timo Aila,et al. Understanding the efficiency of ray traversal on GPUs , 2009, High Performance Graphics.

[5] Tero Karras,et al. Architecture considerations for tracing incoherent rays , 2010, HPG '10.

[6] S. Boulos,et al. Getting rid of packets - Efficient SIMD single-ray traversal using multi-branching BVHs - , 2008, 2008 IEEE Symposium on Interactive Ray Tracing.

[7] Timo Aila,et al. Megakernels considered harmful: wavefront path tracing on GPUs , 2013, HPG '13.

[8] Dinesh Manocha,et al. Fast BVH Construction on GPUs , 2009, Comput. Graph. Forum.

[9] Dietger van Antwerpen,et al. Improving SIMD efficiency for parallel Monte Carlo light transport on the GPU , 2011, HPG '11.

[10] Kenneth E. Batcher,et al. Sorting networks and their applications , 1968, AFIPS Spring Joint Computing Conference.