RayCore

We present RayCore, a mobile ray-tracing hardware architecture. RayCore facilitates high-quality rendering effects, such as reflection, refraction, and shadows, on mobile devices by performing real-time Whitted ray tracing. RayCore consists of two major components: ray-tracing units (RTUs) based on a unified traversal and intersection pipeline and a tree-building unit (TBU) for dynamic scenes. The overall RayCore architecture offers considerable benefits in terms of die area, memory access, and power consumption. We have evaluated our architecture based on FPGA and ASIC evaluations and demonstrate its performance on different benchmarks. According to the results, our architecture demonstrates high performance per unit area and unit energy, making it highly suitable for use in mobile devices.

[1]  David K. McAllister,et al.  OptiX: a general purpose ray tracing engine , 2010, ACM Trans. Graph..

[2]  Dinesh Manocha,et al.  Effective traversal algorithms and hardware architecture for pyramidal inverse displacement mapping , 2014, Comput. Graph..

[3]  Ulf Assarsson,et al.  BART: A Benchmark for Animated Ray Tracing , 2000 .

[4]  Daniel Kopta,et al.  An energy and bandwidth efficient ray tracing architecture , 2013, HPG '13.

[5]  Kellogg S. Booth,et al.  Report from the chair , 1986 .

[6]  John Hart,et al.  ACM Transactions on Graphics , 2004, SIGGRAPH 2004.

[7]  Charles D. Hansen,et al.  RTSAH Traversal Order for Occlusion Rays , 2011, Comput. Graph. Forum.

[8]  Anoop Gupta,et al.  The Design and Analysis of a Cache Architecture for Texture Mapping , 1997, ISCA.

[9]  Jeong-Soo Park,et al.  The design of a texture mapping unit with effective MIP-map level selection for real-time ray tracing , 2011, IEICE Electron. Express.

[10]  Philipp Slusallek,et al.  B-KD trees for hardware accelerated ray tracing of dynamic scenes , 2006, GH '06.

[11]  Ingo Wald,et al.  Ray tracing deformable scenes using dynamic bounding volume hierarchies , 2007, TOGS.

[12]  Philipp Slusallek,et al.  Distributed interactive ray tracing of dynamic scenes , 2003, IEEE Symposium on Parallel and Large-Data Visualization and Graphics, 2003. PVG 2003..

[13]  E. Brunvand,et al.  Estimating Performance of a Ray-Tracing ASIC Design , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[14]  Phillipp Kaestner Ray Tracing From The Ground Up , 2016 .

[15]  Philipp Slusallek,et al.  Realtime ray tracing of dynamic scenes on an FPGA chip , 2004, Graphics Hardware.

[16]  Tack-Don Han,et al.  Efficient ray sorting for the tracing of incoherent rays , 2012, IEICE Electron. Express.

[17]  Ingo Wald,et al.  Fast, parallel, and asynchronous construction of BVHs for ray tracing animated scenes , 2008, Comput. Graph..

[18]  Daniel Kopta,et al.  Efficient MIMD architectures for high-performance ray tracing , 2010, 2010 IEEE International Conference on Computer Design.

[19]  Kun Zhou,et al.  Real-time KD-tree construction on graphics hardware , 2008, SIGGRAPH 2008.

[20]  Tero Karras,et al.  Architecture considerations for tracing incoherent rays , 2010, HPG '10.

[21]  Tack-Don Han,et al.  T&I engine: traversal and intersection engine for hardware accelerated ray tracing , 2011, SA '11.

[22]  Kun Zhou,et al.  Real-time KD-tree construction on graphics hardware , 2008, SIGGRAPH Asia '08.

[23]  Hye-Sun Kim,et al.  Cache-oblivious ray reordering , 2010, TOGS.

[24]  Alexander Reshetov,et al.  Multi-level ray tracing algorithm , 2005, ACM Trans. Graph..

[25]  Guy E. Blelloch,et al.  Efficient BVH construction via approximate agglomerative clustering , 2013, HPG '13.

[26]  Young-Jun Kim,et al.  A Reconfigurable SIMT Processor for Mobile Ray Tracing With Contention Reduction in Shared Memory , 2013, IEEE Transactions on Circuits and Systems I: Regular Papers.

[27]  Sanjay J. Patel,et al.  Tradeoffs in designing accelerator architectures for visual computing , 2008, 2008 41st IEEE/ACM International Symposium on Microarchitecture.

[28]  Sung-Bong Yang,et al.  gkDtree: A group-based parallel update kd-tree for interactive ray tracing , 2013, J. Syst. Archit..

[29]  Tero Karras,et al.  Maximizing parallelism in the construction of BVHs, octrees, and k-d trees , 2012, EGGH-HPG'12.

[30]  Ingo Wald,et al.  Realtime ray tracing and interactive global illumination , 2004, Ausgezeichnete Informatikdissertationen.

[31]  P. Shirley,et al.  Interactive Distribution Ray Tracing , 2006 .

[32]  Dinesh Manocha,et al.  Memory-Scalable GPU Spatial Hierarchy Construction , 2011, IEEE Transactions on Visualization and Computer Graphics.

[33]  Karthik Ramani,et al.  StreamRay: a stream filtering architecture for coherent ray tracing , 2009, ASPLOS.

[34]  Maxim Shevtsov,et al.  Highly Parallel Fast KD‐tree Construction for Interactive Ray Tracing of Dynamic Scenes , 2007, Comput. Graph. Forum.

[35]  Mary K. Vernon,et al.  Toward a multicore architecture for real-time ray-tracing , 2008, 2008 41st IEEE/ACM International Symposium on Microarchitecture.

[36]  H.-P. Seidel,et al.  On the Fast Construction of Spatial Hierarchies for Ray Tracing , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[37]  I. Wald,et al.  On building fast kd-Trees for Ray Tracing, and on doing that in O(N log N) , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[38]  Christoforos E. Kozyrakis,et al.  Understanding sources of inefficiency in general-purpose chips , 2010, ISCA.

[39]  Dinesh Manocha,et al.  SATO: Surface Area Traversal Order for Shadow Ray Tracing , 2014, Comput. Graph. Forum.

[40]  Gordon Stoll,et al.  Razor: An architecture for dynamic multiresolution ray tracing , 2011, TOGS.

[41]  Daniel Kopta,et al.  A Mobile Accelerator Architecture for Ray Tracing , 2012 .

[42]  W.R. Mark,et al.  Fast kd-tree Construction with an Adaptive Error-Bounded Heuristic , 2006, 2006 IEEE Symposium on Interactive Ray Tracing.

[43]  Michael Manzke,et al.  A hardware unit for fast SAH-optimised BVH construction , 2013, ACM Trans. Graph..

[44]  Dinesh Manocha,et al.  HART: A Hybrid Architecture for Ray Tracing Animated Scenes , 2015, IEEE Transactions on Visualization and Computer Graphics.

[45]  Ingo Wald,et al.  Combining Single and Packet-Ray Tracing for Arbitrary Ray Distributions on the Intel MIC Architecture , 2012, IEEE Transactions on Visualization and Computer Graphics.

[46]  Robert L. Cook,et al.  Distributed ray tracing , 1984, SIGGRAPH.

[47]  Ingo Wald,et al.  State of the Art in Ray Tracing Animated Scenes , 2009, Comput. Graph. Forum.

[48]  Leonidas J. Guibas,et al.  Bidirectional Estimators for Light Transport , 1995 .

[49]  Adam Herout,et al.  Yet Faster Ray-Triangle Intersection (Using SSE4) , 2010, IEEE Transactions on Visualization and Computer Graphics.

[50]  Kunle Olukotun,et al.  The Future of Microprocessors , 2005, ACM Queue.

[51]  Youngsam Shin,et al.  SGRT: a scalable mobile GPU architecture based on ray tracing , 2012, SIGGRAPH Talks.

[52]  Tack-Don Han,et al.  MobiRT: an implementation of OpenGL ES-based CPU-GPU hybrid ray tracer for mobile devices , 2010, SIGGRAPH ASIA.

[53]  Jacco Bikker,et al.  Real-time Ray Tracing through the Eyes of a Game Developer , 2007, 2007 IEEE Symposium on Interactive Ray Tracing.

[54]  Timo Aila,et al.  Ray tracing is the future and ever will be... , 2013, SIGGRAPH '13.

[55]  Timo Aila,et al.  Understanding the efficiency of ray traversal on GPUs , 2009, High Performance Graphics.

[56]  P. Slusallek,et al.  RPU: a programmable ray processing unit for realtime ray tracing , 2005, SIGGRAPH '05.

[57]  Daniel Kopta,et al.  TRaX: A Multicore Hardware Architecture for Real-Time Ray Tracing , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[58]  Turner Whitted,et al.  An improved illumination model for shaded display , 1979, CACM.

[59]  Tomas Akenine-Möller,et al.  A Benchmark for Animated Ray Tracing , 2001, IEEE Computer Graphics and Applications.

[60]  Greg Humphreys,et al.  Physically Based Rendering, Second Edition: From Theory To Implementation , 2010 .

[61]  Christiaan P. Gribble,et al.  GPU ray tracing with rayforce , 2013, SIGGRAPH '13.

[62]  Timo Aila,et al.  Fast parallel construction of high-quality bounding volume hierarchies , 2013, HPG '13.

[63]  Günter Enderle,et al.  The Evolution of Computer Graphics , 1984 .

[64]  Dinesh Manocha,et al.  Fast BVH Construction on GPUs , 2009, Comput. Graph. Forum.

[65]  Tack-Don Han,et al.  SGRT: a mobile GPU architecture for real-time ray tracing , 2013, HPG '13.

[66]  Erik Brunvand,et al.  Fast ray tracing and the potential effects on graphics and gaming courses , 2008, Comput. Graph..

[67]  Cheong-Ghil Kim,et al.  An FPGA implementation of whitted-style ray tracing accelerator , 2008, 2008 IEEE Symposium on Interactive Ray Tracing.

[68]  Alexander Keller,et al.  Instant ray tracing: the bounding interval hierarchy , 2006, EGSR '06.

[69]  John C. Hart,et al.  Parallel SAH k-D tree construction , 2010, HPG '10.

[70]  Markus Wagner,et al.  Interactive Rendering with Coherent Ray Tracing , 2001, Comput. Graph. Forum.

[71]  Xinguo Liu,et al.  SAH KD-tree construction on GPU , 2011, HPG '11.

[72]  Hans-Peter Seidel,et al.  Maximum mipmaps for fast, accurate, and scalable dynamic height field rendering , 2008, I3D '08.

[73]  Young-Jun Kim,et al.  MRTP: Mobile Ray Tracing Processor With Reconfigurable Stream Multi-Processors for High Datapath Utilization , 2012, IEEE Journal of Solid-State Circuits.