Multi-GPU Compositeless Parallel Rendering Algorithm

Due to its high performance/cost ratio, a single PC equipped with multi-GPU is an attractive platform for large scale scene rendering and visualization. In this paper, we present a compositeless parallel rendering algorithm on shared memory multi-GPU system. Our algorithm is based on hybrid sort-first and sort-last render mode. By utilizing the DMA asynchronous transfer in modern video cards, we implement asynchronous image read back and implicit image compositing. Using the compositeless algorithm, we totally remove image compositing stage in parallel rendering contrasting with the traditional parallel rendering methods. The theoretical analysis and experiments demonstrate that our algorithm is practical and scalable for large scale scene rendering and high-resolution display.