NPU-Based Image Compositing in a Distributed Visualization System

This paper describes the first use of a network processing unit (NPU) to perform hardware-based image composition in a distributed rendering system. The image composition step is a notorious bottleneck in a clustered rendering system. Furthermore, image compositing algorithms do not necessarily scale as data size and number of nodes increase. Previous researchers have addressed the composition problem via software and/or custom-built hardware. We used the heterogeneous multicore computation architecture of the Intel IXP28XX NPU, a fully programmable commercial off-the-shelf (COTS) technology, to perform the image composition step. With this design, we have attained a nearly four-times performance increase over traditional software-based compositing methods, achieving sustained compositing rates of 22-28 fps on a 1.021times1.024 image. This system is fully scalable with a negligible penalty in frame rate, is entirely COTS, and is flexible with regard to operating system, rendering software, graphics cards, and node architecture. The NPU-based compositor has the additional advantage of being a modular compositing component that is eminently suitable for integration into existing distributed software visualization packages.

[1]  Bill Carlson Intel Internet Exchange Architecture and Applications: A Practical Guide to Intel's Network Processors , 2003 .

[2]  M.W. El-Kharashi,et al.  A study on design approaches for network processor units , 2003, 2003 IEEE Pacific Rim Conference on Communications Computers and Signal Processing (PACRIM 2003) (Cat. No.03CH37490).

[3]  Joe Michael Kniss,et al.  Interactive Texture-Based Volume Rendering for Large Data Sets , 2001, IEEE Computer Graphics and Applications.

[4]  Naohisa Sakamoto,et al.  Hybrid hardware-accelerated image composition for sort-last parallel rendering on graphics clusters with commodity image compositor , 2004, 2004 IEEE Symposium on Volume Visualization and Graphics.

[5]  Kwan-Liu Ma,et al.  The design and evaluation of a pipelined image compositing device for massively parallel volume rendering , 2003 .

[6]  Fumihiko Ino,et al.  An improved binary-swap compositing for sort-last parallel rendering on distributed memory multiprocessors , 2003, Parallel Comput..

[7]  Greg Humphreys,et al.  Chromium: a stream-processing framework for interactive rendering on clusters , 2002, SIGGRAPH.

[8]  Gerald T. Seidler,et al.  Simulation of the densification of real open-celled foam microstructures , 2005 .

[9]  Thomas Ertl,et al.  Hierarchical Visualization and Compression of Large Volume Datasets Using GPU Clusters , 2004, EGPGV.

[10]  Christophe Mion,et al.  Pipelined Sort-last Rendering: Scalability, Performance and Beyond , 2006 .

[11]  Kwan-Liu Ma,et al.  Parallel volume rendering using binary-swap compositing , 1994, IEEE Computer Graphics and Applications.

[12]  Douglas Comer,et al.  Network Systems Design Using Network Processors , 2003 .

[13]  Christophe Mion,et al.  COTS cluster-based sort-last rendering: performance evaluation and pipelined implementation , 2005, VIS 05. IEEE Visualization, 2005..

[14]  Laurent Moll,et al.  Sepia: scalable 3D compositing using PCI Pamette , 1999, Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.PR00375).

[15]  Jason Leigh,et al.  A Sort-Last Rendering System over an Optical Backplane , 2005 .

[16]  DuffTom,et al.  Compositing digital images , 1984 .

[17]  Luís Paulo Santos,et al.  Parallel Graphics and Visualization , 2008, Comput. Graph..

[18]  Gordon Stoll,et al.  Lightning-2: a high-performance display subsystem for PC clusters , 2001, SIGGRAPH.

[19]  Kwan-Liu Ma,et al.  The Design and Evaluation of a Pipelined Image Compositing Device for Massively Parallel Volume Rendering , 2003, VG.

[20]  Tom Duff,et al.  Compositing digital images , 1984, SIGGRAPH.

[21]  M. J. Prelle,et al.  MultiCore Framework: An API for Programming Heterogeneous Multicore Processors , 2006 .

[22]  Santiago V. Lombeyda,et al.  Scalable interactive volume rendering using off-the-shelf components , 2001, Proceedings IEEE 2001 Symposium on Parallel and Large-Data Visualization and Graphics (Cat. No.01EX520).