Squeezing a Cray-class supercomputer into a single-user package

The authors describe how advances in compiler technology and the speed and density of low-cost, high-speed semiconductors were used to construct Titan, a single-user supercomputer with integral high-performance graphics. They discuss the technology constraints, the Titan architecture, and the use of parallelism on many levels.