The quest for petascale computing

Although the challenges to achieving petascale computing within the next decade are daunting, several software and hardware technologies are emerging that could help us reach this goal. The authors review these technologies and consider new algorithms capable of exploiting a petascale computer's architecture. One petaflop per second is a rate of computation corresponding to 10/sup 15/ floating-point operations per second. To be of use in scientific computing, a computer capable of this prodigious speed needs a main memory of tens or hundreds of terabytes and enormous amounts of mass storage. Sophisticated compilers and high memory and I/O bandwidth are also essential to exploit the architecture efficiently. To mask the hardware and software complexities from the scientific end user, it would be advantageous to access and use a petascale computer through an advanced problem-solving environment. Immersive visualization environments could play an important role in analyzing and navigating the output from petascale computations. Thus, petascale computing is capable of driving the next decade of research in high-performance computing and communications and will require advances across all aspects of it.

[1]  J. Dongarra Performance of various computers using standard linear equations software , 1990, CARN.

[2]  L M Adleman,et al.  Molecular computation of solutions to combinatorial problems. , 1994, Science.

[3]  Victor Eijkhout,et al.  Algorithmic bombardment for the iterative solution of linear systems: a poly-iterative approach , 1994 .

[4]  David H. Bailey Onward to petaflops computing , 1997, CACM.

[5]  Andrew S. Grimshaw,et al.  The Legion vision of a worldwide virtual computer , 1997, Commun. ACM.

[6]  Toshikazu Ebisuzaki,et al.  GRAPE-4: A Massively Parallel Special-Purpose Computer for Collisional N-Body Simulations , 1997 .

[7]  Jack Dongarra,et al.  Top500 Supercomputer Sites - 13th edition , 1998 .

[8]  The marketplace of high-performance computing , 1999, Parallel Comput..

[9]  Lov K. Grover,et al.  Quantum computation , 1999, Proceedings Twelfth International Conference on VLSI Design. (Cat. No.PR00013).

[10]  Tommaso Toffoli,et al.  Programmable matter methods , 1999, Future Gener. Comput. Syst..

[11]  Ian T. Foster,et al.  The Globus project: a status report , 1999, Future Gener. Comput. Syst..

[12]  John Shalf,et al.  Numerical Relativity in a Distributed Environment , 1999, PPSC.

[13]  Yuefan Deng,et al.  New trends in high performance computing , 2001, Parallel Computing.

[14]  Jack J. Dongarra,et al.  Automated empirical optimizations of software and the ATLAS project , 2001, Parallel Comput..

[15]  R. Schaller,et al.  Technological innovation in the semiconductor industry: A case study of the International Technology Roadmap for Semiconductors (ITRS) , 2001, PICMET '01. Portland International Conference on Management of Engineering and Technology. Proceedings Vol.1: Book of Summaries (IEEE Cat. No.01CH37199).

[16]  David P. Anderson,et al.  SETI@home-massively distributed computing for SETI , 2001, Comput. Sci. Eng..