Optimizing Computing and Energy Performances in Heterogeneous Clusters of CPUs and GPUs

1.

[1]  Majid Sarrafzadeh,et al.  Energy-aware high performance computing with graphic processing units , 2008, CLUSTER 2008.

[2]  Thomas Jost,et al.  An efficient multi-algorithms sparse linear solver for GPUs , 2009, PARCO.

[3]  N.K. Govindaraju,et al.  A Memory Model for Scientific Algorithms on Graphics Processors , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[4]  Michael Allen Heroux,et al.  A proposal for a sparse blas toolkit , 1992 .

[5]  D. Szyld,et al.  On asynchronous iterations , 2000 .

[6]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[7]  Thomas Jost,et al.  Optimizing computing and energy performances on GPU clusters: experimentation on a PDE solver , 2010 .

[8]  Stéphane Vialle,et al.  High dimensional pricing of exotic European contracts on a GPU Cluster, and comparison to a CPU cluster , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[9]  Raphaël Couturier,et al.  Synchronous and asynchronous solution of a 3D transport model in a grid computing environment , 2006 .

[10]  Jacques M. Bahi,et al.  Asynchronous Iterative Algorithms for Nonexpansive Linear Systems , 2000, J. Parallel Distributed Comput..

[11]  Xiaohan Ma,et al.  Statistical Power Consumption Analysis and Modeling for GPU-based Computing , 2011 .

[12]  Raphaël Couturier,et al.  Parallel Iterative Algorithms: From Sequential to Grid Computing (Chapman & Hall/crc Numerical Analy & Scient Comp. Series) , 2007 .

[13]  Thomas Jost,et al.  Impact of Asynchronism on GPU Accelerated Parallel Iterative Computations , 2010, PARA.

[14]  Mark Horowitz,et al.  Energy dissipation in general purpose microprocessors , 1996, IEEE J. Solid State Circuits.

[15]  Jacques M. Bahi,et al.  Evaluation of the asynchronous iterative algorithms in the context of distant heterogeneous clusters , 2005, Parallel Comput..

[16]  Hiroaki Kobayashi,et al.  SPRAT: Runtime processor selection for energy-aware computing , 2008, 2008 IEEE International Conference on Cluster Computing.

[17]  Raphaël Couturier,et al.  Asynchronism for iterative algorithms in a global computing environment , 2002, Proceedings 16th Annual International Symposium on High Performance Computing Systems and Applications.

[18]  Jack J. Dongarra,et al.  A proposal for a set of level 3 basic linear algebra subprograms , 1987, SGNM.

[19]  Sally A. McKee,et al.  Hitting the memory wall: implications of the obvious , 1995, CARN.

[20]  Jacques M. Bahi,et al.  An Efficient and Robust Decentralized Algorithm for Detecting the Global Convergence in Asynchronous Iterative Algorithms , 2008, VECPAR.

[21]  Daniel B. Szyld,et al.  Block and asynchronous two-stage methods for mildly nonlinear systems , 1999, Numerische Mathematik.