Field, experimental, and analytical data on large-scale HPC systems and evaluation of the implications for exascale system design
暂无分享,去创建一个
David R. Kaeli | Nathan DeBardeleben | Paolo Rech | Sean Blanchard | D. Kaeli | Nathan Debardeleben | S. Blanchard | P. Rech
[1] Franck Cappello,et al. Addressing failures in exascale computing , 2014, Int. J. High Perform. Comput. Appl..
[2] Laura Monroe,et al. GPU Behavior on a Large HPC Cluster , 2013, Euro-Par Workshops.
[3] David R. Kaeli,et al. Calculating Architectural Vulnerability Factors for Spatial Multi-Bit Transient Faults , 2014, 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture.
[4] Luigi Carro,et al. Impact of GPUs Parallelism Management on Safety-Critical and HPC Applications Reliability , 2014, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks.
[5] David R. Kaeli,et al. Using hardware vulnerability factors to enhance AVF analysis , 2010, ISCA.
[6] Zhongliang Chen,et al. Exploring the heterogeneous design space for both performance and reliability , 2014, 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC).
[7] David R. Kaeli,et al. Examining the Impact of ACE interference on Multi-Bit AVF Estimates , 2015 .
[8] Joel Emer,et al. A systematic methodology to compute the architectural vulnerability factors for a high-performance microprocessor , 2003, Proceedings. 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36..
[9] David R. Kaeli,et al. Multi2Sim: A simulation framework for CPU-GPU computing , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).
[10] Luigi Carro,et al. Understanding GPU errors on large-scale HPC systems and the implications for system design and operation , 2015, 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA).