Reliability of GPU-based heterogeneous systems

[1]  Yufei Lin,et al.  HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems , 2010, 2010 5th International Conference on Computer Science & Education.

[2]  Behnam Pourghassemi cudaCR: An In-kernel Application-level Checkpoint/Restart Scheme for CUDA Applications , 2017 .

[3]  Jingling Xue,et al.  PartialRC: A Partial Recomputing Method for Efficient Fault Recovery on GPGPUs , 2012, Journal of Computer Science and Technology.

[4]  Hiroaki Kobayashi,et al.  CheCUDA: A Checkpoint/Restart Tool for CUDA Applications , 2009, 2009 International Conference on Parallel and Distributed Computing, Applications and Technologies.