Analysis of Node Failures in High Performance Computers Based on System Logs
[1] Franck Cappello, et al. Toward Exascale Resilience: 2014 update, 2014, Supercomput. Front. Innov.
[2] Franck Cappello, et al. Addressing failures in exascale computing, 2014, Int. J. High Perform. Comput. Appl.
[3] Franck Cappello, et al. Failure prediction for HPC systems and applications, 2013, Int. J. High Perform. Comput. Appl.
[4] Mark S. Squillante, et al. Failure data analysis of a large-scale heterogeneous server environment, 2004, International Conference on Dependable Systems and Networks, 2004.
[5] Alexandru Iosup, et al. A Model for Space-Correlated Failures in Large-Scale Distributed Systems, 2010, Euro-Par.
[6] Alexandru Iosup, et al. Analysis and modeling of time-correlated failures in large-scale distributed systems, 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.