Migration Techniques in HPC Environments

Process migration is an important feature in modern computing centers as it allows for a more efficient use and maintenance of hardware. Especially in virtualized infrastructures it is successfully exploited by schemes for load balancing and energy efficiency. One can divide the tools and techniques into three groups: Process-level migration, virtual machine migration, and container-based migration.

[1]  Kai Li,et al.  Libckpt: Transparent Checkpointing under UNIX , 1995, USENIX.

[2]  Jason Duell,et al.  Requirements for Linux Checkpoint/Restart , 2002 .

[3]  Jason Duell,et al.  The Lam/Mpi Checkpoint/Restart Framework: System-Initiated Checkpointing , 2005, Int. J. High Perform. Comput. Appl..

[4]  Nathan Regola,et al.  Recommendations for Virtualization Technologies in High Performance Computing , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[5]  Dejan S. Milojicic,et al.  Process migration , 1999, ACM Comput. Surv..

[6]  Parosh Aziz Abdulla Impact of Architecture and Technology for Extreme Scale on Software and Algorithm Design , 2010 .

[7]  Miron Livny,et al.  Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System , 1997 .

[8]  Geoffrey C. Fox,et al.  Analysis of Virtualization Technologies for High Performance Computing Environments , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[9]  Gil Neiger,et al.  Intel virtualization technology , 2005, Computer.

[10]  Fabienne Anhalt,et al.  Linux-based virtualization for HPC clusters , 2009 .

[11]  Hao Jiang,et al.  A quantitative study of virtual machine live migration , 2013, CAC.

[12]  Wu-chun Feng,et al.  The design, implementation, and evaluation of mpiBLAST , 2003 .

[13]  Axel Keller,et al.  Virtualized HPC: a contradiction in terms? , 2012, Softw. Pract. Exp..

[14]  Christian Engelmann,et al.  Proactive process-level live migration in HPC environments , 2008, HiPC 2008.

[15]  Gil Neiger,et al.  Intel ® Virtualization Technology for Directed I/O , 2006 .

[16]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[17]  A. Kivity,et al.  kvm : the Linux Virtual Machine Monitor , 2007 .

[18]  Andrew Warfield,et al.  Xen and the art of virtualization , 2003, SOSP '03.

[19]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[20]  Jason Duell,et al.  The design and implementation of Berkeley Lab's linuxcheckpoint/restart , 2005 .

[21]  Jason Duell,et al.  Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters , 2006 .

[22]  Adit Ranadive,et al.  Performance implications of virtualizing multicore cluster machines , 2008, HPCVirt '08.

[23]  John Conley,et al.  Technical Report 2 , 2014 .