Dynamic Process Management with Allocation-internal Co-Scheduling towards Interactive Supercomputing

As systems head towards exascale, the challenges for process management with respect to flexibility and efficiency grow accordingly. Running more than one application simultaneously on a node can improve resource utilization. Beyond that, we believe that this co-scheduling approach can also provide a degree of process malleability and dynamicity that enables a form of interactivity in high-performance computing. In this paper, we present recent advances in this respect within ParaStation MPI, a high-performance MPI library supplemented by a complete framework comprising a scalable and dynamic process manager. The paper presents four new scheduling policies, implemented in ParaStation MPI, for starting multiple MPI sessions concurrently and interactively within a single allocation of nodes. The features of these policies are detailed and evaluated using the Dynamic Job Scheduler Benchmark (djsb), a tool developed by the Barcelona Supercomputing Center specifically for measuring interactivity and dynamicity metrics.
