Extensible Resource Scheduling for Parallel Scientific Applications

The resource requirements and processing characteristics of parallel scientific applications are quite diverse. In this paper, we present a new resource management approach for scheduling such parallel applications that combines multiple scheduling paradigms with a fault-tolerance paradigm into a coherent system. Results from a prototype implementation demonstrate that our system provides significant performance improvements over existing methods in a controllable and extensible manner.