Dynamic Task-Scheduling and Resource Management for GPU Accelerators in Medical Imaging

For medical imaging applications, a timely execution of tasks is essential. Hence, running multiple applications on the same system, scheduling with the capability of task preemption and prioritization becomes mandatory. Using GPUs as accelerators in this domain, imposes new challenges since GPU's common FIFO scheduling does not support task prioritization and preemption. As a remedy, this paper investigates the employment of resource management and scheduling techniques for applications from the medical domain for GPU accelerators. A scheduler supporting both, priority-based and LDF scheduling is added to the system such that high-priority tasks can interrupt tasks already enqueued for execution. The scheduler is capable of utilizing multiple GPUs in a system to minimize the average response time of applications. Moreover, it supports simultaneous execution of multiple tasks to hide data transfers latencies. We show that the scheduler interrupts scheduled and already enqueued applications to fulfill the timing requirements of high-priority dynamic tasks.

[1]  Satoshi Matsuoka,et al.  NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[2]  Jürgen Teich,et al.  Frameworks for GPU Accelerators: A comprehensive evaluation using 2D/3D image registration , 2011, 2011 IEEE 9th Symposium on Application Specific Processors (SASP).

[3]  Hiroaki Kobayashi,et al.  CheCUDA: A Checkpoint/Restart Tool for CUDA Applications , 2009, 2009 International Conference on Parallel and Distributed Computing, Applications and Technologies.

[4]  Tor M. Aamodt,et al.  Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow , 2007, 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2007).

[5]  Laurie A. Smith King,et al.  Vforce: An Extensible Framework for Reconfigurable Supercomputing , 2007, Computer.

[6]  Jürgen Weese,et al.  Voxel-based 2-D/3-D registration of fluoroscopy images and CT scans for image-guided surgery , 1997, IEEE Transactions on Information Technology in Biomedicine.

[7]  Tom R. Halfhill NVIDIA's Next-Generation CUDA Compute and Graphics Architecture, Code-Named Fermi, Adds Muscle for Parallel Processing , 2009 .

[8]  Giorgio Buttazzo,et al.  Hard Real-Time Computing Systems: Predictable Scheduling Algorithms and Applications , 1997 .

[9]  Jürgen Teich,et al.  Detector defect correction of medical images on graphics processors , 2011, Medical Imaging.

[10]  Timo Aila,et al.  Understanding the efficiency of ray traversal on GPUs , 2009, High Performance Graphics.

[11]  André Brinkmann,et al.  Cooperative multitasking for heterogeneous accelerators in the Linux Completely Fair Scheduler , 2011, ASAP 2011 - 22nd IEEE International Conference on Application-specific Systems, Architectures and Processors.