Abstract The problem of scheduling on-line a stream of multi-tasked jobs on a set of parallel processors is considered. The goal is to minimize the average sojourn time experienced by each individual job in the system. The arriving jobs are comprised of parallel applications, each consisting of multiple independent tasks that are instantaneously assigned to processor queues upon its arrival to the system. The processors independently and concurrently service the tasks in their local queues which are first-come-first-served (FCFS). A job can depart from the system only when all its tasks have been executed and reassembled into the original single-job structure; until then, completed tasks are placed in a reassembly queue. All tasks have i.i.d. memoryless exponential service times and any task can be processed by any processor. Migration of tasks between queues — after their initial allocation — is not allowed. This model captures the main features of multiprocessor computer systems executing parallel programs. The key scheduling issue is as follows. When some queue backlogs are small, an incoming job should spread its tasks to those lightly loaded queues in order to take advantage of the parallel processing gain and lower its processing delay. On the other hand, when all queues are fairly congested, the job should schedule all its tasks sequentially in a single queue to avoid excessive reassembly delay (due to backlog fluctuations) and lower its task synchronization delay. In this paper, the trade-off between these two objectives is quantified and it is shown that the optimal schedule’s structure is based on backlog thresholds. Moreover, an off-line mechanism for determining these thresholds is provided, and further characteristics of the optimal scheduling policy under special cases is discussed. Finally, asymptotics and approximations for systems comprised of a large number of processors are considered.
[1]
Bruce S. Davie,et al.
Computer Networks: A Systems Approach
,
1996
.
[2]
Asser N. Tantawi,et al.
Performance analysis of parallel processing systems
,
1987,
SIGMETRICS '87.
[3]
I. Olkin,et al.
Inequalities: Theory of Majorization and Its Applications
,
1980
.
[4]
Asser N. Tantawi,et al.
Approximate Analysis of Fork/Join Synchronization in Parallel Queues
,
1988,
IEEE Trans. Computers.
[5]
F. Baccelli,et al.
The fork-join queue and related systems with synchronization constraints: stochastic ordering and computable bounds
,
1989,
Advances in Applied Probability.
[6]
D. Yao,et al.
Scheduling parallel processors: Structural properties and optimal policies
,
1996
.
[7]
Donald F. Towsley,et al.
Acyclic fork-join queuing networks
,
1989,
JACM.
[8]
P. Konstantopoulos,et al.
Stationary and stability of fork-join networks
,
1989,
Journal of Applied Probability.
[9]
Armand M. Makowski,et al.
Optimal Scheduling for a Distributed Parallel Processing Model
,
1992
.
[10]
J. D. T. Oliveira,et al.
The Asymptotic Theory of Extreme Order Statistics
,
1979
.
[11]
François Baccelli,et al.
On the execution of parallel programs on multiprocessor systems—a queuing theory approach
,
1990,
JACM.
[12]
J. Angus.
The Asymptotic Theory of Extreme Order Statistics
,
1990
.