Multi-site Scheduling with Multiple Job Reservations and Forecasting Methods

Most previous research on job scheduling for multi-site distributed systems does not take into consideration behavioral trends when applying a scheduling method. In this paper, we address the scheduling of parallel jobs in a multi-site environment, where each site has a homogeneous cluster of non-dedicated processors where users submit jobs to be executed locally, while at the same time, external parallel jobs are submitted to a meta-scheduler. We use collected load data to model the performance trends that each site exhibits in order to predict load values via time-series analysis and then perform scheduling based on the predicted values.

[1]  P. Sadayappan,et al.  Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement , 2003, JSSPP.

[2]  Peter E. Strazdins,et al.  A comparison of local and gang scheduling on a Beowulf cluster , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).

[3]  Yu-Kwong Kwok,et al.  On Exploiting Heterogeneity for Cluster Based Parallel Multithreading Using Task Duplication , 2003, The Journal of Supercomputing.

[4]  Michael Mitzenmacher,et al.  How Useful Is Old Information? , 2000, IEEE Trans. Parallel Distributed Syst..

[5]  Shuichi Ichikawa,et al.  An execution-time estimation model for heterogeneous clusters , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[6]  Uwe Schwiegelshohn,et al.  On Advantages of Grid Computing for Parallel Job Scheduling , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[7]  Jorge L. V. Barbosa,et al.  Resource scheduling on grid: handling uncertainty , 2003, Proceedings. First Latin American Web Congress.

[8]  P. Sadayappan,et al.  Distributed job scheduling on computational Grids using multiple simultaneous requests , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[9]  Jay Lepreau,et al.  Computer System Performance Problem Detection Using Time Series Model , 1993, USENIX Summer.

[10]  Achim Streit,et al.  Enhanced Algorithms for Multi-site Scheduling , 2002, GRID.

[11]  R. Deal Simulation Modeling and Analysis (2nd Ed.) , 1994 .

[12]  Ishfaq Ahmad,et al.  Optimal task assignment in heterogeneous distributed computing systems , 1998, IEEE Concurr..

[13]  Helen D. Karatza Gang scheduling in a distributed system under processor failures and time-varying gang size , 2003, The Ninth IEEE Workshop on Future Trends of Distributed Computing Systems, 2003. FTDCS 2003. Proceedings..

[14]  Averill M. Law,et al.  Simulation Modeling and Analysis , 1982 .

[15]  Michael Mitzenmacher,et al.  The Asymptotics of Selecting the Shortest of Two, Improved , 1999 .

[16]  J.M. Schopf,et al.  Stochastic Scheduling , 1999, ACM/IEEE SC 1999 Conference (SC'99).