Real-Time Divisible Load Scheduling for Cluster Computing

Cluster computing has emerged as a new paradigm for solving large-scale problems. To enhance QoS and provide performance guarantees in cluster computing environments, various real-time scheduling algorithms and workload models have been investigated. Computational loads that can be arbitrarily divided into independent pieces represent many real-world applications. Divisible load theory (DLT) provides insight into distribution strategies for such computations. However, the problem of providing performance guarantees to divisible load applications has not yet been systematically studied. This paper investigates real-time scheduling algorithms for divisible loads in a cluster environment. We study the design parameters that affect the performance of these algorithms and the scenarios in which the choice of these parameters has a significant effect. A novel algorithmic approach integrating DLT and EDF (earliest deadline first) scheduling is proposed. For comparison, we also propose a heuristic algorithm. Extensive experimental results show that the application of DLT to real-time cluster-based scheduling leads to significantly better scheduling approaches.
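To make the idea of combining DLT with EDF concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes a homogeneous cluster in which a head node sends load fractions to the workers one after another, with a per-unit transmission cost `cms` and a per-unit processing cost `cps`; these names, the `Task` type, and the policy of giving every task all `n` nodes are illustrative assumptions. Under that model the standard DLT partition makes all nodes finish at the same instant, and an EDF-ordered admission test simply checks each task's finish time against its deadline.

```python
from dataclasses import dataclass


@dataclass
class Task:
    arrival: float   # time the task becomes available
    load: float      # total divisible load (data size)
    deadline: float  # absolute deadline


def dlt_fractions(n, cms, cps):
    """Load fractions for n identical nodes fed sequentially by a head node.

    Assumes cms > 0. Equal finish times require
    alpha[i+1] = beta * alpha[i] with beta = cps / (cms + cps).
    """
    beta = cps / (cms + cps)
    first = (1 - beta) / (1 - beta ** n)
    return [first * beta ** i for i in range(n)]


def finish_times(load, fractions, cms, cps):
    """Finish time of each node: it computes after its chunk has fully arrived."""
    times, sent = [], 0.0
    for alpha in fractions:
        sent += alpha * load * cms              # transmissions are serialized
        times.append(sent + alpha * load * cps)
    return times


def execution_time(load, n, cms, cps):
    """Closed-form completion time of the whole load on n nodes."""
    return load * (cms + cps) * dlt_fractions(n, cms, cps)[0]


def edf_admit(tasks, n, cms, cps):
    """Simplified admission test (illustrative policy, not from the paper):
    run tasks in EDF order, non-preemptively, each on all n nodes."""
    clock = 0.0
    for task in sorted(tasks, key=lambda t: t.deadline):
        start = max(clock, task.arrival)
        clock = start + execution_time(task.load, n, cms, cps)
        if clock > task.deadline:
            return False
    return True
```

Giving every task the whole cluster is only one possible design choice; assigning each task the smallest number of nodes that still meets its deadline is another, and the closed-form `execution_time` can be reused to search for that number.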
