Distributed Utilization Control for Real-Time Clusters with Load Balancing

Previous years have seen rapid growth of online services that rely on large-scale server clusters to handle high volume of requests. Such clusters must adaptively control the CPU utilizations of many processors in order to maintain desired soft real-time performance and prevent system overload in face of unpredictable workloads. This paper presents DUC-LB, a novel distributed utilization control algorithm for cluster-based soft real-time applications. Compared to earlier works on utilization control, a distinguishing feature of DUC-LB is its capability to handle system dynamics caused by load balancing, which is a common and essential component of most clusters today. Simulation results and control-theoretic analysis demonstrate that DUC-LB can provide robust utilization control and effective load balancing in large-scale clusters

[1]  Chung Laung Liu,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1989, JACM.

[2]  Tse-yun Feng,et al.  A Survey of Interconnection Networks , 1981, Computer.

[3]  Thomas L. Casavant,et al.  A Taxonomy of Scheduling in General-Purpose Distributed Computing Systems , 1988, IEEE Trans. Software Eng..

[4]  James W. Layland,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1989, JACM.

[5]  George Cybenko,et al.  Dynamic Load Balancing for Distributed Memory Multiprocessors , 1989, J. Parallel Distributed Comput..

[6]  Jacques E. Boillat,et al.  Load Balancing and Poisson Equation in a Graph , 1990, Concurr. Pract. Exp..

[7]  A. Heirich,et al.  A Parabolic Theory of Load Balance , 1993 .

[8]  Anthony P. Reeves,et al.  Strategies for Dynamic Load Balancing on Highly Parallel Computers , 1993, IEEE Trans. Parallel Distributed Syst..

[9]  Kang G. Shin,et al.  Design and Evaluation of Effective Load Sharing in Distributed Real-Time Systems , 1994, IEEE Trans. Parallel Distributed Syst..

[10]  Francis C. M. Lau,et al.  Optimal Parameters for Load Balancing with the Diffusion Method in Mesh Networks , 1994, Parallel Process. Lett..

[11]  Dominik Henrich,et al.  The Liquid Model Load Balancing Method , 1996, Parallel Algorithms Appl..

[12]  J. Doyle,et al.  Robust and optimal control , 1995, Proceedings of 35th IEEE Conference on Decision and Control.

[13]  Samuel T. Chanson,et al.  Hydrodynamic Load Balancing , 1999, IEEE Trans. Parallel Distributed Syst..

[14]  R. D'Andrea,et al.  Software for modeling, analysis, and control design for multidimensional systems , 1999, Proceedings of the 1999 IEEE International Symposium on Computer Aided Control System Design (Cat. No.99TH8404).

[15]  Calton Pu,et al.  A feedback-driven proportion allocator for real-rate scheduling , 1999, OSDI '99.

[16]  Sang Hyuk Son,et al.  The case for feedback control real-time scheduling , 1998, Proceedings of 11th Euromicro Conference on Real-Time Systems. Euromicro RTS'99.

[17]  Y. C. Tay,et al.  Load Sharing in Distributed Multimedia-on-Demand Systems , 2000, IEEE Trans. Knowl. Data Eng..

[18]  Sang Hyuk Son,et al.  Feedback control scheduling in distributed real-time systems , 2001, Proceedings 22nd IEEE Real-Time Systems Symposium (RTSS 2001) (Cat. No.01PR1420).

[19]  R. S. Chandra,et al.  Control of spatially interconnected discrete-time systems , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..

[20]  Reda A. Ammar,et al.  Scheduling real time parallel structure on cluster computing , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[21]  Jonathan Walpole,et al.  Analysis of a reservation-based feedback scheduler , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[22]  Geir E. Dullerud,et al.  Distributed control design for spatially interconnected systems , 2003, IEEE Trans. Autom. Control..

[23]  Sang Hyuk Son,et al.  QoS management in replicated real-time databases , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[24]  Nobuyuki Yamasaki,et al.  A Task Migration Scheme for High Performance Real-Time Cluster System , 2003, Computers and Their Applications.

[25]  G. Manimaran,et al.  Double-Loop Feedback-Based Scheduling Approach for Distributed Real-Time Systems , 2003, HiPC.

[26]  Chenyang Lu,et al.  Feedback performance control in software services , 2003 .

[27]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[28]  Frank Mueller,et al.  Feedback EDF scheduling exploiting dynamic voltage scaling , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[29]  Cédric Langbort,et al.  Distributed control design for systems interconnected over an arbitrary graph , 2004, IEEE Transactions on Automatic Control.

[30]  Jonathan Walpole,et al.  Real-rate scheduling , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[31]  Yixin Diao,et al.  Incorporating cost of control into the design of a load balancing controller , 2004, Proceedings. RTAS 2004. 10th IEEE Real-Time and Embedded Technology and Applications Symposium, 2004..

[32]  A. Amin,et al.  Scheduling real time parallel structures on cluster computing with possible processor failures , 2004, Proceedings. ISCC 2004. Ninth International Symposium on Computers And Communications (IEEE Cat. No.04TH8769).

[33]  Sang Hyuk Son,et al.  Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms* , 2001, Real-Time Systems.

[34]  Chenyang Lu,et al.  End-to-end utilization control in distributed real-time systems , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[35]  Chenyang Lu,et al.  Decentralized utilization control in distributed real-time systems , 2005, 26th IEEE International Real-Time Systems Symposium (RTSS'05).

[36]  Chenyang Lu,et al.  Feedback utilization control in distributed real-time systems with end-to-end tasks , 2005, IEEE Transactions on Parallel and Distributed Systems.

[37]  Jitender S. Deogun,et al.  Real-Time Divisible Load Scheduling for Cluster Computing , 2007, 13th IEEE Real Time and Embedded Technology and Applications Symposium (RTAS'07).

[38]  Chenyang Lu,et al.  DEUCON: Decentralized End-to-End Utilization Control for Distributed Real-Time Systems , 2007, IEEE Transactions on Parallel and Distributed Systems.