Self-managing systems: a control theory foundation

The high cost of operating large computing installations has motivated a broad interest in reducing the need for human intervention by making systems self-managing. This paper explores the extent to which control theory can provide an architectural and analytic foundation for building self-managing systems, either from new components or layering on top of existing components. Further, we propose a deployable testbed for autonomic computing (DTAC) that we believe will reduce the barriers to addressing key research problems in autonomic computing. The initial DTAC architecture is described along with several problems that it can be used to investigate.

[1]  Katsuhiko Ogata,et al.  Modern Control Engineering , 1970 .

[2]  Joseph L. Hellerstein,et al.  YES/MVS and the Automation of Operations for Large Computer Complexes , 1986, IBM Syst. J..

[3]  Judy Kay,et al.  A fair share scheduler , 1988, CACM.

[4]  Srinivasan Keshav A control-theoretic approach to flow control , 1991, SIGCOMM 1991.

[5]  Joseph L. Hellerstein,et al.  Achieving Service Rate Objectives with Decay Usage Scheduling , 1993, IEEE Trans. Software Eng..

[6]  H. Vin,et al.  Start-time fair queueing: a scheduling algorithm for integrated services packet switching networks , 1996, SIGCOMM '96.

[7]  Stephen Yurkovich,et al.  Fuzzy Control , 1997 .

[8]  Harrick M. Vin,et al.  Start-time fair queueing: a scheduling algorithm for integrated services packet switching networks , 1997, TNET.

[9]  Donna N. Dillenberger,et al.  Adaptive Algorithms for Managing a Distributed Data Processing Workload , 1997, IBM Syst. J..

[10]  Eitan Altman,et al.  Congestion control as a stochastic control problem with action delays , 1999, Autom..

[11]  T.F. Abdelzaher,et al.  Web server QoS management by adaptive content delivery , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[12]  Sang Hyuk Son,et al.  Performance specifications and metrics for adaptive real-time systems , 2000, Proceedings 21st IEEE Real-Time Systems Symposium.

[13]  Donald F. Towsley,et al.  On designing improved controllers for AQM routers supporting TCP flows , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[14]  Joseph L. Hellerstein,et al.  Using Control Theory to Achieve Service Level Objectives In Performance Management , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[15]  Donald F. Towsley,et al.  A control theoretic analysis of RED , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[16]  Tarek F. Abdelzaher,et al.  Differentiated caching services; a control-theoretical approach , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[17]  Calton Pu,et al.  Modeling the effect of short-term rate variations on TCP-friendly congestion control behavior , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[18]  Yixin Diao,et al.  Optimizing Quality of Service Using Fuzzy Control , 2002, DSOM.

[19]  Chenyang Lu,et al.  ControlWare: a middleware architecture for feedback control of software performance , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[20]  Lui Sha,et al.  Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[21]  Yixin Diao,et al.  Using MIMO feedback control to enforce policies for interrelated metrics with application to the Apache Web server , 2002, NOMS 2002. IEEE/IFIP Network Operations and Management Symposium. ' Management Solutions for the New Communications World'(Cat. No.02CH37327).

[22]  Gail E. Kaiser,et al.  Kinesthetics eXtreme: an external infrastructure for monitoring distributed legacy systems , 2003, 2003 Autonomic Computing Workshop.

[23]  David Patterson,et al.  Self-repairing computers. , 2003, Scientific American.

[24]  Victor Chang,et al.  Managing the Performance Impact of Administrative Utilities , 2003, DSOM.

[25]  Joseph L. Hellerstein,et al.  Using Control Theory to Achieve Service Level Objectives In Performance Management , 2002, Real-Time Systems.

[26]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .

[27]  Yixin Diao,et al.  Throttling utilities in the IBM DB2 universal database server , 2004, Proceedings of the 2004 American Control Conference.

[28]  Ying Lu,et al.  Improved Prediction for Web Server Delay Control , 2004 .

[29]  Yixin Diao,et al.  Using MIMO linear control for load balancing in computing systems , 2004, Proceedings of the 2004 American Control Conference.

[30]  David Sinreich,et al.  An architectural blueprint for autonomic computing , 2006 .