Scalable Management — Technologies for Management of Large-Scale, Distributed Systems

Modern computing environments, such as enterprise data centers, Grids, and PlanetLab, introduce distributed services to address scalability, locality, and reliability. Web Services (WS), in particular, improve decoupling, decentralization, and autonomicity within distributed systems. Unfortunately, scale and decentralization complicate the management of distributed services, including their deployment, monitoring, and lifecycle maintenance. In this paper, we propose a new approach to managing large-scale distributed services, based on three artifacts: scalable publish-subscribe eventing, scalable WS-based deployment, and model-based management. We demonstrate that these techniques improve the manageability of services, enabling service developers to focus on service functionality rather than on management features.
