Service Middleware for Self-Managing Large-Scale Systems

Resource management poses particular challenges in large-scale systems, such as server clusters that simultaneously process requests from a large number of clients. A resource management scheme for such systems must scale both in the in the number of cluster nodes and the number of applications the cluster supports. Current solutions do not exhibit both of these properties at the same time. Many are centralized, which limits their scalability in terms of the number of nodes, or they are decentralized but rely on replicated directories, which also reduces their ability to scale. In this paper, we propose novel solutions to request routing and application placement- two key mechanisms in a scalable resource management scheme. Our solution to request routing is based on selective update propagation, which ensures that the control load on a cluster node is independent of the system size. Application placement is approached in a decentralized manner, by using a distributed algorithm that maximizes resource utilization and allows for service differentiation under overload. The paper demonstrates how the above solutions can be integrated into an overall design for a peer-to-peer management middleware that exhibits properties of self-organization. Through complexity analysis and simulation, we show to which extent the system design is scalable. We have built a prototype using accepted technologies and have evaluated it using a standard benchmark. The testbed measurements show that the implementation, within the parameter range tested, operates efficiently, quickly adapts to a changing environment and allows for effective service differentiation by a system administrator.

[1]  Germán S. Goldszmidt,et al.  Scaling Internet services by dynamic allocation of connections , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[2]  Abhishek Chandra,et al.  Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers , 2003 .

[3]  John Davin,et al.  Hierarchical variable ordering for distributed constraint optimization , 2006, AAMAS '06.

[4]  Christopher Stewart,et al.  Profile-Driven Component Placement for Cluster-Based Online Services , 2004, IEEE Distributed Syst. Online.

[5]  Constantin Adam,et al.  A middleware design for large-scale clusters offering multiple services , 2006, IEEE Transactions on Network and Service Management.

[6]  Constantin Adam,et al.  A Service Middleware that Scales in System Size and Applications , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[7]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[8]  Colin Low Decentralised application placement , 2005, Future Gener. Comput. Syst..

[9]  Michele Colajanni,et al.  Kernel-based Web switches providing content-aware routing , 2003, Second IEEE International Symposium on Network Computing and Applications, 2003. NCA 2003..

[10]  Deeparnab Chakrabarty,et al.  Knapsack Problems , 2008 .

[11]  Xiaoyun Zhu,et al.  Resource Assignment for Large-Scale Computing Utilities using Mathematical Programming , 2003 .

[12]  Asser N. Tantawi,et al.  Dynamic placement for clustered web applications , 2006, WWW '06.

[13]  Dirk Beyer,et al.  Quartermaster - a resource utility system , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[14]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[15]  Victor R. Lesser,et al.  Multi-agent based peer-to-peer information retrieval systems with concurrent search sessions , 2006, AAMAS '06.

[16]  Heiko Schuldt,et al.  Decentralized coordination of transactional processes in peer-to-peer environments , 2005, CIKM '05.

[17]  Maarten van Steen,et al.  CYCLON: Inexpensive Membership Management for Unstructured P2P Overlays , 2005, Journal of Network and Systems Management.

[18]  Supranamaya Ranjan,et al.  Wide area redirection of dynamic content by Internet data centers , 2004, IEEE INFOCOM 2004.

[19]  John Moy,et al.  OSPF Version 2 , 1998, RFC.

[20]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[21]  Prasant Mohapatra,et al.  Session-based overload control in QoS-aware Web servers , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[22]  George S. Fishman,et al.  Discrete-event simulation , 2001 .

[23]  M. Johansson,et al.  Distributed Resource Allocation Strategies for Achieving Quality of Service in Server Clusters , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[24]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[25]  Constantin Adam,et al.  A Middleware for Self-Managing Large-Scale Systems , 2006 .

[26]  Xiaoyun Zhu,et al.  Optimal resource assignment in Internet data centers , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[27]  Constantin Adam,et al.  Adaptable server clusters with QoS objectives , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[28]  Constantin Adam,et al.  Implementation and Evaluation of a Middleware for Self-Organizing Decentralized Web Services , 2006, SelfMan.

[29]  Asser N. Tantawi,et al.  Dynamic estimation of CPU demand of web traffic , 2006, valuetools '06.

[30]  Chunqiang Tang,et al.  GoCast: gossip-enhanced overlay multicast for fast and dependable group communication , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[31]  Rolf Stadler,et al.  Robust monitoring of network-wide aggregates through gossiping , 2007, IEEE Transactions on Network and Service Management.

[32]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[33]  Asser N. Tantawi,et al.  Performance management for cluster-based web services , 2005, IEEE Journal on Selected Areas in Communications.

[34]  Kagan Tumer,et al.  Efficient agent-based cluster ensembles , 2006, AAMAS '06.

[35]  Sven Buchholz,et al.  Replica placement in adaptive content distribution networks , 2004, SAC '04.

[36]  Philip S. Yu,et al.  The state of the art in locally distributed Web-server systems , 2002, CSUR.

[37]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[38]  Karen L. Myers,et al.  Continuous refinement of agent resource estimates , 2006, AAMAS '06.

[39]  Zhi-Li Zhang,et al.  Distributed Algorithm for Service Replication in Service Overlay Network , 2004, NETWORKING.

[40]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[41]  Peter Druschel,et al.  Pastry: Scalable, distributed object location and routing for large-scale peer-to- , 2001 .