Service Middleware for

Resource management poses particular challenges in large-scale systems, such as server clusters that simultaneously process requests from a large number of clients. A resource management scheme for such systems must scale both in the in the number of cluster nodes and the number of applications the cluster supports. Current solutions do not exhibit both of these properties at the same time. Many are centralized, which limits their scalability in terms of the number of nodes, or they are decentralized but rely on replicated directories, which also reduces their ability to scale. In this paper, we propose novel solutions to request routing and application placement- two key mechanisms in a scalable resource management scheme. Our solution to request routing is based on selective update propagation, which ensures that the control load on a cluster node is independent of the system size. Application placement is approached in a decentralized manner, by using a distributed algorithm that maximizes resource utilization and allows for ser- vice differentiation under overload. The paper demonstrates how the above solutions can be integrated into an overall design for a peer-to-peer management middleware that exhibits properties of self-organization. Through complexity analysis and simulation, we show to which extent the system design is scalable. We have built a prototype using accepted technologies and have evaluated it using a standard benchmark. The testbed measurements show that the implementation, within the parameter range tested, operates efficiently, quickly adapts to a changing environment and allows for effective service differentiation by a system administrator.

[1]  Karen L. Myers,et al.  Continuous refinement of agent resource estimates , 2006, AAMAS '06.

[2]  Zhi-Li Zhang,et al.  Distributed Algorithm for Service Replication in Service Overlay Network , 2004, NETWORKING.

[3]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[4]  Maarten van Steen,et al.  CYCLON: Inexpensive Membership Management for Unstructured P2P Overlays , 2005, Journal of Network and Systems Management.

[5]  Constantin Adam,et al.  Adaptable server clusters with QoS objectives , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[6]  George S. Fishman,et al.  Discrete-event simulation , 2001 .

[7]  Abhishek Chandra,et al.  Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers , 2003 .

[8]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[9]  Philip S. Yu,et al.  The state of the art in locally distributed Web-server systems , 2002, CSUR.

[10]  John Davin,et al.  Hierarchical variable ordering for distributed constraint optimization , 2006, AAMAS '06.

[11]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[12]  Chunqiang Tang,et al.  GoCast: gossip-enhanced overlay multicast for fast and dependable group communication , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[13]  Germán S. Goldszmidt,et al.  Scaling Internet services by dynamic allocation of connections , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[14]  Christopher Stewart,et al.  Profile-Driven Component Placement for Cluster-Based Online Services , 2004, IEEE Distributed Syst. Online.

[15]  Michele Colajanni,et al.  Kernel-based Web switches providing content-aware routing , 2003, Second IEEE International Symposium on Network Computing and Applications, 2003. NCA 2003..

[16]  Constantin Adam,et al.  A middleware design for large-scale clusters offering multiple services , 2006, IEEE Transactions on Network and Service Management.

[17]  Constantin Adam,et al.  A Service Middleware that Scales in System Size and Applications , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[18]  Heiko Schuldt,et al.  Decentralized coordination of transactional processes in peer-to-peer environments , 2005, CIKM '05.

[19]  Kagan Tumer,et al.  Efficient agent-based cluster ensembles , 2006, AAMAS '06.

[20]  Sven Buchholz,et al.  Replica placement in adaptive content distribution networks , 2004, SAC '04.

[21]  M. Johansson,et al.  Distributed Resource Allocation Strategies for Achieving Quality of Service in Server Clusters , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[22]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[23]  Constantin Adam,et al.  A Middleware for Self-Managing Large-Scale Systems , 2006 .

[24]  Xiaoyun Zhu,et al.  Optimal resource assignment in Internet data centers , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[25]  Constantin Adam,et al.  Implementation and Evaluation of a Middleware for Self-Organizing Decentralized Web Services , 2006, SelfMan.

[26]  Asser N. Tantawi,et al.  Performance management for cluster based Web services , 2003 .

[27]  Dirk Beyer,et al.  Quartermaster - a resource utility system , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[28]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[29]  Victor R. Lesser,et al.  Multi-agent based peer-to-peer information retrieval systems with concurrent search sessions , 2006, AAMAS '06.

[30]  Abhishek Chandra,et al.  An observation-based approach towards self-managing Web servers , 2002, IEEE 2002 Tenth IEEE International Workshop on Quality of Service (Cat. No.02EX564).

[31]  Asser N. Tantawi,et al.  Dynamic estimation of CPU demand of web traffic , 2006, valuetools '06.

[32]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[33]  Rolf Stadler,et al.  Robust monitoring of network-wide aggregates through gossiping , 2007, IEEE Transactions on Network and Service Management.

[34]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[35]  Supranamaya Ranjan,et al.  Wide area redirection of dynamic content by Internet data centers , 2004, IEEE INFOCOM 2004.

[36]  John Moy,et al.  OSPF Version 2 , 1998, RFC.