Event-Driven Management Automation in the ALBM Cluster System

One of major concerns on using a large-scale cluster system is manageability. The ALBM (Adaptive Load Balancing and Management) cluster system is an active cluster system that is scalable, reliable and manageable. We introduce the event-driven management automation by using the ALBM active cluster system. This architecture is based on an event management solution that is composed of event notification service, event channel service and event rule engine. Critical system state changes are generated as events and delivered to the event rule engine. According to the predefined management rules, some management actions are performed when a specific condition is satisfied. This event-driven mechanism can be used to manage the system automatically without human intervention. This event management solution can also be used for other advance management purpose, such as event correlation, root cause analysis, trend analysis or capacity planning. In order to support the management automation possibility, the experimental results are presented by comparing adaptive load balancing with non-adaptive load balancing mechanism. The adaptive scheduling algorithm that uses the event management automation results in a better performance compared to the non-adaptive ones for a realistic heavy-tailed workload.

[1]  Philip S. Yu,et al.  The state of the art in locally distributed Web-server systems , 2002, CSUR.

[2]  Rod Gamache,et al.  Windows NT Clustering Service , 1998, Computer.

[3]  Malgorzata Steinder,et al.  Yemanja-a layered event correlation engine for multi-domain server farms , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[4]  Jeffrey S. Chase Server switching: yesterday and tomorrow , 2001, Proceedings. The Second IEEE Workshop on Internet Applications. WIAPP 2001.

[5]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[6]  Mor Harchol-Balter,et al.  Task assignment with unknown duration , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[7]  Eunmi Choi,et al.  A Proactive Management Framework in Active Clusters , 2003, IWAN.

[8]  Byrav Ramamurthy,et al.  Scalable Web server clustering technologies , 2000, IEEE Netw..

[9]  Ayse Basar Bener,et al.  Web service standards and real business scenario challenges , 2003, 2003 Proceedings 29th Euromicro Conference.