DIOS++: A Framework for Rule-Basedn Autonomic Management of Distributed Scientific Applications

This paper presents the design, prototype implementation and experimental evaluation of DIOS++, an infrastructure for enabling rule based autonomic adaptation and control of distributed scientific applications. DIOS++ provides: (1) abstractions for enhancing existing application objects with sensors and actuators for runtime interrogation and control, (2) a control network that connects and manages the distributed sensors and actuators, and enables external discovery, interrogation, monitoring and manipulation of these objects at runtime, and (3) a distributed rule engine that enables the runtime definition, deployment and execution of rules for autonomic application management. The framework is currently being used to enable autonomic monitoring and control of a wide range of scientific applications including oil reservoir, compressible turbulence and numerical relativity simulations.

[1]  Cristina V. Lopes,et al.  Aspect-oriented programming , 1999, ECOOP Workshops.

[2]  Hector Garcia-Molina,et al.  Incentives for Combatting Freeriding on P2P Networks , 2003, Euro-Par.

[3]  Salim Hariri,et al.  Catalina: A Smart Application Control and Management Environement , 2000 .

[4]  Bernd Mohr,et al.  KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Programs , 2003, Euro-Par.

[5]  Kuzman Ganchev,et al.  Nswap: A Network Swapping Module for Linux Clusters , 2003, Euro-Par.

[6]  Hamid Pirahesh,et al.  A rule engine for query transformation in Starburst and IBM DB2 C/S DBMS , 1997, Proceedings 13th International Conference on Data Engineering.

[7]  Gábor Terstyánszky,et al.  Demonstration of P-GRADE Job-Mode for the Grid , 2003, Euro-Par.

[8]  Manish Parashar,et al.  Enabling Peer-to-Peer Interactions for Scientific Applications on the Grid , 2003, Euro-Par.

[9]  Gregor Kiczales,et al.  Aspect-oriented programming , 1996, CSUR.

[10]  Simon J. Cox,et al.  Short Message Service in a Grid-Enabled Computing Environment , 2003, Euro-Par.

[11]  Jeffrey S. Vetter,et al.  Autopilot: adaptive control of distributed applications , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[12]  Edward P. Katz A Multiple Rule Engine-Based Agent Control Architecture , 2002 .

[13]  Gail-Joon Ahn,et al.  Role-based access control on the web , 2001, TSEC.

[14]  Petr Jan Horn,et al.  Autonomic Computing: IBM's Perspective on the State of Information Technology , 2001 .

[15]  Manish Parashar,et al.  A distributed object infrastructure for interaction and steering , 2003, Concurr. Comput. Pract. Exp..