Combining Exception Handling and Replication for Improving the Reliability of Agent Software

Exception handling and replication are two complementary mechanisms that increase software reliability. Exception handling helps programmers in controlling situations in which the normal execution flow of a program cannot continue. Replication handles system fail- ures through redundancy. Combining both techniques is a first step to- wards building a trustworthy software engineering framework. This paper presents some of the results from the Facoma project. It proposes the specification of an exception handling system for replicated agents as an adaptation of the Sage proposal. It then describes its implementation in the Dimax replicated agent environment.

[1]  Santosh K. Shrivastava,et al.  Exception Handling in Replicated Systems with Voting , 1986 .

[2]  Alexander Romanovsky An exception handling framework for N-version programming in object-oriented systems , 2000, Proceedings Third IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC 2000) (Cat. No. PR00607).

[3]  Jessie Dedecker,et al.  Ambient-Oriented Exception Handling , 2006, Advanced Topics in Exception Handling Techniques.

[4]  Brian Randell,et al.  Error recovery in asynchronous systems , 1986, IEEE Transactions on Software Engineering.

[5]  John B. Goodenough,et al.  Exception handling: issues and a proposed notation , 1975, CACM.

[6]  Richard Carlsson,et al.  Erlang's exception handling revisited , 2004, ERLANG '04.

[7]  Shinichi Honiden,et al.  A Definition of Exceptions in Agent-Oriented Computing , 2006, ESAW.

[8]  Michael Wooldridge,et al.  An Introduction to MultiAgent Systems John Wiley & Sons , 2002 .

[9]  Michael Wooldridge,et al.  An Introduction to MultiAgent Systems, Second Edition , 2009 .

[10]  Jie Xu,et al.  From Recovery Blocks to Concurrent Atomic Actions , 2003 .

[11]  Valérie Issarny,et al.  An exception handling model for parallel programming and its verification , 1991, SIGSOFT '91.

[12]  Alexander Romanovsky,et al.  Exception handling in coordination-based mobile environments , 2005, 29th Annual International Computer Software and Applications Conference (COMPSAC'05).

[13]  Anand R. Tripathi,et al.  The guardian model and primitives for exception handling in distributed systems , 2004, IEEE Transactions on Software Engineering.

[14]  Olivier Marin,et al.  DimaX: a fault-tolerant multi-agent platform , 2006, SELMAS '06.

[15]  Carlos José Pereira de Lucena,et al.  Exception Handling in Context-Aware Agent Systems: A Case Study , 2007, SELMAS.

[16]  Christelle Urtado,et al.  A proposition for exception handling in multi-agent systems , 2003 .

[17]  Jørgen Lindskov Knudsen,et al.  Advanced Topics in Exception Handling Techniques (the book grow out of ECOOP workshops) , 2006, Advanced Topics in Exception Handling Techniques.

[18]  Ronald A. Olsson,et al.  Exception Handling during Asynchronous Method Invocation (Research Note) , 2002, Euro-Par.

[19]  Christelle Urtado,et al.  Exception Handling and Asynchronous Active Objects: Issues and Proposal , 2006, Advanced Topics in Exception Handling Techniques.

[20]  Olivier Marin,et al.  Towards reliable multi-agent systems: An adaptive replication mechanism , 2010, Multiagent Grid Syst..

[21]  Chouki Tibermacine,et al.  Specification of an exception handling system for a replicated agent environment , 2008, WEH '08.

[22]  Jean-Pierre Briot,et al.  Adaptive replication of large-scale multi-agent systems: towards a fault-tolerant multi-agent platform , 2005, SELMAS '05.

[23]  Ramakrishna Kotla,et al.  Zyzzyva , 2007, SOSP.

[24]  Pierre Sens,et al.  DARX - a framework for the fault-tolerant support of agent software , 2003, 14th International Symposium on Software Reliability Engineering, 2003. ISSRE 2003..