Transient Fault Tolerance in Mobile Agent Based Computing

Agent technology is emerging as a new paradigm in the areas of distributed and mobile computing. Agent is a computational entity capable of relocating code, data and execution- state to another host. Mobile agents' code often experience transient faults resulting in a partial or complete loss during execution at a host machine. Protocol for fault – tolerant agent prevents a partial or complete loss of a mobile agent at a host. This article describes how to detect and recover random transient bit-errors at an agent before starting its execution at a host after its arrival at a host, in order to maintain availability of an agent by comparing an agent's states by using time and space redundancy. In this proposed self-repair approach, a software fix for fault – tolerance exists along with an agent. This generalized scheme is useful for recovering any kind of distributed agents against hardware transient faults (at a host). This paper presents a fault-tolerance mechanism for mobile agents that attempts to detect and correct any bit errors that may occur at a host after agents' mobility on a Web Agent-based Service Providing (WASP) platform. Though in modern distributed systems, the communication stack handles any bit errors and error correction is used on multiple layers (for example, in transport layer), the proposed approach is intended to be a supplement one to the conventional error detecting and correcting codes.

[1]  Fred B. Schneider,et al.  Towards Fault-Tolerant and Secure Agentry , 1997, WDAG.

[2]  Fritz Hohl,et al.  Mobile Agent Systems: What is Missing? , 1997 .

[3]  Luís Moura Silva,et al.  Fault-tolerant execution of mobile agents , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[4]  Levent Ertaul,et al.  Mobile Agent Security , 2006, Security and Management.

[5]  Mahadev Satyanarayanan,et al.  Experience with Disconnected Operation in a Mobile Computing Environment , 1994, Mobidata.

[6]  David Wong,et al.  Java-based mobile agents , 1999, CACM.

[7]  Stefan Fünfrocken,et al.  How to Integrate Mobile Agents into Web Servers , 1997, WETICE.

[8]  Markus Endler,et al.  Coordinating mobile agents through the Broadcast Channel , 2001 .

[9]  Goutam Kumar Saha Fault management in mobile computing , 2003, UBIQ.

[10]  Markus Straßer,et al.  Providing Reliable Agents for Electronic Commerce , 1998, Trends in Distributed Systems for Electronic Commerce.

[11]  Radu Popescu-Zeletin,et al.  Mobile Agent-Based Transactions in Open Environments , 2000 .

[12]  Hartmut Vogler,et al.  An approach for mobile agent security and fault tolerance using distributed transactions , 1997, Proceedings 1997 International Conference on Parallel and Distributed Systems.

[13]  R. Tolksdorf Workshop On Collaborative Agents In Distributed Web Applications , 1997, Proceedings of IEEE 6th Workshop on Enabling Technologies: Infrastructure for Collaborative Enterprises.

[14]  André Schiper,et al.  FATOMAS-a fault-tolerant mobile agent system based on the agent-dependent approach , 2001, 2001 International Conference on Dependable Systems and Networks.

[15]  Henrique Madeira,et al.  Experimental evaluation of the fail-silent behaviour in programs with consistency checks , 1996, Proceedings of Annual Symposium on Fault Tolerant Computing.

[16]  Jacob A. Abraham,et al.  Algorithm-Based Fault Tolerance for Matrix Operations , 1984, IEEE Transactions on Computers.

[17]  Louise E. Moser,et al.  MAgNET: Mobile Agents for Networked Electronic Trading , 1999, IEEE Trans. Knowl. Data Eng..

[18]  Cornelia Boldyreff,et al.  Mobile agent fault tolerance for information retrieval applications: an exception handling approach , 2003, The Sixth International Symposium on Autonomous Decentralized Systems, 2003. ISADS 2003..