Time-adaptive self stabilization

Time-Adaptive Self Stabilization Shay Kutten Boaz Patt-Shamir* Dept. of Industrial Engineering College of Computer Science The Technion Northeastern University and boez~ccs. neu. edu IBM T.J, Watson Research Center kutten@ie. technion. ac. il We study the scenario where a transient fault hit ~ of the n nodes of a dktributed system by corrupting their state. We consider the basic problem of persistent bit, where the system is required to maintain a value in the face of transient failures by means of replication. We give an algorithm to recover the value quickly: the value of the bit is recovered at all nodes in 0(~) time units for any unknown value of j < n/2. Moreover, complete state quiescence occurs in O(dhun) time units, where diam denotes the diameter of the network. This means that the value persists indefinitely so long as any ~ < n/2 faults are followed by fl(diam) fault-free time units. We prove lower bounds which show that both time bounds are asymptotically optimal. Using the algorithm for persistent bit, we present a general transformer which takes a distributed nonreactive, non-stabilizing protocol P, and produces a self-stabilizing protocol P’ which solves the problem P solves, with the additional property that if the number of faults that hit the system after stabilization is j, for any unknown ~ < n/2, then the output of P’ regains stability in O(f) time units, and the state stabilizes in O(dlam) time units. ● Research supported by DARPA and Rome Laboratory under agreement F30602-96-0239. Permission to make digitalllmrd copies of fill or part of [his nmleriol for personal or clo.wroum use is granted wiU1outfee pmvi(ied UmtUw copies are not made or dislrihutcd for profit or commercin I w{vnnl~tgc.UWcopyrigbl notice. the tine oftlw publication and iL$date oppew’, *MInotice is given Uralcopyright is hy permission oftbe AChl, IIW To copy oUwn\,ise, to republish, to post on servers or to redisirilw! c to IISIS,requires specific Pemlissiotl oncVorfee 1997 F’(]D~ 97 Simla Bdmm (‘A lJ,V1 Copyright 1997 ACM 0-89791 -952 -1/97/8..$3 50

[1]  Shmuel Katz,et al.  Self-stabilizing extensions for message-passing systems , 1990, PODC '90.

[2]  Shlomi Dolev,et al.  SuperStabilizing protocols for dynamic distributed systems , 1995, PODC '95.

[3]  Shay Kutten,et al.  Fault-local distributed mending (extended abstract) , 1995, PODC '95.

[4]  Edsger W. Dijkstra,et al.  Self-stabilizing systems in spite of distributed control , 1974, CACM.

[5]  Shlomi Dolev,et al.  Memory requirements for silent stabilization , 1999, Acta Informatica.

[6]  Danny Dolev,et al.  Proceedings of the Israel Symposium on Theory of Computing and Systems , 1992 .

[7]  Shay Kutten,et al.  Fault-Local Distributed Mending , 1999, J. Algorithms.

[8]  Boaz Patt-Shamir,et al.  Self-stabilization by local checking and correction , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[9]  Arobinda Gupta,et al.  Fault-containing self-stabilizing algorithms , 1996, PODC '96.

[10]  Michael E. Saks,et al.  Sphere packing and local majorities in graphs , 1993, [1993] The 2nd Israel Symposium on Theory and Computing Systems.

[11]  Moti Yung,et al.  Non-Exploratory Self-Stabilization for Constant-Space Symmetry-Breaking , 1994, ESA.

[12]  Moti Yung,et al.  Memory-Efficient Self Stabilizing Protocols for General Networks , 1990, WDAG.

[13]  Boaz Patt-Shamir,et al.  Time optimal self-stabilizing synchronization , 1993, STOC.

[14]  Boaz Patt-Shamir,et al.  Self-Stabilization by Local Checking and Global Reset (Extended Abstract) , 1994, WDAG.