Paper Information - Fault Tolerance: Why Should I Pay for It?

Fault Tolerance: Why Should I Pay for It?

Fault-tolerant systems are not as widely used today as one might expect from an analysis of the costs of failures. System developers must consider other factors as well: where should development dollars be spent for maximum leverage? Will development in one area (e.g., fault tolerance) impede development in others? Development of fault-tolerance techniques that are orthogonal to other development efforts must be a high priority. Market forces are driving a number of new technologies into products; our analysis suggests that these new technologies will change the trade-offs in both performance cost and development cost.

Barry J. Gleeson

[1] David B. Johnson, et al. Sender-Based Message Logging, 1987.

[2] David J. DeWitt, et al. Parallel database systems: the future of high performance database systems, 1992, CACM.

[3] André Schiper, et al. Lightweight causal and atomic group multicast, 1991, TOCS.

[4] Robert S. Swarz, et al. The theory and practice of reliable system design, 1982.

[5] Bruce J. Walker, et al. The LOCUS Distributed System Architecture, 1986.

[6] Nancy A. Lynch, et al. Impossibility of distributed consensus with one faulty process, 1985, JACM.

[7] Goetz Graefe, et al. Encapsulation of parallelism in the Volcano query processing system, 1990, SIGMOD '90.

[8] Kenneth P. Birman, et al. Using process groups to implement failure detection in asynchronous environments, 1991, PODC '91.

[9] R. Freiburghouse. Making processing fail-safe, 1982.

[10] Stefano Ceri, et al. Distributed Databases: Principles and Systems, 1984.

[11] Wolfgang Graetsch, et al. Fault tolerance under UNIX, 1989, TOCS.