Byzantine Fault Tolerance as a Service

In this paper, we argue for the need and benefits for providing Byzantine fault tolerance as a service to mission critical Web applications. In this new approach to Byzantine fault tolerance, an application server can partition the incoming requests into different domains for concurrent processing, decide which set of messages that should be totally ordered, or not at all, based its application semantics. This flexibility would reduce the end-to-end latency experienced by the clients and significantly increase the system throughput. Perhaps most importantly, we propose a middleware framework that provides a uniform interface to the applications so that they are not strongly tied to any particular Byzantine fault tolerance algorithm implementation.

[1]  Priya Narasimhan,et al.  Thema: Byzantine-fault-tolerant middleware for Web-service applications , 2005, 24th IEEE Symposium on Reliable Distributed Systems (SRDS'05).

[2]  Kenneth J. Goldman,et al.  Byzantine Fault-Tolerant Web Services for n-Tier and Service Oriented Architectures , 2008, 2008 The 28th International Conference on Distributed Computing Systems.

[3]  Ramakrishna Kotla,et al.  Zyzzyva: speculative byzantine fault tolerance , 2007, TOCS.

[4]  Louise E. Moser,et al.  Trustworthy Coordination of Web Services Atomic Transactions , 2012, IEEE Transactions on Parallel and Distributed Systems.

[5]  Ramakrishna Kotla,et al.  High throughput Byzantine fault tolerance , 2004, International Conference on Dependable Systems and Networks, 2004.

[6]  Wenbing Zhao,et al.  Design and implementation of a Byzantine fault tolerance framework for Web services , 2009, J. Syst. Softw..

[7]  Sangmin Lee,et al.  Upright cluster services , 2009, SOSP '09.

[8]  Louise E. Moser,et al.  Toward Trustworthy Coordination of Web Services Business Activities , 2013, IEEE Transactions on Services Computing.

[9]  Tobias Distler,et al.  Increasing performance in byzantine fault-tolerant systems with on-demand replica consistency , 2011, EuroSys '11.

[10]  Miguel Castro,et al.  Practical byzantine fault tolerance and proactive recovery , 2002, TOCS.

[11]  Michael Dahlin,et al.  Making Byzantine Fault Tolerant Systems Tolerate Byzantine Faults , 2009, NSDI.