Transfer^ encia de Dados entre Grupos de Processos no Modelo BAR

Peer-to-peer networks have emerged as a relevant architecture for executing scientific computations using resources provided by volunteers. However, this architecture poses several challenges, since volunteers may fail arbitrarily (Byzantine behaviour) or even follow a selfish behaviour (Rational behaviour), as indicated by the Byzantine-Altruistic-Rational (BAR) model. Also, if such distributed computations are to be performed in a computational model such as MapReduce, there must be some mechanism for reliably transferring data among tasks. This thesis introduces the N-party BAR Transfer (NBART) problem, which is the problem of reliably transferring data from a set of producers to a set of consumers, in the BAR model. This is an important building block for volunteer computing, since it allows tasks to reliably communicate among each other, and volunteers are not obliged to store the data for long periods of time. Instead, each volunteer may transfer the data, after storing it for a minimum amount of time, to another volunteer. This thesis also proposes two alternative algorithms for solving the NBART problem. One executes in a constant number of rounds and has greater communication complexity, while the other has a lower communication complexity but it has a greater execution time. It is proved that, if no volunteer follows a Rational behaviour, then both algorithms ensure a reliable transfer of data when a majority of producers and a majority of consumers are non-Byzantine. In addition, a theoretical analysis based on Game Theory is performed, proving that both algorithms provide a Nash equilibrium. Palavras Chave Keywords

[1]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1985, JACM.

[2]  Ingoo Han,et al.  Multi-party fair exchange protocol using ring architecture model , 2000 .

[3]  Michael Dahlin,et al.  FlightPath: Obedience vs. Choice in Cooperative Services , 2008, OSDI.

[4]  Fred B. Schneider,et al.  Implementing fault-tolerant services using the state machine approach: a tutorial , 1990, CSUR.

[5]  W. Hamilton,et al.  The evolution of cooperation. , 1984, Science.

[6]  Laurent Massoulié,et al.  Faithfulness in internet algorithms , 2004, PINS '04.

[7]  Olivier Markowitch,et al.  A Multi-Party Non-Repudiation Protocol , 2000, SEC.

[8]  Robert H. Deng,et al.  Multi-party fair exchange with an off-line trusted neutral party , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[9]  Olivier Markowitch,et al.  A Multi-party Optimistic Non-repudiation Protocol , 2000, ICISC.

[10]  A. K. Somani,et al.  Reputation management framework and its use as currency in large-scale peer-to-peer networks , 2004 .

[11]  Philip M. Thambidurai,et al.  Interactive consistency with multiple failure modes , 1988, Proceedings [1988] Seventh Symposium on Reliable Distributed Systems.

[12]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[13]  Michael Dahlin,et al.  BAR gossip , 2006, OSDI '06.

[14]  Miguel Correia,et al.  Active Quorum Systems , 2010, HotDep.

[15]  Tobias Distler,et al.  Increasing performance in byzantine fault-tolerant systems with on-demand replica consistency , 2011, EuroSys '11.

[16]  Michael J. Fischer,et al.  The Consensus Problem in Unreliable Distributed Systems (A Brief Survey) , 1983, FCT.

[17]  Kang G. Shin,et al.  Interleaved All-to-All Reliable Broadcast on Meshes and Hypercubes , 1994, IEEE Trans. Parallel Distributed Syst..

[18]  Danny Dolev,et al.  On the minimal synchronism needed for distributed consensus , 1983, 24th Annual Symposium on Foundations of Computer Science (sfcs 1983).

[19]  G. Mailath Do People Play Nash Equilibrium? Lessons From Evolutionary Game Theory , 1998 .

[20]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[21]  Michael K. Reiter,et al.  Byzantine quorum systems , 1997, STOC '97.

[22]  Brian D. Noble,et al.  Samsara: honor among thieves in peer-to-peer storage , 2003, SOSP '03.

[23]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.

[24]  B. Cohen,et al.  Incentives Build Robustness in Bit-Torrent , 2003 .

[25]  Fred B. Schneider,et al.  Byzantine generals in action: implementing fail-stop processors , 1984, TOCS.

[26]  Jianying Zhou,et al.  An intensive survey of fair non-repudiation protocols , 2002, Comput. Commun..

[27]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[28]  Idit Keidar,et al.  EquiCast: scalable multicast with selfish users , 2006, PODC '06.

[29]  Michael K. Reiter,et al.  Dynamic byzantine quorum systems , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[30]  Ning Zhang,et al.  Achieving Non-Repudiation of Receipt , 1996, Comput. J..

[31]  Michael Dahlin,et al.  Theory of BAR games , 2007, PODC '07.

[32]  Matthias Schunter,et al.  Optimistic Protocols for Multi-Party Fair Exchange , 1996 .

[33]  Miguel Correia,et al.  Byzantine Fault-Tolerant MapReduce: Faults are Not Just Crashes , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[34]  David P. Anderson,et al.  SETI@home: an experiment in public-resource computing , 2002, CACM.

[35]  Robbert van Renesse,et al.  Fireflies: scalable support for intrusion-tolerant network overlays , 2006, EuroSys.

[36]  Sam Toueg,et al.  Asynchronous consensus and broadcast protocols , 1985, JACM.

[37]  Michael Dahlin,et al.  Minimal Byzantine Storage , 2002, DISC.

[38]  Michael Dahlin,et al.  BAR fault tolerance for cooperative services , 2005, SOSP '05.

[39]  Enrico Tronci,et al.  Model Checking Coalition Nash Equilibria in MAD Distributed Systems , 2009, SSS.

[40]  Henri E. Bal,et al.  An efficient reliable broadcast protocol , 1989, OPSR.

[41]  Éva Tardos,et al.  The effect of collusion in congestion games , 2006, STOC '06.

[42]  Jean-Philippe Martin,et al.  Fast Byzantine Consensus , 2006, IEEE Trans. Dependable Secur. Comput..

[43]  Yonatan Aumann,et al.  Pareto Efficiency and Approximate Pareto Efficiency in Routing and Load Balancing Games , 2010, SAGT.

[44]  Danny Dolev,et al.  Authenticated Algorithms for Byzantine Agreement , 1983, SIAM J. Comput..

[45]  Sangmin Lee,et al.  Upright cluster services , 2009, SOSP '09.

[46]  Tom Coffey,et al.  Non-repudiation with mandatory proof of receipt , 1996, CCRV.

[47]  Leslie Lamport,et al.  Reaching Agreement in the Presence of Faults , 1980, JACM.

[48]  Mark Lillibridge,et al.  Peer-to-peer Cooperative Backup System , 2001 .

[49]  Leslie Lamport,et al.  Cheap Paxos , 2004, International Conference on Dependable Systems and Networks, 2004.

[50]  Sam Toueg,et al.  A Modular Approach to Fault-Tolerant Broadcasts and Related Problems , 1994 .

[51]  Miguel Castro,et al.  BASE: Using abstraction to improve fault tolerance , 2003, TOCS.

[52]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[53]  Robert Griesemer,et al.  Paxos made live: an engineering perspective , 2007, PODC '07.

[54]  J. Nash NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[55]  Miguel Correia,et al.  EBAWA: Efficient Byzantine Agreement for Wide-Area Networks , 2010, 2010 IEEE 12th International Symposium on High Assurance Systems Engineering.

[56]  S. Lennart Johnsson,et al.  Optimum Broadcasting and Personalized Communication in Hypercubes , 1989, IEEE Trans. Computers.

[57]  Vivien Quéma,et al.  FireSpam: Spam Resilient Gossiping in the BAR Model , 2010, 2010 29th IEEE Symposium on Reliable Distributed Systems.

[58]  Jo-Mei Chang,et al.  Reliable broadcast protocols , 1984, TOCS.

[59]  Michel Raynal,et al.  Consensus in Byzantine asynchronous systems , 2003, J. Discrete Algorithms.

[60]  Javier López,et al.  Multiparty nonrepudiation: A survey , 2009, CSUR.

[61]  Andrew S. Tanenbaum,et al.  An evaluation of the Amoeba group communication system , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[62]  J. Goldberg,et al.  SIFT: Design and analysis of a fault-tolerant computer for aircraft control , 1978, Proceedings of the IEEE.

[63]  Enrico Tronci,et al.  Model Checking Nash Equilibria in MAD Distributed Systems , 2008, 2008 Formal Methods in Computer-Aided Design.

[64]  Olivier Markowitch,et al.  Probabilistic Non-Repudiation without Trusted Third Party , 1999 .

[65]  Rachid Guerraoui,et al.  Introduction to reliable distributed programming , 2006 .

[66]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[67]  Miguel Correia,et al.  N-party BAR Transfer , 2011, OPODIS.

[68]  Danny Dolev,et al.  Distributed computing meets game theory: robust mechanisms for rational secret sharing and multiparty computation , 2006, PODC '06.

[69]  Miguel Castro,et al.  Practical byzantine fault tolerance and proactive recovery , 2002, TOCS.

[70]  Emin Gün Sirer,et al.  Experience with an Object Reputation System for Peer-to-Peer Filesharing , 2006, NSDI.

[71]  Stefan Schmid,et al.  On the topologies formed by selfish peers , 2006, PODC '06.

[72]  Leslie Lamport,et al.  The part-time parliament , 1998, TOCS.

[73]  Juan A. Garay,et al.  Abuse-Free Multi-party Contract Signing , 1999, DISC.

[74]  Sam Toueg,et al.  Time and Message Efficient Reliable Broadcasts , 1990, WDAG.

[75]  Olivier Markowitch,et al.  Optimistic non-repudiable information exchange , 2000 .

[76]  G. Hardin,et al.  The Tragedy of the Commons , 1968, Green Planet Blues.

[77]  Pierre Fraigniaud,et al.  Asymptotically Optimal Broadcasting and Gossiping in Faulty Hypercube Multicomputers , 1992, IEEE Trans. Computers.

[78]  Olivier Markowitch,et al.  Optimistic Fair Exchange with Transparent Signature Recovery , 2002, Financial Cryptography.

[79]  Yongfei Han Investigation of non-repudiation protocols , 1996, ACISP.

[80]  Leslie Lamport,et al.  The Byzantine Generals Problem , 1982, TOPL.

[81]  Michael O. Rabin,et al.  Transaction Protection by Beacons , 1983, J. Comput. Syst. Sci..

[82]  K. Eliaz Fault Tolerant Implementation , 2002 .

[83]  Miguel Castro,et al.  BASE: using abstraction to improve fault tolerance , 2001, SOSP.

[84]  Robert H. Deng,et al.  Evolution of Fair Non-repudiation with TTP , 1999, ACISP.

[85]  Liuba Shrira,et al.  HQ replication: a hybrid quorum protocol for byzantine fault tolerance , 2006, OSDI '06.

[86]  Robbert van Renesse,et al.  A middleware for gossip protocols , 2010, IPTPS.

[87]  Leslie Lamport,et al.  Disk Paxos , 2003, Distributed Computing.

[88]  Leslie Lamport,et al.  Lower bounds for asynchronous consensus , 2006, Distributed Computing.

[89]  James S. Plank A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems , 1997 .

[90]  Michael Dahlin,et al.  BAR primer , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).

[91]  Geoff Coulson,et al.  Free Riding on Gnutella Revisited: The Bell Tolls? , 2005, IEEE Distributed Syst. Online.

[92]  Kenneth P. Birman,et al.  Bimodal multicast , 1999, TOCS.

[93]  Edmund L. Wong,et al.  It's on Me! The Benefit of Altruism in BAR Environments , 2010, DISC.

[94]  Darrell D. E. Long,et al.  DESIGN CHOICES FOR WEAK-CONSISTENCY GROUP COMMUNICATION , 1992 .

[95]  Sam Toueg,et al.  The weakest failure detector for solving consensus , 1996, JACM.

[96]  Michael Waidner,et al.  Optimistic Synchronous Multi-Party Contract Signing , 1998 .