geográficamente distribuidos, basada en Replicación de Datos

[1]  Barry W. Johnson Design & analysis of fault tolerant digital systems , 1988 .

[2]  Erik Maehle,et al.  Fault-Tolerant Dynamic Task Scheduling Based on Dataflow Graphs , 1998 .

[3]  Thomas Hérault,et al.  MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[4]  Sudhakar M. Reddy,et al.  Perspectives for High Performance Computing in Workstation Networks , 1996, HPCN Europe.

[5]  Ashok K. Agrawala,et al.  Fault Tolerant System Design , 1993 .

[6]  Kenneth P. Birman,et al.  Building Secure and Reliable Network Applications [Book Review] , 1998, IEEE Concurrency.

[7]  Herbert F. York,et al.  John Von Neumann: The Scientific Genius Who Pioneered the Modern Computer, Game Theory, Nuclear Deterrance, and Much More , 1993 .

[8]  Rajkumar Buyya,et al.  High Performance Cluster Computing , 1999 .

[9]  Thomas Sterling,et al.  How to Build a Beowulf: A Guide to the Implementation and Application of PC Clusters 2nd Printing , 1999 .

[10]  Kang G. Shin,et al.  Fault Injection Techniques and Tools , 1997, Computer.

[11]  Emilio Luque,et al.  Architectures for an Efficient Application Execution in a Collection of HNOWS , 2002, PVM/MPI.

[12]  Jon B. Weissman Fault Tolerant Wide-Area Parallel Computing , 2000, IPDPS Workshops.

[13]  Christian Engelmann,et al.  Development of Naturally Fault Tolerant Algorithms for Computing on 100,000 Processors , 2002 .

[14]  Anoop Gupta,et al.  Parallel computer architecture - a hardware / software approach , 1998 .

[15]  David B. Skillicorn A taxonomy for computer architectures , 1988, Computer.

[16]  Algirdas Avizienis,et al.  Toward Systematic Design of Fault-Tolerant Systems , 1997, Computer.

[17]  Emilio Luque,et al.  Efficient Execution on Long-Distance Geographically Distributed Dedicated Clusters , 2004, PVM/MPI.

[18]  Jack J. Dongarra,et al.  FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in a Dynamic World , 2000, PVM/MPI.

[19]  Thomas L. Sterling Launching into the future of commodity cluster computing , 2002, Proceedings. IEEE International Conference on Cluster Computing.

[20]  Jean Arlat,et al.  Fault Injection for Dependability Validation: A Methodology and Some Applications , 1990, IEEE Trans. Software Eng..

[21]  Norman P. Jouppi,et al.  Readings in computer architecture , 2000 .

[22]  Adrianos Lachanas,et al.  MPI-FT: Portable Fault Tolerance Scheme for MPI , 2000, Parallel Process. Lett..

[23]  Ian T. Foster,et al.  Designing and building parallel programs - concepts and tools for parallel software engineering , 1995 .

[24]  Yves Robert,et al.  Matrix Multiplication on Heterogeneous Platforms , 2001, IEEE Trans. Parallel Distributed Syst..

[25]  Hermann Kopetz,et al.  Dependability: Basic Concepts and Terminology , 1992 .

[26]  Johan Karlsson,et al.  Comparison of Physical and Software-Implemented Fault Injection Techniques , 2003, IEEE Trans. Computers.

[27]  G. C. Fox,et al.  Solving Problems on Concurrent Processors , 1988 .

[28]  Flaviu Cristian,et al.  Understanding fault-tolerant distributed systems , 1991, CACM.

[29]  Hairong Sun,et al.  A generic availability model for clustered computing systems , 2001, Proceedings 2001 Pacific Rim International Symposium on Dependable Computing.

[30]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[31]  Rajkumar Buyya,et al.  A taxonomy of Data Grids for distributed data sharing, management, and processing , 2005, CSUR.

[32]  Emilio Luque,et al.  Fault Tolerant Master-Worker over a Multi-Cluster Architecture , 2005, PARCO.

[33]  Stephen Taylor,et al.  Reliable heterogeneous applications , 2003, IEEE Trans. Reliab..

[34]  J-C. Laprie,et al.  DEPENDABLE COMPUTING AND FAULT TOLERANCE : CONCEPTS AND TERMINOLOGY , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing, 1995, ' Highlights from Twenty-Five Years'..

[35]  Laxmikant V. Kalé,et al.  A fault tolerant protocol for massively parallel systems , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[36]  Yasushi Saito,et al.  Optimistic replication , 2005, CSUR.

[37]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[38]  Anthony Skjellum,et al.  A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..

[39]  Leslie Lamport,et al.  Distributed snapshots: determining global states of distributed systems , 1985, TOCS.

[40]  Jon B. Weissman Fault tolerant computing on the grid: what are my options? , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[41]  Roy Friedman,et al.  Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations , 2004, Cluster Computing.

[42]  Anthony Skjellum,et al.  MPI/FT/sup TM/: architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[43]  Dhiraj K. Pradhan,et al.  Roll-Forward and Rollback Recovery: Performance-Reliability Trade-Off , 1997, IEEE Trans. Computers.

[44]  Zvonko G. Vranesic,et al.  Computer Organization , 1984 .

[45]  Peter S. Pacheco Parallel programming with MPI , 1996 .

[46]  P. Pierce,et al.  The Paragon implementation of the NX message passing interface , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[47]  Yves Robert,et al.  The master-slave paradigm with heterogeneous processors , 2001, Proceedings 42nd IEEE Symposium on Foundations of Computer Science.

[48]  Algirdas Avizienis,et al.  Design of fault-tolerant computers , 1967, AFIPS '67 (Fall).

[49]  William Gropp,et al.  User's Guide for MPE: Extensions for MPI Programs , 1998 .

[50]  Georg Stellner,et al.  CoCheck: checkpointing and process migration for MPI , 1996, Proceedings of International Conference on Parallel Processing.

[51]  Jack J. Dongarra,et al.  Building and Using a Fault-Tolerant MPI Implementation , 2004, Int. J. High Perform. Comput. Appl..