Efficient Parallel Application Execution on Opportunistic Desktop Grids

[1]  Fabio Kon,et al.  Design and Implementation of a Middleware for Data Storage in Opportunistic Grids , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[2]  Daniel Marques,et al.  Automated application-level checkpointing of MPI programs , 2003, PPoPP '03.

[3]  Lucio Grandinetti.  Grid Computing: The New Frontier of High Performance Computing [post-proceedings of the High Performance Computing Workshop, HPC 2004, Cetraro, Italy, 2004] , 2005, Advances in Parallel Computing.

[4]  Marcelo Finger,et al.  Resource use pattern analysis for opportunistic grids , 2008, MGC '08.

[5]  Bysmarck Barros de Sousa,et al.  MAG: A Mobile Agent Based Computational Grid Platform , 2005, GCC.

[6]  L. Alvisi,et al.  A Survey of Rollback-Recovery Protocols , 2002 .

[7]  Anne-Marie Kermarrec,et al.  Peer-to-Peer Membership Management for Gossip-Based Protocols , 2003, IEEE Trans. Computers.

[8]  Rida A. Bazzi,et al.  Compiler-assisted heterogeneous checkpointing , 2001, Proceedings 20th IEEE Symposium on Reliable Distributed Systems.

[9]  Carl Kesselman,et al.  Performance and scalability of a replica location service , 2004, Proceedings. 13th IEEE International Symposium on High Performance Distributed Computing, 2004.

[10]  Domenico Talia,et al.  Grid Programming Models: Current Tools, Issues and Directions , 2003 .

[11]  Ahmed Karmouch,et al.  Mobile software agents: an overview , 1998, IEEE Commun. Mag..

[12]  Miron Livny,et al.  Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System , 1997 .

[13]  Peter Sobe.  Stable checkpointing in distributed systems without shared disks , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[14]  Alfredo Goldman,et al.  MobiGrid*: Framework for mobile agents on computer grid environments , 2004 .

[15]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[16]  Barry Wilkinson.  Grid Computing: Techniques and Applications , 2009 .

[17]  Marcelo Finger,et al.  Resource use pattern analysis for predicting resource availability in opportunistic grids , 2010, Concurr. Comput. Pract. Exp..

[18]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[19]  Calton Pu,et al.  A feedback-driven proportion allocator for real-rate scheduling , 1999, OSDI '99.

[20]  Fabio Kon,et al.  The implementation of the BSP parallel computing model on the InteGrade Grid middleware , 2005, MGC '05.

[21]  Calton Pu,et al.  Adaptive Resource Management via Modular Feedback Control , 1999 .

[22]  Ian T. Foster,et al.  A decentralized, adaptive replica location mechanism , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[23]  Fabio Kon,et al.  Portable checkpointing and communication for BSP applications on dynamic heterogeneous grid environments , 2005, 17th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'05).

[24]  Fabio Kon,et al.  A group membership service for large-scale grids , 2008, MGC '08.

[25]  John Kubiatowicz,et al.  Erasure Coding Vs. Replication: A Quantitative Comparison , 2002, IPTPS.

[26]  Hesham H. Ali,et al.  Task scheduling in parallel and distributed systems , 1994, Prentice Hall series in innovative technology.

[27]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[28]  Fabio Kon,et al.  InteGrade: object‐oriented Grid middleware leveraging the idle computing power of desktop machines , 2004, Concurr. Pract. Exp..

[29]  Kai Li,et al.  Diskless Checkpointing , 1998, IEEE Trans. Parallel Distributed Syst..

[30]  Geoffrey E. Hinton,et al.  Unsupervised learning : foundations of neural computation , 1999 .

[31]  Robbert van Renesse,et al.  A Gossip-Style Failure Detection Service , 2009 .

[32]  Fabio Kon,et al.  Strategies for Checkpoint Storage on Opportunistic Grids , 2006, IEEE Distributed Systems Online.

[33]  Michael O. Rabin,et al.  Efficient dispersal of information for security, load balancing, and fault tolerance , 1989, JACM.

[34]  P. Sadayappan,et al.  Distributed job scheduling on computational Grids using multiple simultaneous requests , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[35]  Thomas Hérault,et al.  MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[36]  Min Cai,et al.  A Peer-to-Peer Replica Location Service Based on a Distributed Hash Table , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[37]  Fabio Kon,et al.  Adaptive fault tolerance mechanisms for opportunistic environments: a mobile agent approach , 2011, Concurr. Comput. Pract. Exp..

[38]  Yanmin Zhu,et al.  A Survey on Grid Scheduling Systems , 2013 .

[39]  Roberto Ierusalimschy,et al.  Lua—An Extensible Extension Language , 1996, Softw. Pract. Exp..

[40]  Rodrigo Rodrigues,et al.  High Availability in DHTs: Erasure Coding vs. Replication , 2005, IPTPS.

[41]  Francisco Vilar Brasileiro,et al.  Faults in grids: why are they so bad and what can be done about it? , 2003, Proceedings. First Latin American Web Congress.

[42]  Jennifer M. Schopf,et al.  Ten actions when Grid scheduling: the user as a Grid scheduler , 2004 .

[43]  R. F. Freund,et al.  Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems , 1999, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99).

[44]  Alex Rapaport,et al.  MPI-2: extensions to the message-passing interface , 1997 .

[45]  Soonwook Hwang,et al.  Grid workflow: a flexible failure handling framework for the grid , 2003, Proceedings. 12th IEEE International Symposium on High Performance Distributed Computing, 2003.

[46]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[47]  Fabio Kon,et al.  Application execution management on the InteGrade opportunistic grid middleware , 2010, J. Parallel Distributed Comput..

[48]  Volker Strumpen,et al.  Portable Checkpointing and Recovery in Heterogeneous Environments , 1996 .

[49]  William E. Johnston,et al.  Coding for High Availability of a Distributed-Parallel Storage System , 1998, IEEE Trans. Parallel Distributed Syst..

[50]  Rob H. Bisseling,et al.  Parallel scientific computation - a structured approach using BSP and MPI , 2004 .

[51]  Fábio M. Costa,et al.  MPI support on opportunistic grids based on the InteGrade middleware , 2010 .

[52]  Fabio Kon,et al.  Trust in large-scale computational grids: an SPKI/SDSI extension for representing opinion , 2006, MGC '06.

[53]  Selim G. Akl,et al.  Scheduling Algorithms for Grid Computing: State of the Art and Open Problems , 2006 .
