Bag-of-Tasks Self-Scheduling over Range-Queriable Search Overlays

The opportunistic computing paradigm is extremely valuable to modern technical and scientific endeavors, as it can support the demand for large and steady amounts of computing capacity. The applications of opportunistic computing environments often require independent and intensive processing over different data sets, characterizing themselves as BoT applications. Opportunistic computing systems, however, usually employ centralized approaches to do task allocation, a problematic situation on sizable settings. This paper proposes and evaluates a peer-to-peer technique that allows the self-scheduling of tasks without any central controller whatsoever, aiming at opportunistic computing scenarios running BoT applications. Its key is to employ range query capabilities of search overlays like Skip Graphs as an infrastructure for fully distributed allocation decisions. Experimental results obtained in a message-passing simulator consisting of 5,000 nodes and 75,000 tasks show that central points of failure were eliminated and communication bottlenecks were highly alleviated, subject to some congestion characteristics of the search overlay.

[1]  Nazareno Andrade,et al.  Labs of the World, Unite!!! , 2006, Journal of Grid Computing.

[2]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[3]  Suresh Jagannathan,et al.  Unstructured peer-to-peer networks for sharing processor cycles , 2006, Parallel Comput..

[4]  James Aspnes,et al.  Skip graphs , 2003, SODA '03.

[5]  Mario Lauria,et al.  The organic grid: self-organizing computation on a peer-to-peer network , 2004 .

[6]  Azzedine Boukerche,et al.  A multiple task allocation framework for biological sequence comparison in a grid environment , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[7]  Francisco Vilar Brasileiro,et al.  Running Bag-of-Tasks applications on computational grids: the MyGrid approach , 2003, 2003 International Conference on Parallel Processing, 2003. Proceedings..

[8]  L.M. Ni,et al.  Trapezoid Self-Scheduling: A Practical Scheduling Scheme for Parallel Compilers , 1993, IEEE Trans. Parallel Distributed Syst..

[9]  CONSTANTINE D. POLYCHRONOPOULOS,et al.  Guided Self-Scheduling: A Practical Scheduling Scheme for Parallel Supercomputers , 1987, IEEE Transactions on Computers.

[10]  Ian T. Foster,et al.  On Death, Taxes, and the Convergence of Peer-to-Peer and Grid Computing , 2003, IPTPS.

[11]  Matei Ripeanu,et al.  Peer-to-peer architecture case study: Gnutella network , 2001, Proceedings First International Conference on Peer-to-Peer Computing.

[12]  Edith Schonberg,et al.  Factoring: a method for scheduling parallel loops , 1992 .

[13]  Damla Turgut,et al.  Task distribution with a random overlay network , 2006, Future Gener. Comput. Syst..

[14]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[15]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[16]  Diomidis Spinellis,et al.  A survey of peer-to-peer content distribution technologies , 2004, CSUR.

[17]  Robert Tappan Morris,et al.  Serving DNS Using a Peer-to-Peer Lookup Service , 2002, IPTPS.

[18]  Kenneth Steiglitz,et al.  Combinatorial Optimization: Algorithms and Complexity , 1981 .

[19]  Peiyi Tang,et al.  Dynamic Processor Self-Scheduling for General Parallel Nested Loops , 1987, IEEE Trans. Computers.

[20]  Unai Arronategui,et al.  Scalable Architecture for Allocation of Idle CPUs in a P2P Network , 2006, HPCC.

[21]  Francine Berman,et al.  Adaptive scheduling of master/worker applications on distributed computational resources , 2001 .

[22]  Ben Y. Zhao,et al.  An Infrastructure for Fault-tolerant Wide-area Location and Routing , 2001 .