Locality-Preserving Clustering and Discovery of Resources in Wide-Area Distributed Computational Grids

In large-scale computational Grids, discovery of heterogeneous resources as a working group is crucial to achieving scalable performance. This paper presents a resource management scheme including a hierarchical cycloid overlay architecture, resource clustering and discovery algorithms for wide-area distributed Grid systems. We establish program/data locality by clustering resources based on their physical proximity and functional matching with user applications. We further develop dynamism-resilient resource management algorithm, cluster-token forwarding algorithm, and deadline-driven resource management algorithms. The advantage of the proposed scheme lies in low overhead, fast and dynamism-resilient multiresource discovery. The paper presents the scheme, new performance metrics, and experimental simulation results. This scheme compares favorably with other resource discovery methods in static and dynamic Grid applications. In particular, it supports efficient resource clustering, reduces communications cost, and enhances resource discovery success rate in promoting large-scale distributed supercomputing applications.

[1]  P. Salvador,et al.  Study on geographical distribution and availability of BitTorrent peers sharing video files , 2008, 2008 IEEE International Symposium on Consumer Electronics.

[2]  Walter Willinger,et al.  Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level , 1997, TNET.

[3]  Amin Vahdat,et al.  Design and implementation tradeoffs for wide-area resource discovery , 2005, HPDC.

[4]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[5]  Li Xiao,et al.  An Effective P2P Search Scheme to Exploit File Sharing Heterogeneity , 2007, IEEE Transactions on Parallel and Distributed Systems.

[6]  Francine Berman,et al.  Adaptive Computing on the Grid Using AppLeS , 2003, IEEE Trans. Parallel Distributed Syst..

[7]  Guihai Chen,et al.  Cycloid: a constant-degree and lookup-efficient P2P overlay network , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[8]  Timothy L. Harris,et al.  XenoSearch: distributed resource discovery in the XenoServer open platform , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[9]  Elena Pagani,et al.  ORION - Ontology-based queRy routIng in Overlay Networks , 2009, J. Parallel Distributed Comput..

[10]  Artur Andrzejak,et al.  Scalable, efficient range queries for grid information services , 2002, Proceedings. Second International Conference on Peer-to-Peer Computing,.

[11]  Y. Charlie Hu,et al.  Assisted Peer-to-Peer Search with Partial Indexing , 2007, IEEE Transactions on Parallel and Distributed Systems.

[12]  Miguel Castro,et al.  Dynamic Lookup Networks , 2003, Future Directions in Distributed Computing.

[13]  Li Xiao,et al.  Improving distributed workload performance by sharing both CPU and memory resources , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[14]  Magnus Karlsson,et al.  Turning heterogeneity into an advantage in overlay routing , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[15]  Manish Parashar,et al.  Squid: Enabling search in DHT-based systems , 2008, J. Parallel Distributed Comput..

[16]  Jun Wang,et al.  An advertisement-based peer-to-peer search algorithm , 2009, J. Parallel Distributed Comput..

[17]  Ian T. Foster,et al.  Condor-G: A Computation Management Agent for Multi-Institutional Grids , 2004, Cluster Computing.

[18]  Jing Zhu,et al.  SOMO: Self-Organized Metadata Overlay for Resource Management in P2P DHT , 2003, IPTPS.

[19]  H. V. Jagadish,et al.  Linear clustering of objects with multiple attributes , 1990, SIGMOD '90.

[20]  Manish Parashar,et al.  Flexible information discovery in decentralized distributed systems , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[21]  Min Cai,et al.  MAAN: A Multi-Attribute Addressable Network for Grid Information Services , 2003, Journal of Grid Computing.

[22]  Mark Handley,et al.  Topologically-aware overlay construction and server selection , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[23]  Yunhao Liu,et al.  BAKE: A Balanced Kautz Tree Structure for Peer-to-Peer Networks , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[24]  James E. Smith,et al.  Virtual machines - versatile platforms for systems and processes , 2005 .

[25]  Pablo Molinero-Fernández,et al.  Systems with multiple servers under heavy-tailed workloads , 2005, Perform. Evaluation.

[26]  Anand Sivasubramaniam,et al.  SSW: A Small-World-Based Overlay for Peer-to-Peer Search , 2008, IEEE Transactions on Parallel and Distributed Systems.

[27]  Yiming Hu,et al.  Ferry: A P2P-Based Architecture for Content-Based Publish/Subscribe Services , 2007, IEEE Transactions on Parallel and Distributed Systems.

[28]  Ben Y. Zhao,et al.  Tapestry: a resilient global-scale overlay for service deployment , 2004, IEEE Journal on Selected Areas in Communications.

[29]  Mor Harchol-Balter,et al.  Exploiting process lifetime distributions for dynamic load balancing , 1995, SIGMETRICS.

[30]  Franco P. Preparata,et al.  The cube-connected-cycles: A versatile network for parallel computation , 1979, 20th Annual Symposium on Foundations of Computer Science (sfcs 1979).

[31]  Cai Yi-chao,et al.  Indexing structures for content-based retrieval of large image databases: a review , 2005 .

[32]  Jie Wu,et al.  Moore: An Extendable Peer-to-Peer Network Based on Incomplete Kautz Digraph with Constant Degree , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[33]  Desh Ranjan,et al.  Space-Filling Curves and Their Use in the Design of Geometric Data Structures , 1997, Theor. Comput. Sci..

[34]  Ellen W. Zegura,et al.  How to model an internetwork , 1996, Proceedings of IEEE INFOCOM '96. Conference on Computer Communications.

[35]  Srinivasan Seshan,et al.  Mercury: supporting scalable multi-attribute range queries , 2004, SIGCOMM '04.

[36]  Balachander Krishnamurthy,et al.  Web protocols and practice , 2001 .

[37]  Hong Jiang,et al.  HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems , 2008, IEEE Transactions on Parallel and Distributed Systems.

[38]  Wolfgang Kellerer,et al.  Cost-Based Analysis of Hierarchical DHT Design , 2006, Sixth IEEE International Conference on Peer-to-Peer Computing (P2P'06).

[39]  Hector Garcia-Molina,et al.  One torus to rule them all: multi-dimensional queries in P2P systems , 2004, WebDB '04.

[40]  Takeo Kanade,et al.  Advances in Data and Web Management , 2009 .

[41]  Yunhao Liu,et al.  Popularity Adaptive Search in Hybrid P2P Systems , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[42]  Gade Krishna,et al.  A scalable peer-to-peer lookup protocol for Internet applications , 2012 .

[43]  Csaba D. Tóth,et al.  Uncoordinated Load Balancing and Congestion Games in P2P Systems , 2004, IPTPS.

[44]  Amin Vahdat,et al.  Scalable Wide-Area Resource Discovery , 2004 .

[45]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[46]  Yuchi Liu,et al.  Indexing Structures for Content-Based Retrieval of Large Image Databases: A Review , 2005, AIRS.

[47]  Dharma P. Agrawal,et al.  dBCube: A New Class of Hierarchical Multiprocessor Interconnection Networks with Area Efficient Layout , 1993, IEEE Trans. Parallel Distributed Syst..

[48]  Ben Y. Zhao,et al.  An Infrastructure for Fault-tolerant Wide-area Location and Routing , 2001 .

[49]  Ting Li,et al.  PIRD: P2P-Based Intelligent Resource Discovery in Internet-Based Distributed Systems , 2008, 2008 The 28th International Conference on Distributed Computing Systems.

[50]  Krishna P. Gummadi,et al.  Canon in G major: designing DHTs with hierarchical structure , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[51]  Cheng-Zhong Xu,et al.  Locality-Aware and Churn-Resilient Load-Balancing Algorithms in Structured Peer-to-Peer Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[52]  Ben Y. Zhao,et al.  Tapestry: An Infrastructure for Fault-tolerant Wide-area Location and , 2001 .

[53]  Yuan He,et al.  KCube: A novel architecture for interconnection networks , 2010, Inf. Process. Lett..

[54]  M.I.T. Press,et al.  The International Journal of Supercomputer Applications and High Performance Computing— , 1994 .

[55]  Shyi-Chyi Cheng,et al.  Fast indexing method for image retrieval using k nearest neighbors searches by principal axis analysis , 2006, J. Vis. Commun. Image Represent..

[56]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[57]  Kai Hwang,et al.  Distributed Aggregation Algorithms with Load-Balancing for Scalable Grid Resource Monitoring , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.