Data transfer scheduling with advance reservation and provisioning

Over the years, scientific applications have become more complex and more data intensive. Although through the use of distributed resources the institutions and organizations gain access to the resources needed for their large-scale applications, complex middleware is required to orchestrate the use of these storage and network resources between collaborating parties, and to manage the end-to-end processing of data. We present a new data scheduling paradigm with advance reservation and provisioning. Our methodology provides a basis for provisioning end-to-end high performance data transfers which require integration between system, storage and network resources, and coordination between reservation managers and data transfer nodes. This allows researchers/users and higher level meta-schedulers to use data placement as a service where they can plan ahead and reserve time and resources for their data movement operations. We present a novel approach for evaluating timedependent structures with bandwidth guaranteed paths. We present a practical online scheduling model using advance reservation in dynamic network with time constraints. In addition, we report a new polynomial algorithm presenting possible reservation options and alternatives for earliest completion and shortest transfer duration. We enhance the advance network reservation system by extending the underlying mechanism to provide a new service in which users submit their constraints and the system suggests possible reservation requests satisfying users’ requirements. We have studied scheduling data transfer operation with resource and time conflicts. We have developed a new scheduling methodology considering resource allocation in client sites and bandwidth allocation on network link connecting resources. Some other major contributions of our study include enhanced reliability, adaptability, and performance optimization of distributed data placement tasks. While designing this new data scheduling architecture, we also developed other important methodologies such as early error detection, failure awareness, job aggregation, and dynamic adaptation of distributed data placement tasks. The adaptive tuning includes dynamically setting data transfer parameters and controlling utilization of available network capacity. Our research aims to provide a middleware to improve the data bottleneck in high performance computing systems.

[1]  Mehmet Balman,et al.  Distributed data management with PetaShare , 2008, Mardi Gras Conference.

[2]  Chase Qishi Wu,et al.  On design of bandwidth scheduling algorithms for multiple data transfers in dedicated networks , 2008, ANCS '08.

[3]  Ismail Chabini,et al.  Discrete Dynamic Shortest Path Problems in Transportation Applications: Complexity and Algorithms with Optimal Run Time , 1998 .

[4]  Ian J. Taylor,et al.  Visual Grid Workflow in Triana , 2005, Journal of Grid Computing.

[5]  Lars-Olof Burchard,et al.  Analysis of data structures for admission control of advance reservation requests , 2005, IEEE Transactions on Knowledge and Data Engineering.

[6]  Chung-Piaw Teo,et al.  Gale-Shapley Stable Marriage Problem Revisited: Strategic Issues and Applications , 1999, IPCO.

[7]  Reuven Cohen,et al.  Graded Channel Reservation with Path Switching in Ultra High Capacity Networks , 2006, 2006 3rd International Conference on Broadband Communications, Networks and Systems.

[8]  Eduard Escalona,et al.  Advance reservations for service-aware GMPLS-based optical networks , 2008, Comput. Networks.

[9]  Gabrielle Allen,et al.  Semantic enabled metadata management in PetaShare , 2009, Int. J. Grid Util. Comput..

[10]  Jason Lee,et al.  High-Performance Remote Access to Climate Simulation Data: A Challenge Problem for Data Grid Technologies , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[11]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[12]  Mehmet Balman,et al.  Dynamic Adaptation of Parallelism Level in Data Transfer Scheduling , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[13]  Kristina Lerman,et al.  Resource allocation in the grid using reinforcement learning , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[14]  Kavitha Ranganathan,et al.  Computation scheduling and data replication algorithms for data Grids , 2004 .

[15]  Qiang Song,et al.  CHEETAH virtual label switching router for dynamic provisioning in IP optical networks , 2008, Opt. Switch. Netw..

[16]  Miron Livny,et al.  DISC: A System for Distributed Data Intensive Scientific Computing , 2004, WORLDS.

[17]  Arie Shoshani,et al.  Storage resource managers: essential components for the Grid , 2003 .

[18]  Yossi Azar,et al.  Strongly Polynomial Algorithms for the Unsplittable Flow Problem , 2001, IPCO.

[19]  Kavitha Ranganathan,et al.  Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids , 2003, Journal of Grid Computing.

[20]  Chase Qishi Wu,et al.  Ultrascience net: network testbed for large-scale science applications , 2005, IEEE Communications Magazine.

[21]  J. Anthony Tyson,et al.  Large Synoptic Survey Telescope: Overview , 2002, SPIE Astronomical Telescopes + Instrumentation.

[22]  Erwin Laure The EU DataGrid Setting the Basis for Production Grids : Preface , 2004 .

[23]  Arie Shoshani,et al.  Advance Network Reservation and Provisioning for Science , 2009 .

[24]  Ian J. Taylor,et al.  The Triana Workflow Environment: Architecture and Applications , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[25]  Mehmet Balman,et al.  A new paradigm: Data-aware scheduling in grid computing , 2009, Future Gener. Comput. Syst..

[26]  Valentin Cristea,et al.  An Algorithm for File Transfer Scheduling in Grid Environments , 2009, ArXiv.

[27]  Arunabha Sen,et al.  Optimal routing for fast transfer of bulk data files in time-varying networks , 2004, 2004 IEEE International Conference on Communications (IEEE Cat. No.04CH37577).

[28]  Mehmet Balman,et al.  Early Error Detection and Classification in Data Transfer Scheduling , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[29]  Paolo Toth,et al.  Knapsack Problems: Algorithms and Computer Implementations , 1990 .

[30]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[31]  Sanjay Ranka,et al.  Scheduling Bulk File Transfers with Start and End Times , 2007, Sixth IEEE International Symposium on Network Computing and Applications (NCA 2007).

[32]  Mehmet Balman,et al.  Data-Aware Distributed Computing , 2012 .

[33]  Jon M. Kleinberg,et al.  Single-source unsplittable flow , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[34]  Admela Jukan,et al.  Advances in IP-optical networking for IP quad-play traffic and services , 2008, Opt. Switch. Netw..

[35]  Mehmet Balman,et al.  Data scheduling for large scale distributed applications , 2007 .

[36]  Rajkumar Buyya,et al.  The Gridbus toolkit for service oriented grid and utility computing: an overview and status report , 2004, 1st IEEE International Workshop on Grid Economics and Business Models, 2004. GECON 2004..

[37]  Sartaj Sahni,et al.  Bandwidth Scheduling and Path Computation Algorithms for Connection-Oriented Networks , 2007, Sixth International Conference on Networking (ICN'07).

[38]  Gigi Karmous-Edwards,et al.  Dynamic scheduling of network resources with advance reservations in optical grids , 2008, Int. J. Netw. Manag..

[39]  Hua Li,et al.  A varying-bandwidth list scheduling heuristic for file transfers , 2004, 2004 IEEE International Conference on Communications (IEEE Cat. No.04CH37577).

[40]  Jeffrey Xu Yu,et al.  Finding time-dependent shortest paths over large graphs , 2008, EDBT '08.

[41]  Francine Berman,et al.  Grid Computing: Making the Global Infrastructure a Reality , 2003 .

[42]  Adrian Vetta,et al.  A Priority-Based Model of Routing , 2008, Chic. J. Theor. Comput. Sci..

[43]  Miron Livny,et al.  Data placement in widely distributed systems , 2005 .

[44]  F. Manea,et al.  Solving a combinatorial problem with network flows , 2005 .

[45]  Mehmet Balman,et al.  Failure-awareness and dynamic adaptation in data scheduling , 2009 .

[46]  Ariel Orda,et al.  Networks with advance reservations: the routing perspective , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[47]  Ariel Orda,et al.  Shortest-path and minimum-delay algorithms in networks with time-dependent edge-length , 1990, JACM.

[48]  Sandy Irani,et al.  Scheduling with Conflicts on Bipartite and Interval Graphs , 2003, J. Sched..

[49]  Edwin A. Valentijn,et al.  Survey and other telescope technologies and discoveries , 2002 .

[50]  Rajkumar Buyya,et al.  A grid service broker for scheduling distributed data-oriented applications on global grids , 2004, MGC '04.

[51]  Miron Livny,et al.  Data pipelines: enabling large scale multi-protocol data transfers , 2004, MGC '04.

[52]  Randall R. Stewart,et al.  Stream Control Transmission Protocol , 2000, RFC.

[53]  Amit Kumar,et al.  Approximation Algorithms for the Unsplittable Flow Problem , 2002, Algorithmica.

[54]  Marek Piotrów A note on constructing binary heaps with periodic networks , 2002, Inf. Process. Lett..

[55]  Pascale Vicat-Blanc Primet,et al.  Flow scheduling and endpoint rate control in GridNetworks , 2009, Future Gener. Comput. Syst..

[56]  Olov Schelén,et al.  An agent-based architecture for advance reservations , 1997, Proceedings of 22nd Annual Conference on Local Computer Networks.

[57]  Chase Qishi Wu,et al.  Control Plane for Advance Bandwidth Scheduling in Ultra High-Speed Networks , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[58]  Edward G. Coffman,et al.  Scheduling File Transfers , 1985, SIAM J. Comput..

[59]  Bertram Ludäscher,et al.  Scientific workflow management and the Kepler system: Research Articles , 2006 .

[60]  Petr Kolman,et al.  A note on the greedy algorithm for the unsplittable flow problem , 2003, Inf. Process. Lett..

[61]  Mehmet Balman,et al.  Dynamically tuning level of parallelism in wide area data transfers , 2008, DADC '08.

[62]  Lars-Olof Burchard Networks with Advance Reservations: Applications, Architecture, and Performance , 2005, Journal of Network and Systems Management.

[63]  Samir Khuller,et al.  Large-scale data collection: a coordinated approach , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[64]  Steven Tuecke,et al.  GridFTP: Protocol Extensions to FTP for the Grid , 2001 .

[65]  Klara Nahrstedt,et al.  A distributed resource management architecture that supports advance reservations and co-allocation , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[66]  Jon Stokes Inside the Machine: An Illustrated Introduction to Microprocessors and Computer Architecture , 2006 .

[67]  Biswanath Mukherjee,et al.  Provisioning Data-Aggregation Sessions in Lambda Grids ? , 2008 .

[68]  李幼升,et al.  Ph , 1989 .

[69]  Dana Ron,et al.  Scheduling with conflicts: online and offline algorithms , 2009, J. Sched..

[70]  T. Kosar,et al.  From Micro- to Macro-processing: A Generic Data Management Model , 2007 .

[71]  Sartaj Sahni,et al.  An Evaluation of In-Advance Bandwidth Scheduling Algorithms for Connection-Oriented Networks , 2008, 2008 International Symposium on Parallel Architectures, Algorithms, and Networks (i-span 2008).

[72]  Clifford Stein,et al.  Improved approximation algorithms for unsplittable flow problems , 1997, Proceedings 38th Annual Symposium on Foundations of Computer Science.

[73]  Huyin Zhang,et al.  A Linked-List Data Structure for Advance Reservation Admission Control , 2005, ICCNMC.

[74]  H. T. Mouftah,et al.  Toward automated provisioning of advance reservation service in next-generation optical internet , 2006, IEEE Communications Magazine.

[75]  Eyas El-Qawasmeh,et al.  A polynomial time algorithm for the N-queens problem , 2004, Neural Networks and Computational Intelligence.

[76]  Francine Berman,et al.  Overview of the Book: Grid Computing – Making the Global Infrastructure a Reality , 2003 .

[77]  Ian T. Foster,et al.  The Globus project: a status report , 1998, Proceedings Seventh Heterogeneous Computing Workshop (HCW'98).

[78]  Mehmet Balman,et al.  Choosing Between Remote I/O versus Staging in Large Scale Distributed Applications , 2008, ISCA PDCCS.

[79]  Tao Wang,et al.  Bandwidth tree - a data structure for routing in networks with advanced reservations , 2002, Conference Proceedings of the IEEE International Performance, Computing, and Communications Conference (Cat. No.02CH37326).

[80]  Anne E. Trefethen,et al.  The Data Deluge: An e-Science Perspective , 2003 .

[81]  William E. Allcock,et al.  Reliable file transfer in Grid environments , 2002, 27th Annual IEEE Conference on Local Computer Networks, 2002. Proceedings. LCN 2002..

[82]  Ying Ding,et al.  Algorithms for High Performance, Wide-Area Distributed File Downloads , 2003, Parallel Process. Lett..

[83]  Farhad Shahrokhi,et al.  The maximum concurrent flow problem , 1990, JACM.

[84]  Jason Lee,et al.  Intra and Interdomain Circuit Provisioning Using the OSCARS Reservation System , 2006, 2006 3rd International Conference on Broadband Communications, Networks and Systems.

[85]  Miron Livny,et al.  Stork: making data placement a first class citizen in the grid , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[86]  Richard P. Mount The Office of Science Data-Management Challenge , 2005 .

[87]  Matthew S. Allen,et al.  The Livny and Plank-Beck Problems: Studies in Data Movement on the Computational Grid , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[88]  Edward G. Coffman,et al.  Scheduling file transfers in a distributed network , 1983, PODC '83.

[89]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[90]  Susana E. Deustua,et al.  Overview of the SuperNova/Acceleration Probe (SNAP) , 2002, SPIE Astronomical Telescopes + Instrumentation.

[91]  Sang Hyuk Son,et al.  A Priority-Based Scheduling Algorithm for Real-Time Databases , 1995, J. Inf. Sci. Eng..

[92]  Ian H. Witten,et al.  DEVELOPER'S GUIDE , 2001 .

[93]  Biju K. Raveendran,et al.  Evaluation of priority based real time scheduling algorithms: choices and tradeoffs , 2008, SAC '08.