Lowering Inter-datacenter Bandwidth Costs via Bulk Data Scheduling

Cloud service providers (CSP) of today operate multiple data centers, over which they provide resilient infrastructure, data storage and compute services. The links between data centers have very high capacity, and are typically purchased by the CSPs using established billing practices, such as 95-thpercentile billing or average-usage billing. These links are used to serve both client traffic as well as CSP-specific bulk data traffic, such as backup jobs, etc. Past studies have shown a diurnal pattern of traffic over such links. However, CSPs pay for the peak bandwidth, which implies that they are under-utilizing the capacity for which they have paid for. We propose a scheduling framework that considers various classes of jobs that are encountered over such links, and propose GRESE, an algorithm that attempts to minimize overall bandwidth costs to the CSP, by leveraging the flexible nature of the deadlines of these bulk data jobs. We demonstrate the problem is not a simple extension of any well-known scheduling problems, and show how the GRESE algorithm is effective in curtailing CSP bandwidth costs.

[1]  Matthew Mathis,et al.  The macroscopic behavior of the TCP congestion avoidance algorithm , 1997, CCRV.

[2]  Edward G. Coffman,et al.  Approximation algorithms for bin packing: a survey , 1996 .

[3]  Michael A. Bender,et al.  Flow and stretch metrics for scheduling continuous job streams , 1998, SODA '98.

[4]  Pablo Rodriguez,et al.  Delay-Tolerant Bulk Data Transfers on the Internet , 2009, IEEE/ACM Transactions on Networking.

[5]  Zhi-Li Zhang,et al.  A first look at inter-data center traffic characteristics via Yahoo! datasets , 2011, 2011 Proceedings IEEE INFOCOM.

[6]  Albert Mo Kim Cheng,et al.  Optimal Scheduling of Urgent Preemptive Tasks , 2010, 2010 IEEE 16th International Conference on Embedded and Real-Time Computing Systems and Applications.

[7]  Pascale Vicat-Blanc Primet,et al.  Scheduling deadline-constrained bulk data transfers to minimize network congestion , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[8]  Michael Sirivianos,et al.  Inter-datacenter bulk transfers with netstitcher , 2011, SIGCOMM.

[9]  Fatiha Bouabache,et al.  Planning Large Data Transfers in Institutional Grids , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[10]  Ming Zhang,et al.  Understanding data center traffic characteristics , 2010, CCRV.

[11]  Albert G. Greenberg,et al.  The nature of data center traffic: measurements & analysis , 2009, IMC '09.

[12]  Ilias Giechaskiel,et al.  Delay Tolerant Bulk Data Transfers on the Internet , 2014 .

[13]  Éva Tardos,et al.  Scheduling data transfers in a network and the set scheduling problem , 2003, J. Algorithms.