Reco: Efficient Regularization-Based Coflow Scheduling in Optical Circuit Switches

To improve the application-level data efficiency, the scheduling of coflows, defined as a collection of parallel flows sharing the same objective, is prevailing in recent data centers. Meanwhile, optical circuit switches (OCS) are gradually applied to provide high data rate with low power consumption. However, so far few research outputs have covered the flow scheduling in the context of OCS, let alone the coflow scheduling problems. In this paper, we investigate coflow scheduling in the OCS-based data centers. We first derive a novel operation called regularization processed respectively on the flow traffic demands and the flow start times. Regularization can be efficiently implemented and reduce the circuit reconfiguration frequency dramatically. We then propose a 2-approximation algorithm, called Reco-Sin, for single coflow scheduling to minimize the coflow completion time (CCT). For multiple coflows, we derive another approximation algorithm, called Reco-Mul, to minimize the total weighted CCT, which can transform any non-preemptive multi-coflow scheduling in packet switches to that in OCS. Extensive simulations based on Facebook data traces show that Reco-Sin and Reco-Mul outperform state-of-the-art schemes significantly, i.e., one single coflow can be finished up to 2.72× faster with Reco-Sin, and multiple coflows can be completed up to 3.44× faster with Reco-Mul.

[1]  Richard A. Brualdi,et al.  Notes on the Birkhoff Algorithm for Doubly Stochastic Matrices , 1982, Canadian Mathematical Bulletin.

[2]  Zizhan Zheng,et al.  Online Partial Throughput Maximization for Multidimensional Coflow , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[3]  Bo Li,et al.  Optimizing coflow completion times with utility max-min fairness , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[4]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[5]  Amin Vahdat,et al.  Helios: a hybrid electrical/optical switch architecture for modular data centers , 2010, SIGCOMM '10.

[6]  T. S. Eugene Ng,et al.  Sunflow: Efficient Optical Circuit Scheduling for Coflows , 2016, CoNEXT.

[7]  Marvin Marcus,et al.  DIAGONALS OF DOUBLY STOCHASTIC MATRICES , 1959 .

[8]  Qingkai Liang,et al.  Coflow scheduling in input-queued switches: Optimal delay scaling and algorithms , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[9]  John von Neumann,et al.  1. A Certain Zero-sum Two-person Game Equivalent to the Optimal Assignment Problem , 1953 .

[10]  Javad Ghaderi,et al.  An Improved Bound for Minimizing the Total Weighted Completion Time of Coflows in Datacenters , 2017, IEEE/ACM Transactions on Networking.

[11]  Konstantina Papagiannaki,et al.  c-Through: part-time optics in data centers , 2010, SIGCOMM 2010.

[12]  Ion Stoica,et al.  Coflow: a networking abstraction for cluster applications , 2012, HotNets-XI.

[13]  William J. Dally,et al.  Guaranteed scheduling for switches with configuration overhead , 2003, IEEE/ACM Trans. Netw..

[14]  He Liu,et al.  Circuit Switching Under the Radar with REACToR , 2014, NSDI.

[15]  Tara Javidi,et al.  End-to-end scheduling for all-optical data centers , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[16]  Srinivasan Seshan,et al.  Scheduling techniques for hybrid circuit/packet networks , 2015, CoNEXT.

[17]  Amin Vahdat,et al.  Integrating microsecond circuit switching into the data center , 2013, SIGCOMM.

[18]  Yanhui Geng,et al.  CODA: Toward Automatically Identifying and Scheduling Coflows in the Dark , 2016, SIGCOMM.

[19]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[20]  Yuan Zhong,et al.  Minimizing the Total Weighted Completion Time of Coflows in Datacenter Networks , 2015, SPAA.

[21]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[22]  Amin Vahdat,et al.  Sincronia: near-optimal network design for coflows , 2018, SIGCOMM.

[23]  Bo Li,et al.  Utopia: Near-optimal Coflow Scheduling with Isolation Guarantee , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[24]  Xiaobo Zhou,et al.  Leveraging Endpoint Flexibility when Scheduling Coflows across Geo-distributed Datacenters , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[25]  Nick McKeown,et al.  pFabric: minimal near-optimal datacenter transport , 2013, SIGCOMM.

[26]  Chen Tian,et al.  Scheduling Cofiows of Multi-stage Jobs to Minimize the Total Weighted Job Completion Time , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[27]  Ion Stoica,et al.  Efficient Coflow Scheduling Without Prior Knowledge , 2015, SIGCOMM.

[28]  Ion Stoica,et al.  Efficient coflow scheduling with Varys , 2015, SIGCOMM.

[29]  Fanny Dufossé,et al.  Notes on Birkhoff-von Neumann decomposition of doubly stochastic matrices , 2016 .

[30]  J. Edmonds Paths, Trees, and Flowers , 1965, Canadian Journal of Mathematics.

[31]  Jipeng Zhou,et al.  Efficient online coflow routing and scheduling , 2016, MobiHoc.

[32]  Chao Xu,et al.  OMCO: Online Multiple Coflow Scheduling in Optical Circuit Switch , 2018, 2018 IEEE International Conference on Communications (ICC).