Efficient Coflow Transmission for Distributed Stream Processing

Distributed streaming applications require the underlying network flows to transmit packets continuously to keep their output results fresh. These results will become stale if no updates come, and their staleness is determined by the slowest flow. At this point, coflows can be semantically comprised. Hence, efficient coflow transmission is critical for streaming applications. However, prior coflow-based solutions have significant limitations. They use a one-shot performance metric—CCT (coflow completion time), which cannot continuously reflect the staleness of the output results for a streaming application.To this end, we propose a new performance metric—coflow age (CA), for coflows generated by distributed streaming applications. The CA tracks the longest time-since-last-service among all flows in a coflow. In such a context, we consider a data center network with multiple coflows that continuously transmit packets between their source-destination pairs and address the problem of minimizing the average long-term CA while simultaneously satisfying the throughput constraints from the coflows. To solve this problem efficiently, we design a randomized algorithm and a drift-plus-age algorithm, and show that they can make the average long-term CA to achieve nearly two times and arbitrarily close to the optimal value, respectively. Through extensive simulations, we further demonstrate that both of the proposed algorithms can significantly reduce the CA of coflows, without violating the throughput requirement of any coflow, when compared to the state-of-the-art solution.

[1]  Rong Pan,et al.  Let It Flow: Resilient Asymmetric Load Balancing with Flowlet Switching , 2017, NSDI.

[2]  Xiaobo Zhou,et al.  Shaping Deadline Coflows to Accelerate Non-Deadline Coflows , 2018, 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS).

[3]  Paramvir Bahl,et al.  Live Video Analytics at Scale with Approximation and Delay-Tolerance , 2017, NSDI.

[4]  Ion Stoica,et al.  Coflow: a networking abstraction for cluster applications , 2012, HotNets-XI.

[5]  Eytan Modiano,et al.  Scheduling Algorithms for Optimizing Age of Information in Wireless Networks With Throughput Constraints , 2019, IEEE/ACM Transactions on Networking.

[6]  Anthony Ephremides,et al.  Optimal Link Scheduling for Age Minimization in Wireless Systems , 2018, IEEE Transactions on Information Theory.

[7]  Jipeng Zhou,et al.  Efficient online coflow routing and scheduling , 2016, MobiHoc.

[8]  Wei Lin,et al.  StreamScope: Continuous Reliable Distributed Processing of Big Data Streams , 2016, NSDI.

[9]  Amin Vahdat,et al.  Sincronia: near-optimal network design for coflows , 2018, SIGCOMM.

[10]  Roy D. Yates,et al.  Update or wait: How to keep your data fresh , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[11]  Weng-Fai Wong,et al.  Gloss: Seamless Live Reconfiguration and Reoptimization of Stream Programs , 2018, ASPLOS.

[12]  Baochun Li,et al.  Wide-Area Spark Streaming: Automated Routing and Batch Sizing , 2019, IEEE Transactions on Parallel and Distributed Systems.

[13]  Yuan Yao,et al.  Data centers power reduction: A two time scale approach for delay tolerant workloads , 2012, 2012 Proceedings IEEE INFOCOM.

[14]  R. Gallager Stochastic Processes , 2014 .

[15]  Michael J. Freedman,et al.  Aggregation and Degradation in JetStream: Streaming Analytics in the Wide Area , 2014, NSDI.

[16]  Yanhui Geng,et al.  CODA: Toward Automatically Identifying and Scheduling Coflows in the Dark , 2016, SIGCOMM.

[17]  Ion Stoica,et al.  Efficient coflow scheduling with Varys , 2015, SIGCOMM.

[18]  Devavrat Shah,et al.  Fastpass: a centralized "zero-queue" datacenter network , 2015, SIGCOMM 2015.

[19]  Ion Stoica,et al.  Efficient Coflow Scheduling Without Prior Knowledge , 2015, SIGCOMM.

[20]  Bo Li,et al.  Optimizing coflow completion times with utility max-min fairness , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[21]  Ramesh K. Sitaraman,et al.  Trading Timeliness and Accuracy in Geo-Distributed Streaming Analytics , 2016, SoCC.

[22]  Xiaobo Zhou,et al.  Leveraging Endpoint Flexibility when Scheduling Coflows across Geo-distributed Datacenters , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.