Heuristics for Analyzing Download Time in MDS Coded Storage Systems

There has been a growing interest, in both theory and practice, in using the available redundancy in storage systems for mitigating stragglers in content download. This paper is concerned with MDS coded storage systems and studies (n, k) data access model. When $k$ = n, system is equivalent to a fork-join queue, which is known to be notoriously hard to analyze, while system with $k$ = 1 has been previously shown to be equivalent to an M/G/1 queue. We here argue that the system with $k$ = 2 is of practical interest, and then present a method that approximates the system as an M/G/1 queue. Approximated download time is shown to be more accurate than the bounds available in the literature. We also note that the presented method can be used for approximating systems that employ other newly designed and deployed storage codes.

[1]  Alan Scheller-Wolf,et al.  A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size , 2016, IEEE/ACM Transactions on Networking.

[2]  Cheng Huang,et al.  On the Locality of Codeword Symbols , 2011, IEEE Transactions on Information Theory.

[3]  Mor Harchol-Balter,et al.  Reducing Latency via Redundant Requests: Exact Analysis , 2015, SIGMETRICS 2015.

[4]  M. Aktaş,et al.  Analyzing the Download Time of Availability Codes , 2019 .

[5]  Emina Soljanin,et al.  Simplex Queues for Hot-Data Download , 2017, SIGMETRICS.

[6]  GhemawatSanjay,et al.  The Google file system , 2003 .

[7]  Kannan Ramchandran,et al.  The MDS queue: Analysing the latency performance of erasure codes , 2012, 2014 IEEE International Symposium on Information Theory.

[8]  Asser N. Tantawi,et al.  Approximate Analysis of Fork/Join Synchronization in Parallel Queues , 1988, IEEE Trans. Computers.

[9]  Charles Reiss,et al.  Towards understanding heterogeneous clouds at scale : Google trace analysis , 2012 .

[10]  Kannan Ramchandran,et al.  Codes can reduce queueing delay in data centers , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[11]  Emina Soljanin,et al.  Efficient Redundancy Techniques for Latency Reduction in Cloud Systems , 2015, ACM Trans. Model. Perform. Evaluation Comput. Syst..

[12]  Ulas C. Kozat,et al.  FAST CLOUD: Pushing the Envelope on Delay Performance of Cloud Storage With Coding , 2013, IEEE/ACM Transactions on Networking.

[13]  Emina Soljanin,et al.  Coding for fast content download , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[14]  Luiz André Barroso,et al.  The tail at scale , 2013, CACM.