Kollaps/Thunderstorm: Reproducible Evaluation of Distributed Systems

Reproducing experimental results is now widely regarded as one of the greatest impediments to the progress of science in general and of distributed systems research in particular. This stems from the increasing complexity of the systems under study and the inherent difficulty of capturing and controlling all the variables that can affect experimental results. We argue that this can only be addressed with a systematic approach covering all stages and aspects of the evaluation process, such as the environment in which the experiment runs, the configuration and software versions used, and the network characteristics, among others. In this tutorial paper, we focus on the networking aspect and discuss our ongoing research efforts and the tools we have developed to contribute to a more systematic and reproducible evaluation of large-scale distributed systems.