Monitoring Testbed Experiments with MonEx

Almost all testbed experiments deal with different kinds of metrics which are collected from and/or about various kinds of resources. Despite the importance of collecting experiment metrics in the experiment life cycle, this phase is often done via ad hoc, manual, and artisanal actions such as manually combining multiple scripts, or manipulating some missing values. A few tools (e.g. Vendetta, OML) can be used for monitoring experiments. However their work is restricted to communicating metrics towards a central server, and they do not cover different features from user perspective such as drawing and archiving experiments results. In this talk, we will firstly discuss the requirements of experiment monitoring. Having a well-defined set of requirements eliminates the potential ambiguity around what should be targeted by any Experiment Monitoring Framework (EMF). The defined requirements are not testbed dependent nor technology-dependent, so any testbed community can build their own EMF by implementing these requirements, using different software systems. We will then describe our own proposition, MonEx (for long: Monitoring Experiments), which is an EMF that satisfies all the defined requirements. MonEx is built over several off-the-shelf infrastructure monitoring tools, and supports various monitoring approaches such as pull- and push-based monitoring, agent-based and agent-less monitoring. MonEx covers all the required steps of monitoring experiments from collecting metrics to archiving experiments data and producing figures. We will then demonstrate MonEx’s usability through a set of experiments , performed on the Grid’5000 testbed and being monitored by MonEx. Each of those experiments have different requirements, and as a group they show how MonEx meets all defined requirements. We show how MonEx nicely integrates the experimental workflow and how it simplifies the monitoring task, reducing the efforts of users during experimentation and pushing towards the repeatability of experiments’ analysis and metrics comparison.

[1]  Maximilian Ott,et al.  OMF: a control and management framework for networking testbeds , 2010, OPSR.

[2]  Rihards Olups,et al.  Zabbix 1.8 Network Monitoring , 2010 .

[3]  Olof Rensfelt,et al.  Vendetta - A Tool for Flexible Monitoring and Management of Distributed Testbeds , 2007, 2007 3rd International Conference on Testbeds and Research Infrastructure for the Development of Networks and Communities.

[4]  Olivier Richard,et al.  Lightweight emulation to study peer-to-peer systems , 2008 .

[5]  Andrea C. Arpaci-Dusseau,et al.  The Popper Convention: Making Reproducible Systems Evaluation Practical , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[6]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[7]  Emmanuel Jeannot,et al.  Adding Virtualization Capabilities to the Grid'5000 Testbed , 2012, CLOSER.

[8]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[9]  M. R. Salvador,et al.  FIBRE - An International Testbed for Future Internet Experimentation , 2014 .

[10]  Patricia Morreale,et al.  Scalable Agentless Cloud Network Monitoring , 2017, 2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud).

[11]  Laurent Lefèvre,et al.  A Unified Monitoring Framework for Energy Consumption and Network Traffic , 2015, EAI Endorsed Trans. Energy Web.

[12]  Emmanuel Jeanvoine,et al.  Design and Evaluation of a Virtual Experimental Environment for Distributed Systems , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[13]  Manpreet Singh,et al.  ORBIT Measurements framework and library (OML): motivations, implementation and features , 2005, First International Conference on Testbeds and Research Infrastructures for the DEvelopment of NeTworks and COMmunities.

[14]  Leandros Tassiulas,et al.  Methodology and Tools for Measurements on Wireless Testbeds: The NITOS Approach , 2012, FP7 FIRE/EULER.

[15]  Tobias Oetiker,et al.  MRTG: The Multi Router Traffic Grapher , 1998, LISA.