Abstract Monitoring systems give network administrators a better view andunderstanding of their networks. Amongst their many uses, they canbe used to audit computing assets, profile resource usage, and pinpointsecurity problems.Current monitoring systems have not really explored the limits ofmonitoring scalability, preferring to focus on other important issues suchas reliability and node discovery.We present a monitoring system that scale to over 100000 nodes. Ithas minimal local and global overhead, and maintains integrity in theface of transient network failure. Through a hierarchal organisation, ourmonitoring system can operate in multiple administrative zones.Since we did not have a large fleet of machines at our disposal, wesimulated a large fleet of machines, upon which a smaller network ofreal machines was overlayed. This system was the testbed for our scal-ability evaluations.In addition we include a web service interface, which allows accessto our system via HTTP. This frees consumers from the need to imple-ment special clients for interfacing with our system. InThis work is part of a larger project, Panopticon, which is a completemonitoring solution, including a database tier and visualisation client.
[1]
Robbert van Renesse,et al.
Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining
,
2003,
TOCS.
[2]
Robbert van Renesse,et al.
Scalable Management and Data Mining Using Astrolabe
,
2002,
IPTPS.
[3]
David E. Culler,et al.
Wide area cluster monitoring with Ganglia
,
2003,
2003 Proceedings IEEE International Conference on Cluster Computing.
[4]
David E. Culler,et al.
The ganglia distributed monitoring system: design, implementation, and experience
,
2004,
Parallel Comput..
[5]
Eric Anderson,et al.
Extensible, Scalable Monitoring for Clusters of Computers
,
1997,
LISA.
[6]
Jeffrey D. Case,et al.
Simple Network Management Protocol (SNMP)
,
1989,
RFC.
[7]
Ronald Minnich,et al.
Supermon: a high-speed cluster monitoring system
,
2002,
Proceedings. IEEE International Conference on Cluster Computing.
[8]
Roy Fielding,et al.
Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation
,
2000
.