S3: a scalable sensing service for monitoring large networked systems

Efficiently operating and managing large scale distributed and federated systems is an extremely challenging problem. Current solutions are a combination of centralized management and significant over-provisioning of the infrastructure. With the explosion of new resource-intensive media applications and services, over provisioning of the infrastructure is no longer a viable option. Timely and accurate knowledge of the global environment (particularly the highly dynamic network path properties) is necessary for management of performance SLAs, just-in-time resource provisioning, near-optimal dynamic service placement and reuse, construction of network service overlays, and fast detection of failures and malicious attacks. Further, different applications require information about different aspects of the environment at different timescales. We propose S3, a Scalable Sensing Service, that achieves the above requirements and enables personalized sensing of the environment as dictated by applications.

[1]  Mark Handley,et al.  Topologically-aware overlay construction and server selection , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[2]  Sridhar Srinivasan,et al.  M-coop: a scalable infrastructure for network measurement , 2003, Proceedings the Third IEEE Workshop on Internet Applications. WIAPP 2003.

[3]  David R. Karger,et al.  Chord: a scalable peer-to-peer lookup protocol for internet applications , 2003, TNET.

[4]  Parameswaran Ramanathan,et al.  What do packet dispersion techniques measure? , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[5]  Srinivasan Seshan,et al.  IrisNet: An Architecture for a Worldwide Sensor Web , 2003, IEEE Pervasive Comput..

[6]  Randy H. Katz,et al.  An algebraic approach to practical and scalable overlay network monitoring , 2004, SIGCOMM '04.

[7]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[8]  Paul Francis,et al.  IDMaps: a global internet host distance estimation service , 2001, TNET.

[9]  Sriram Ramabhadran,et al.  NetProfiler: Profiling Wide-Area Networks Using Peer Cooperation , 2005, IPTPS.

[10]  Helen J. Wang,et al.  Strider: a black-box, state-based approach to change and configuration management and support , 2003, Sci. Comput. Program..

[11]  Michael Dahlin,et al.  A scalable distributed information management system , 2004, SIGCOMM.

[12]  G. Weikum Querying the Internet with PIER , 2005 .

[13]  Larry L. Peterson,et al.  Sophia: an Information Plane for networked systems , 2004, Comput. Commun. Rev..

[14]  Ratul Mahajan,et al.  User-level internet path diagnosis , 2003, SOSP '03.

[15]  Krishna P. Gummadi,et al.  SProbe: A fast tool for measuring bottleneck bandwidth in un-cooperative environments , 2002 .

[16]  Sujata Banerjee,et al.  Distributed Querying of Internet Distance Information , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[17]  Richard G. Baraniuk,et al.  pathChirp: Efficient available bandwidth estimation for network paths , 2003 .

[18]  Yin Zhang,et al.  INSIGHT: a distributed monitoring system for tracking continuous queries , 2005, SOSP '05.

[19]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[20]  Sujata Banerjee,et al.  Netvigator: Scalable Network Proximity Estimation , 2005 .

[21]  Hui Zhang,et al.  Predicting Internet network distance with coordinates-based approaches , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[22]  David D. Clark,et al.  A knowledge plane for the internet , 2003, SIGCOMM '03.

[23]  Scott Shenker,et al.  The Network Oracle , 2005, IEEE Data Eng. Bull..

[24]  Robbert van Renesse,et al.  Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining , 2003, TOCS.

[25]  M. Frans Kaashoek,et al.  A measurement study of available bandwidth estimation tools , 2003, IMC '03.

[26]  Emin Gün Sirer,et al.  Meridian: a lightweight network location service without virtual coordinates , 2005, SIGCOMM '05.

[27]  Helen J. Wang,et al.  Automatic Misconfiguration Troubleshooting with PeerPressure , 2004, OSDI.

[28]  Jon Crowcroft,et al.  Lighthouses for Scalable Distributed Location , 2003, IPTPS.