Discovering Dependencies for Network Management

It is lamentable that Leslie Lamport’s famous quote [9] “A distributed system is one in which the failure of a computer you didn’t even know existed can render your own computer unusable” describes a scenario familiar to almost every computer user. As IT systems are increasingly distributed, it is not only the clients and servers themselves that can render a computer useless for an afternoon, but any of the many routers, links and network services also involved. In distributed systems, the underlying problem is the absence of tools to identify the components that “can render your own computer unusable”: the implicit web of dependencies among these components exists only in the minds of the human experts running them. The complexity of these dependencies quickly adds up, requiring more help than traditional IT management software provides. Listing the contents of a single DFS 1 directory, for example, can involve a minimum of three hosts and eight network services (WINS, ICMP Echo, SMB, DFS, DNS, Kerberos, ISA key exchange, ARP). Existing management solutions focus on network elements, topology discovery, or particular services, but what is needed are tools to manage and improve the user’s end-to-end experience of networked applications. In deference to Lamport, this paper defines the Leslie Graph as the graph representing the dependencies between the system components, with subgraphs representing the dependencies pertaining to a particular application or activity. Nodes represent the computers, routers and services on which user activities rely, and directed edges capture their inter-dependencies. Different versions of a Leslie Graph can express different granularities of dependence for an activity — for some analyses, an Leslie Graph capturing intermachine dependences at the granularity of IP addresses might be sufficient, while for others an Leslie Graph capturing inter-service dependencies at the granularity of software processes might be desirable. This paper makes three contributions: (i) we define Leslie Graphs and discuss the challenges in finding them, (ii) we suggest important problems that Leslie Graphs could help

[1]  David G. Stork,et al.  Pattern Classification , 1973 .

[2]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[3]  大西 仁,et al.  Pearl, J. (1988, second printing 1991). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan-Kaufmann. , 1994 .

[4]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[5]  Aaron B. Brown,et al.  An active approach to characterizing dynamic dependencies for problem determination in a distributed environment , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[6]  Marcos K. Aguilera,et al.  Performance debugging for distributed systems of black boxes , 2003, SOSP '03.

[7]  Archana Ganapathi,et al.  Why Do Internet Services Fail, and What Can Be Done About It? , 2002, USENIX Symposium on Internet Technologies and Systems.

[8]  Rajeev Rastogi,et al.  Topology discovery in heterogeneous IP networks: the NetInventory system , 2004, IEEE/ACM Transactions on Networking.

[9]  Cédric Fournet,et al.  Ethernet topology discovery without network assistance , 2004, Proceedings of the 12th IEEE International Conference on Network Protocols, 2004. ICNP 2004..

[10]  Richard Mortier,et al.  Using Magpie for Request Extraction and Workload Modelling , 2004, OSDI.

[11]  David A. Patterson,et al.  Path-Based Failure and Evolution Management , 2004, NSDI.

[12]  Jason Lee,et al.  A first look at modern enterprise traffic , 2005, IMC '05.

[13]  Patrick D. McDaniel,et al.  Analysis of Communities of Interest in Data Networks , 2005, PAM.

[14]  Marcos K. Aguilera,et al.  WAP5: black-box performance debugging for wide-area systems , 2006, WWW '06.