Active Learning of Multiple Source Multiple Destination Topologies

We consider the problem of inferring the topology of a network with M sources and N receivers (an M-by- N network), by sending probes between the sources and receivers. Prior work has shown that this problem can be decomposed into two parts: first, infer smaller subnetwork components (1-by- N's or 2-by-2's) and then merge them to identify the M-by- N topology. We focus on the second part, which had previously received less attention in the literature. We assume that a 1-by- N topology is given and that all 2-by-2 components can be queried and learned using end-to-end probes. The problem is which 2-by-2's to query and how to merge them with the given 1-by- N, so as to exactly identify the 2-by- N topology, and optimize a number of performance metrics, including the number of queries (which directly translates into measurement bandwidth), time complexity, and memory usage. We provide a lower bound, [N/2], on the number of 2-by-2's required by any active learning algorithm and propose two greedy algorithms. The first algorithm follows the framework of multiple hypothesis testing, in particular Generalized Binary Search (GBS). The second algorithm is called the Receiver Elimination Algorithm (REA) and follows a bottom-up approach. It requires exactly N-1 steps, which is much less than all (2N) possible 2-by-2's. Simulation results demonstrate that both algorithms correctly identify the 2-by- N topology and are near-optimal, but REA is more efficient in practice.

[1]  Donald F. Towsley,et al.  Multicast topology inference from measured end-to-end loss , 2002, IEEE Trans. Inf. Theory.

[2]  Robert D. Nowak,et al.  Minimax Bounds for Active Learning , 2007, IEEE Transactions on Information Theory.

[3]  Jin Cao,et al.  Network Tomography: Identifiability and Fourier Domain Estimation , 2007, IEEE Transactions on Signal Processing.

[4]  Ramesh Govindan,et al.  Heuristics for Internet map discovery , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[5]  Robert D. Nowak,et al.  Multiple source, multiple destination network tomography , 2004, IEEE INFOCOM 2004.

[6]  Paul Barford,et al.  Toward the Practical Use of Network Tomography for Internet Topology Discovery , 2010, 2010 Proceedings IEEE INFOCOM.

[7]  Sanjoy Dasgupta,et al.  Coarse sample complexity bounds for active learning , 2005, NIPS.

[8]  Christina Fragouli,et al.  Topology inference using network coding , 2012 .

[9]  Bruce M. Maggs,et al.  Globally Distributed Content Delivery , 2002, IEEE Internet Comput..

[10]  Robert D. Nowak,et al.  Multiple-Source Internet Tomography , 2006, IEEE Journal on Selected Areas in Communications.

[11]  Nick G. Duffield,et al.  Network tomography from measured end-to-end delay covariance , 2004, IEEE/ACM Transactions on Networking.

[12]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[13]  Akshay Krishnamurthy,et al.  Robust multi-source network tomography using selective probes , 2012, 2012 Proceedings IEEE INFOCOM.

[14]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[15]  Michael Rabbat,et al.  Active learning of multiple source multiple destination topologies , 2013, CISS.

[16]  Alfred O. Hero,et al.  Hierarchical Inference of Unicast Network Topologies Based on End-to-End Measurements , 2007, IEEE Transactions on Signal Processing.

[17]  Ratul Mahajan,et al.  Measuring ISP topologies with Rocketfuel , 2004, IEEE/ACM Transactions on Networking.

[18]  Randy Bush,et al.  iSPY: Detecting IP Prefix Hijacking on My Own , 2008, IEEE/ACM Transactions on Networking.

[19]  B.K. Dey,et al.  Network tomography via network coding , 2008, 2008 Information Theory and Applications Workshop.

[20]  Robert D. Nowak,et al.  Merging logical topologies using end-to-end measurements , 2003, IMC '03.

[21]  Ítalo S. Cunha,et al.  Measurement methods for fast and accurate blackhole identification with binary tomography , 2009, IMC '09.

[22]  Robert D. Nowak,et al.  Network delay tomography , 2003, IEEE Trans. Signal Process..

[23]  Bill Cheswick,et al.  Mapping and Visualizing the Internet , 2000, USENIX Annual Technical Conference, General Track.

[24]  Vincent Y. F. Tan,et al.  Learning Latent Tree Graphical Models , 2010, J. Mach. Learn. Res..

[25]  Steven McCanne,et al.  Inference of multicast routing trees and bottleneck bandwidths using end-to-end measurements , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[26]  Bin Yu,et al.  Maximum pseudo likelihood estimation in network tomography , 2003, IEEE Trans. Signal Process..

[27]  Robert Nowak,et al.  Network Tomography: Recent Developments , 2004 .

[28]  David Wetherall,et al.  Studying Black Holes in the Internet with Hubble , 2008, NSDI.

[29]  Robert D. Nowak,et al.  Maximum likelihood network topology identification from edge-based unicast measurements , 2002, SIGMETRICS '02.

[30]  Judea Pearl,et al.  Structuring causal trees , 1986, J. Complex..

[31]  Avinatan Hassidim,et al.  Topology discovery of sparse random graphs with few participants , 2011, SIGMETRICS '11.

[32]  K. Claffy,et al.  Topology discovery by active probing , 2002, Proceedings 2002 Symposium on Applications and the Internet (SAINT) Workshops.

[33]  Yang Li,et al.  On Delay Tomography: Fast Algorithms and Spatially Dependent Models , 2012, IEEE Transactions on Signal Processing.

[34]  Christina Fragouli,et al.  Multiple source multiple destination topology inference using network coding , 2009, 2009 Workshop on Network Coding, Theory, and Applications.

[35]  Christina Fragouli,et al.  Active topology inference using network coding , 2010, Phys. Commun..

[36]  Nick G. Duffield,et al.  Adaptive multicast topology inference , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[37]  Alfred O. Hero,et al.  Unicast-based inference of network link delay distributions with finite mixture models , 2003, IEEE Trans. Signal Process..

[38]  Jian Ni,et al.  Efficient and Dynamic Routing Topology Inference From End-to-End Measurements , 2010, IEEE/ACM Transactions on Networking.

[39]  Ronald L. Rivest,et al.  Constructing Optimal Binary Decision Trees is NP-Complete , 1976, Inf. Process. Lett..

[40]  Robert D. Nowak,et al.  The Geometry of Generalized Binary Search , 2009, IEEE Transactions on Information Theory.

[41]  Jian Ni,et al.  Network tomography based on additive metrics , 2008, 2008 42nd Annual Conference on Information Sciences and Systems.