The Marginal Benefit of Monitor Placement on Networks

Inferring the structure of an unknown network is a difficult problem of interest to researchers, academics , and industrialists . We develop a novel algorithm to infer nodes and edges in an unknown network. Our algorithm utilizes monitors that detect incident edges and adjacent nodes with their labels and degrees. The algorithm infers the network through a preferential random walk with a probabilistic restart at a previously discovered but unmonitored node, or a random teleportation to an unexplored node. Our algorithm outperforms random walk inference and random placement of monitors inference in edge discovery in all test cases. Our algorithm outperforms both methodologies in node inference in synthetic test networks; on real networks it outperforms them in the beginning of the inference. Finally, a website was created where these algorithms can be tested live on preloaded networks or custom networks as desired by the user. The visualization also displays the network as it is being inferred, and provides other statistics about the real and inferred networks.

[1]  Hirotaka Ono,et al.  Maximum Domination Problem , 2011, CATS.

[2]  Thomas Erlebach,et al.  Network Discovery and Verification , 2005, WG.

[3]  O. Ore Theory of Graphs , 1962 .

[4]  E. Ziv,et al.  Inferring network mechanisms: the Drosophila melanogaster protein interaction network. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[6]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[7]  Lada A. Adamic,et al.  Search in Power-Law Networks , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  W. Eric L. Grimson,et al.  Inference of non-overlapping camera network topology by measuring statistical dependence , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Christopher M. Danforth,et al.  Estimation of Global Network Statistics from Incomplete Data , 2014, PloS one.

[10]  Matteo Magnani,et al.  Missing Data in Multiplex Networks: A Preliminary Study , 2014, 2014 Tenth International Conference on Signal-Image Technology and Internet-Based Systems.

[11]  Forrest W. Crawford A recruitment model and population size estimation for respondent-driven sampling , 2014 .

[12]  Marc Timme,et al.  Inferring network topology from complex dynamics , 2010, 1007.1640.

[13]  Y. Vardi,et al.  Network Tomography: Estimating Source-Destination Traffic Intensities from Link Data , 1996 .

[14]  Robert D. Nowak,et al.  Maximum likelihood network topology identification from edge-based unicast measurements , 2002, SIGMETRICS '02.

[15]  Donald F. Towsley,et al.  Multicast topology inference from measured end-to-end loss , 2002, IEEE Trans. Inf. Theory.

[16]  Christos Faloutsos,et al.  Sampling from large graphs , 2006, KDD '06.

[17]  Miguel Rio,et al.  Network topologies: inference, modeling, and generation , 2008, IEEE Communications Surveys & Tutorials.

[18]  P. Biernacki,et al.  Snowball Sampling: Problems and Techniques of Chain Referral Sampling , 1981 .

[19]  Mike H. MacGregor,et al.  Network Topology Inference from End-to-End Unicast Measurements , 2013, 2013 27th International Conference on Advanced Information Networking and Applications Workshops.

[20]  Matteo Magnani,et al.  Investigating the types and effects of missing data in multilayer networks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).