Inferring Degrees from Incomplete Networks and Nonlinear Dynamics

Inferring topological characteristics of complex networks from observed data is critical to understand the dynamical behavior of networked systems, ranging from the Internet and the World Wide Web to biological networks and social networks. Prior studies usually focus on the structure-based estimation to infer network sizes, degree distributions, average degrees, and more. Little effort attempted to estimate the specific degree of each vertex from a sampled induced graph, which prevents us from measuring the lethality of nodes in protein networks and influencers in social networks. The current approaches dramatically fail for a tiny sampled induced graph and require a specific sampling method and a large sample size. These approaches neglect information of the vertex state, representing the dynamical behavior of the networked system, such as the biomass of species or expression of a gene, which is useful for degree estimation. We fill this gap by developing a framework to infer individual vertex degrees using both information of the sampled topology and vertex state. We combine the mean-field theory with combinatorial optimization to learn vertex degrees. Experimental results on real networks with a variety of dynamics demonstrate that our framework can produce reliable degree estimates and dramatically improve existing link prediction methods by replacing the sampled degrees with our estimated degrees.

[1]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[2]  Owen L. Petchey,et al.  Ecological Networks in a Changing Climate , 2010 .

[3]  Pili Hu,et al.  A Survey and Taxonomy of Graph Sampling , 2013, ArXiv.

[4]  Jianxi Gao,et al.  True Nonlinear Dynamics from Incomplete Networks , 2020, AAAI.

[5]  Andrei Broder,et al.  Proceedings of the 23rd International Conference on World Wide Web , 2014, WWW 2014.

[6]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[7]  Sabrina S Wilson Radiology , 1938, Glasgow Medical Journal.

[8]  N. Stanietsky,et al.  The interaction of TIGIT with PVR and PVRL2 inhibits human NK cell cytotoxicity , 2009, Proceedings of the National Academy of Sciences.

[9]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[10]  Dana Ron,et al.  Provable and Practical Approximations for the Degree Distribution using Sublinear Graph Samples , 2018, WWW.

[11]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[12]  Amin Karbasi,et al.  Seeing the Unseen Network: Inferring Hidden Social Ties from Respondent-Driven Sampling , 2016, AAAI.

[13]  Athina Markopoulou,et al.  Graph Size Estimation , 2012, ArXiv.

[14]  Bruce D. Spencer,et al.  Estimating network degree distributions under sampling: An inverse problem, with applications to monitoring social media networks , 2013, 1305.4977.

[15]  Ove Frank,et al.  Estimation of the number of vertices of different degrees in a graph , 1980 .

[16]  Carsten Wiuf,et al.  Sampling properties of random graphs: the degree distribution. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  J. Bunge,et al.  Estimating the Number of Species: A Review , 1993 .

[18]  Cristopher Moore,et al.  On the bias of traceroute sampling: or, power-law degree distributions in regular graphs , 2005, STOC '05.

[19]  Giorgio Parisi,et al.  Physica A: Statistical Mechanics and its Applications: Editorial note , 2005 .

[20]  Yihong Wu,et al.  Counting Motifs with Graph Sampling , 2018, COLT.

[21]  Edo Liberty,et al.  Estimating sizes of social networks via biased sampling , 2011, Internet Math..

[22]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[23]  Donald F. Towsley,et al.  On the estimation accuracy of degree distributions from graph sampling , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[24]  Meeyoung Cha,et al.  Social bootstrapping: how pinterest and last.fm social communities benefit by borrowing links from facebook , 2014, WWW.

[25]  Ulrik Brandes,et al.  Social Networks , 2013, Handbook of Graph Drawing and Visualization.

[26]  Oded Goldreich,et al.  Approximating average parameters of graphs , 2008 .

[27]  Anirban Dasgupta,et al.  On estimating the average degree , 2014, WWW.

[28]  J. Skolnick,et al.  Prediction of physical protein–protein interactions , 2005, Physical biology.

[29]  Cristopher Moore,et al.  On the bias of traceroute sampling: Or, power-law degree distributions in regular graphs , 2005, JACM.

[30]  Uri Alon,et al.  Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs , 2004, Bioinform..

[31]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.

[32]  Elisa Bertino,et al.  Proceedings of the 20th international conference on World wide web , 2011, WWW 2011.

[33]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .