Network Topology Inference

Network graphs are constructed in all sorts of ways and to varying levels of completeness. In some settings, there is little if any uncertainty in assessing whether or not an edge exists between two vertices and we can exhaustively assess incidence between vertex pairs. For example, in examining one’s own network of Facebook friends, the presence or absence of an edge can be assessed through direct inspection. In other settings, however, constructing a network graph is not so straightforward. We may have information only on the status of some of the potential edges in the network, but not all. Alternatively, we may not even have the ability to directly assess whether or not an edge is present. Rather, it may be that we can only measure vertex or edge attributes that are to some extent predictive of edge status. In such cases, it can be natural to consider the task of constructing a network graph representation from the available data as one of statistical inference.

[1]  D. Goldberg,et al.  Assessing experimentally derived interactions in a small world , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[3]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[4]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[5]  Susan Holmes,et al.  Statistics for phylogenetic trees. , 2003, Theoretical population biology.

[6]  Susan Holmes,et al.  Phylogenies: An Overview , 1997 .

[7]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[8]  Eric D. Kolaczyk,et al.  Statistical Analysis of Network Data , 2009 .

[9]  Mohammad Al Hasan,et al.  A Survey of Link Prediction in Social Networks , 2011, Social Network Data Analytics.

[10]  J. Rothberg,et al.  Gaining confidence in high-throughput protein interaction networks , 2004, Nature Biotechnology.

[11]  J. N. R. Jeffers,et al.  Graphical Models in Applied Multivariate Statistics. , 1990 .

[12]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[13]  Robert D. Nowak,et al.  Maximum likelihood network topology identification from edge-based unicast measurements , 2002, SIGMETRICS '02.

[14]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[15]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[16]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[17]  Ben Taskar,et al.  Link Prediction in Relational Data , 2003, NIPS.

[18]  Robert D. Nowak,et al.  Likelihood based hierarchical clustering , 2004, IEEE Transactions on Signal Processing.

[19]  Peter D. Hoff,et al.  Multiplicative latent factor models for description and prediction of social networks , 2009, Comput. Math. Organ. Theory.

[20]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[21]  Robert Nowak,et al.  Network Tomography: Recent Developments , 2004 .

[22]  P. Bühlmann,et al.  Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana , 2004, Genome Biology.

[23]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[24]  T. W. Anderson An Introduction to Multivariate Statistical Analysis, 2nd Edition. , 1985 .

[25]  Hazem N. Nounou,et al.  An Overview of the Statistical Methods Used for Inferring Gene Regulatory Networks and Protein-Protein Interaction Networks , 2013, Adv. Bioinformatics.

[26]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[27]  Lyle H. Ungar,et al.  Statistical Relational Learning for Link Prediction , 2003 .

[28]  Eric D. Kolaczyk,et al.  Statistical Analysis of Network Data: Methods and Models , 2009 .