Network sampling and classification: An investigation of network model representations

Methods for generating a random sample of networks with desired properties are important tools for the analysis of social, biological, and information networks. Algorithm-based approaches to sampling networks have received a great deal of attention in recent literature. Most of these algorithms are based on simple intuitions that associate the full features of connectivity patterns with specific values of only one or two network metrics. Substantive conclusions are crucially dependent on this association holding true. However, the extent to which this simple intuition holds true is not yet known. In this paper, we examine the association between the connectivity patterns that a network sampling algorithm aims to generate and the connectivity patterns of the generated networks, measured by an existing set of popular network metrics. We find that different network sampling algorithms can yield networks with similar connectivity patterns. We also find that the alternative algorithms for the same connectivity pattern can yield networks with different connectivity patterns. We argue that conclusions based on simulated network studies must focus on the full features of the connectivity patterns of a network instead of on the limited set of network metrics for a specific network type. This fact has important implications for network data analysis: for instance, implications related to the way significance is currently assessed.

[1]  Martin G. Everett,et al.  Models of core/periphery structures , 2000, Soc. Networks.

[2]  Scott Shane,et al.  Network Ties, Reputation, and the Financing of New Ventures , 2002, Manag. Sci..

[3]  E. Ziv,et al.  Inferring network mechanisms: the Drosophila melanogaster protein interaction network. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Frank Harary,et al.  Graph Theoretic Methods in the Management Sciences , 1959 .

[6]  X ZhengAlice,et al.  A Survey of Statistical Network Models , 2010 .

[7]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[8]  Béla Bollobás,et al.  Random Graphs , 1985 .

[9]  Kathleen M. Carley,et al.  ORA: Organization Risk Analyzer , 2004 .

[10]  Edoardo M. Airoldi,et al.  Sampling algorithms for pure network topologies: a study on the stability and the separability of metric embeddings , 2005, SKDD.

[11]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[12]  Bin Zhu,et al.  Visualization of Network Concepts: The Impact of Working Memory Capacity Differences , 2010, Inf. Syst. Res..

[13]  Robert Russell,et al.  AWSM: Allocation of workflows utilizing social network metrics , 2010, Decis. Support Syst..

[14]  Martin Bichler,et al.  Identification of influencers - Measuring influence in customer networks , 2008, Decis. Support Syst..

[15]  Martina Morris,et al.  A Simple Model for Complex Networks with Arbitrary Degree Distribution and Clustering , 2006, SNA@ICML.

[16]  Jon M. Kleinberg,et al.  The small-world phenomenon: an algorithmic perspective , 2000, STOC '00.

[17]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  B. Kogut,et al.  Social Capital, Structural Holes and the Formation of an Industry Network , 1997 .

[19]  Adalbert Mayer,et al.  Online social networks in economics , 2009, Decis. Support Syst..

[20]  Michael Goul,et al.  The influence of collaborative technology knowledge on advice network structures , 2010, Decis. Support Syst..

[21]  F. Chung,et al.  The small world phenomenon in hybrid graphs , 2006 .

[22]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[23]  Eric D. Kolaczyk,et al.  Statistical Analysis of Network Data: Methods and Models , 2009 .

[24]  Jon M. Kleinberg,et al.  Navigation in a small world , 2000, Nature.

[25]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[26]  Gueorgi Kossinets,et al.  Empirical Analysis of an Evolving Social Network , 2006, Science.

[27]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[28]  Peter Sheridan Dodds,et al.  Information exchange and the robustness of organizational networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Rob Cross,et al.  A Relational View of Information Seeking and Learning in Social Networks , 2003, Manag. Sci..

[30]  Larry Wasserman,et al.  All of Statistics , 2004 .

[31]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[32]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[33]  Wonseok Oh,et al.  Membership Herding and Network Stability in the Open Source Community: The Ising Perspective , 2007, Manag. Sci..

[34]  Kathleen M. Carley,et al.  Network Structure in Virtual Organizations , 1999 .

[35]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[36]  Kathleen M. Carley,et al.  Toward an interoperable dynamic network analysis toolkit , 2007, Decis. Support Syst..

[37]  Terrill L. Frantz,et al.  A Formal Characterization of Cellular Networks , 2005 .

[38]  David Easley,et al.  Networks, Crowds, and Markets: The Small-World Phenomenon , 2010 .

[39]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[40]  Viktor Seifert Sampling Algorithms for Pure Network Topologies , 2007 .

[41]  E. Todeva Networks , 2007 .

[42]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[43]  William W. Cohen,et al.  Bayesian Models for Frequent Terms in Text , 2004 .

[44]  I. N. A. C. I. J. H. Fowler Book Review: Connected: The surprising power of our social networks and how they shape our lives. , 2009 .

[45]  Steven Durlauf,et al.  Social Capital , 2004 .