Assessing network structure with practical sampling methods: An example of the Global Airport Network

Using data from an enumerated network of worldwide flight connections between airports, we examine how sampling designs and sample size influence network metrics. Specifically, we apply three types of sampling designs: simple random sampling, nonrandom strategic sampling (i.e., selection of the largest airports), and a variation of snowball sampling. For the latter sampling method, we design what we refer to as a controlled snowball sampling design, which selects nodes in a manner analogous to a respondent-driven sampling design. For each design, we evaluate five commonly used measures of network structure and examine the percentage of total air traffic accounted for by each design. The empirical application shows that (1) the random and controlled snowball sampling designs give rise to more efficient estimates of the true underlying structure, and (2) the strategic sampling method can account for a greater proportion of the total number of passenger movements occurring in the network.

[1]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[2]  Martina Morris,et al.  Overview of Network Survey Designs , 2004 .

[3]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  S. Feld Why Your Friends Have More Friends Than You Do , 1991, American Journal of Sociology.

[6]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[7]  L. A. Rvachev,et al.  A mathematical model for the global spread of influenza , 1985 .

[8]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[9]  William Richards,et al.  Nonrespondents in Communication Network Studies , 1992 .

[10]  Carsten Wiuf,et al.  Subnets of scale-free networks are not scale-free: sampling properties of networks. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Yulia R. Gel,et al.  Bootstrap quantification of estimation uncertainties in network degree distributions , 2017, Scientific Reports.

[12]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[13]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[14]  Robert F. Chew,et al.  Patterns of Twitter Behavior Among Networks of Cannabis Dispensaries in California , 2017, Journal of medical Internet research.

[15]  Mark E. J. Newman,et al.  Ego-centered networks and the ripple effect , 2001, Soc. Networks.

[16]  M. Kretzschmar,et al.  Concurrent partnerships and the spread of HIV , 1997, AIDS.

[17]  Alessandro Vespignani,et al.  The role of the airline transportation network in the prediction and predictability of global epidemics , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[18]  D. Goedecke,et al.  Sampling for Global Epidemic Models and the Topology of an International Airport Network , 2008, PloS one.

[19]  L. Sattenspiel,et al.  The spread and persistence of infectious diseases in structured populations , 1988 .

[20]  G. Glass,et al.  Assessing the impact of airline travel on the geographic spread of pandemic influenza , 2003 .

[21]  Douglas D. Heckathorn,et al.  Respondent-driven sampling II: deriving valid population estimates from chain-referral samples of hi , 2002 .

[22]  Carsten Wiuf,et al.  Sampling properties of random graphs: the degree distribution. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Joshua M. Epstein,et al.  Controlling Pandemic Flu: The Value of International Air Travel Restrictions , 2007, PloS one.

[24]  W. Edmunds,et al.  Delaying the International Spread of Pandemic Influenza , 2006, PLoS medicine.