Generation and analysis of large synthetic social contact networks

We describe “first principles” based methods for developing synthetic urban and national scale social contact networks. Unlike simple random graph techniques, these methods use real world data sources and combine them with behavioral and social theories to synthesize networks. We develop a synthetic population for the United States modeling every individual in the population including household structure, demographics and a 24-hour activity sequence. The process involves collecting and manipulating public and proprietary data sets integrated into a common architecture for data exchange and then using these data sets to generate new relations. A social contact network is derived from the synthetic population based on physical co-location of interacting persons. We use graph measures to compare and contrast the structural characteristics of the social networks that span different urban regions. We then simulate diffusion processes on these networks and analyze similarities and differences in the structure of the networks.

[1]  Aravind Srinivasan,et al.  Modelling disease outbreaks in realistic urban social networks , 2004, Nature.

[2]  J. Hyman,et al.  Scaling laws for the movement of people between locations in a large city. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  T. Snijders,et al.  Modeling the Coevolution of Networks and Behavior , 2007 .

[4]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[5]  Christos Faloutsos,et al.  Mobile call graphs: beyond power-law and lognormal distributions , 2008, KDD.

[6]  C. Macken,et al.  Modeling targeted layered containment of an influenza pandemic in the United States , 2008, Proceedings of the National Academy of Sciences.

[7]  C. Steglich,et al.  DYNAMIC NETWORKS AND BEHAVIOR: SEPARATING SELECTION FROM INFLUENCE: separating selection from influence , 2010 .

[8]  Eli Upfal,et al.  The Web as a graph , 2000, PODS.

[9]  Madhav V. Marathe,et al.  Modeling and simulation of large biological, information and socio-technical systems: an interaction based approach , 2006 .

[10]  M. E. Williams,et al.  TRANSIMS: TRANSPORTATION ANALYSIS AND SIMULATION SYSTEM , 1995 .

[11]  Jure Leskovec,et al.  Patterns of Influence in a Recommendation Network , 2006, PAKDD.

[12]  Harry B. Hunt,et al.  Reachability problems for sequential dynamical systems with threshold functions , 2003, Theor. Comput. Sci..

[13]  M. D. McKay,et al.  Creating synthetic baseline populations , 1996 .

[14]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[15]  M. Macy,et al.  FROM FACTORS TO ACTORS: Computational Sociology and Agent-Based Modeling , 2002 .

[16]  Walter Willinger,et al.  A first-principles approach to understanding the internet's router-level topology , 2004, SIGCOMM '04.

[17]  Madhav V. Marathe,et al.  EpiFast: a fast algorithm for large scale realistic epidemic simulations on distributed memory systems , 2009, ICS.