A Structural Model of Homophily and Clustering in Social Networks

Social networks display homophily and clustering, and are usually sparse. I develop and estimate a structural model of strategic network formation with heterogeneous players and latent community structure, whose equilibrium networks are sparse and exhibit homophily and clustering. Each player belongs to a community unobserved by the econometrician. Players' payoffs vary by community and depend on the composition of direct links and common neighbors, allowing preferences to have a bias for similar people. Players meet sequentially and decide whether to form bilateral links, after receiving a random matching shock. The probability of meeting people in different communities is smaller than the probability of meeting people in the same community, and it decreases with the size of the network. The model converges to an exponential family random graph, with weak dependence among links. As a consequence the equilibrium networks are sparse and the sufficient statistics of the network are asymptotically normal. The posterior distribution of structural parameters and unobserved heterogeneity is estimated with school friendship network data from Add Health, using a Bayesian exchange algorithm. The estimates detect high levels of racial homophily, and heterogeneity in both costs of links and payoffs from common friends. The posterior predictions show that the model is able to replicate the homophily levels and the aggregate clustering of the observed network, in contrast with standard exponential family network models.

[1]  P. Diaconis,et al.  Estimating and understanding exponential random graph models , 2011, 1102.2650.

[2]  Jeremy T. Fox,et al.  A note on identification of discrete choice models for bundles and binary games , 2017 .

[3]  Kenneth C. Wilbur,et al.  Structural models of complementary choices , 2014 .

[4]  P. Deb Finite Mixture Models , 2008 .

[5]  Jing Wang,et al.  Bayesian inference of exponential random graph models for large social networks , 2011 .

[6]  Angelo Mele,et al.  A Structural Model of Dense Network Formation , 2017 .

[7]  Andreas Dzemski,et al.  An Empirical Model of Dyadic Link Formation in a Network with Unobserved Heterogeneity , 2018, Review of Economics and Statistics.

[8]  Lung-fei Lee,et al.  A Structural Modeling Approach for Network Formation and Social Interactions – with Applications to Students ’ Friendship Choices and Selectivity on Activities , 2012 .

[9]  Lingjiong Zhu,et al.  On the phase transition curve in a directed exponential random graph model , 2014, Advances in Applied Probability.

[10]  Alberto Caimo,et al.  Bayesian inference for exponential random graph models , 2010, Soc. Networks.

[11]  Lingjiong Zhu,et al.  Approximate Variational Estimation for a Model of Network Formation , 2017, Review of Economics and Statistics.

[12]  L. Blume The Statistical Mechanics of Strategic Interaction , 1993 .

[13]  Paul Marjoram,et al.  Markov chain Monte Carlo without likelihoods , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Faming Liang,et al.  A double Metropolis–Hastings sampler for spatial models with intractable normalizing constants , 2010 .

[15]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[16]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[17]  Michael D. König The formation of networks with local spillovers and limited observability , 2016 .

[18]  Konrad Menzel,et al.  STRATEGIC NETWORK FORMATION WITH MANY AGENTS , 2016 .

[19]  Matthew O. Jackson,et al.  A Network Formation Model Based on Subgraphs , 2016, ArXiv.

[20]  Brian W. Rogers,et al.  Meeting Strangers and Friends of Friends: How Random are Social Networks? , 2007 .

[21]  Elie Tamer,et al.  Identifying preferences in networks with bounded degree , 2016 .

[22]  Federico Echenique,et al.  A Measure of Segregation Based on Social Interactions , 2007 .

[23]  M. Bálek,et al.  Large Networks and Graph Limits , 2022 .

[24]  Johan Koskinen,et al.  The Linked Importance Sampler Auxiliary Variable Metropolis Hastings Algorithm for Distributions with Intractable Normalising Constants , 2008 .

[25]  Marcel Fafchamps,et al.  Risk Sharing and Network Formation , 2007 .

[26]  A. Galeotti One-way flow networks: the role of heterogeneity , 2004 .

[27]  S. Bonhomme,et al.  Discretizing Unobserved Heterogeneity , 2017, Econometrica.

[28]  Timothy G. Conley,et al.  Learning About a New Technology: Pineapple in Ghana , 2010 .

[29]  Douglas Gale,et al.  Bayesian learning in social networks , 2003, Games Econ. Behav..

[30]  Timothy G. Conley GMM estimation with cross sectional dependence , 1999 .

[31]  Timothy G. Conley,et al.  Estimating dynamic local interactions models , 2007 .

[32]  Brian W. Rogers,et al.  The Oxford handbook of the economics of networks , 2016 .

[33]  L. Shapley,et al.  Potential Games , 1994 .

[34]  L. Freeman Segregation in Social Networks , 1978 .

[35]  Ryo Nakajima,et al.  Measuring Peer Eects on Youth Smoking Behavior , 2007 .

[36]  Michael P. Leung A Random-Field Approach to Inference in Large Models of Network Formation , 2019 .

[37]  Yves Zenou,et al.  Peer Effects and Social Networks in Education , 2008 .

[38]  Sanjeev Goyal,et al.  A Noncooperative Model of Network Formation , 2000 .

[39]  G. Imbens,et al.  Social Networks and the Identification of Peer Effects , 2013 .

[40]  M. Jackson,et al.  A Strategic Model of Social and Economic Networks , 1996 .

[41]  Matthew O. Jackson,et al.  The Existence of Pairwise Stable Networks , 2002 .

[42]  M. Jackson,et al.  An Economic Model of Friendship: Homophily, Minorities and Segregation , 2007 .

[43]  Paolo Pin,et al.  Identifying the roles of race-based choice and chance in high school friendship network formation , 2010, Proceedings of the National Academy of Sciences.

[44]  Vincent Boucher,et al.  Structural Homophily , 2015 .

[45]  Lancelot F. James,et al.  Gibbs Sampling Methods for Stick-Breaking Priors , 2001 .

[46]  J. Heckman Dummy Endogenous Variables in a Simultaneous Equation System , 1977 .

[47]  Steven L. Puller,et al.  The Old Boy (and Girl) Network: Social Network Formation on University Campuses , 2008 .

[48]  M. Gentzkow Valuing New Goods in a Model with Complementarity: Online Newspapers , 2007 .

[49]  Giacomo De Giorgi,et al.  Identification of Social Interactions through Partially Overlapping Peer Groups , 2010 .

[50]  Tyler H. McCormick,et al.  Using Aggregated Relational Data to Feasibly Identify Network Structure Without Network Data , 2017, The American economic review.

[51]  Koen Jochmans,et al.  Two-Way Models for Gravity , 2017, Review of Economics and Statistics.

[52]  Anton I Badev,et al.  Discrete Games in Endogenous Networks: Theory and Policy , 2014 .

[53]  Matthew O. Jackson,et al.  Network Structure and the Speed of Learning: Measuring Homophily Based on its Consequences , 2011 .

[54]  Tom A. B. Snijders,et al.  Markov Chain Monte Carlo Estimation of Exponential Random Graph Models , 2002, J. Soc. Struct..

[55]  Giorgio Topa,et al.  Social interactions, local spillovers and unemployment , 2001 .

[56]  J. Moody Race, School Integration, and Friendship Segregation in America1 , 2001, American Journal of Sociology.

[57]  Mark S Handcock,et al.  Local dependence in random graph models: characterization, properties and statistical inference , 2015, Journal of the American Statistical Association.

[58]  Ron A. Laschever,et al.  The doughboys network: Social interactions and labor market outcomes of World War I veterans , 2005 .

[59]  Shuyang Sheng,et al.  Identi…cation and Estimation of Network Formation Games , 2012 .

[60]  Karyne B. Charbonneau,et al.  Multiple Fixed Effects in Binary Response Panel Data Models , 2017 .