An empirical model of network formation : detecting homophily when agents are heterogeneous

I formalize a widely-used empirical model of network formation. The model allows for assortative matching on observables (homophily) as well as unobserved agent level heterogeneity in link surplus (degree heterogeneity). The joint distribution of observed and unobserved agent-level characteristics is left unrestricted. Inferences about homophily do not depend upon untestable assumptions about this distribution. The model is non-standard since the dimension of the heterogeneity parameter grows with the number of agents, and hence network size. Nevertheless, under certain conditions, a joint maximum likelihood (ML) procedure, which simultaneously estimates the common and agent-level parameters governing link formation, is consistent. Although the asymptotic sampling distribution of the common parameter is Normal, it (i) contains a bias term and (ii) its variance does not coincide with the inverse of Fisher’s information matrix. Standard ML asymptotic inference procedures are invalid. Forming confidence intervals with a bias-corrected maximum likelihood estimate, and appropriate standard error estimates, results in correct coverage. I assess the value of these results for understanding finite sample behavior via a set of Monte Carlo experiments and through an empirical analysis of risk-sharing links in a rural Tanzanian village (cf., De Weerdt, 2004). JEL Codes: C31, C33, C35 ∗Department of Economics, University of California Berkeley, 530 Evans Hall #3380, Berkeley, CA 94720-3888 and National Bureau of Economic Research, e-mail: bgraham@econ.berkeley.edu, web: http : //emlab.berkeley.edu/ bgraham/. I thank Jinyong Hahn for sharing an unpublished appendix of Hahn and Newey (2004), Joachim De Weerdt for generously making his Nyakatoke dataset available, Jesus Carro, Demian Pouzo and Michael Jansson for many helpful suggestions, and participants in the New York University, UC Berkeley, Toulouse School of Economics, Oxford, and Cambridge econometrics seminars for helpful discussion. All the usual disclaimers apply.

[1]  Allan Sly,et al.  Random graphs with a given degree sequence , 2010, 1005.1136.

[2]  Marcel Fafchamps,et al.  The formation of risk sharing networks , 2007 .

[3]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[4]  Matthew O. Jackson,et al.  Tractable and Consistent Random Graph Models , 2012, ArXiv.

[5]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[6]  Gary Chamberlain,et al.  Analysis of Covariance with Qualitative Data , 1979 .

[7]  Whitney K. Newey,et al.  Generalized method of moments with many weak moment conditions , 2009 .

[8]  Robin S. Lee,et al.  Markov-Perfect Network Formation An Applied Framework for Bilateral Oligopoly and Bargaining in Buyer-Seller Networks ∗ , 2011 .

[9]  E. Tamer Incomplete Simultaneous Discrete Response Model with Multiple Equilibria , 2003 .

[10]  Dennis Epple,et al.  Peer Effects in Education: A Survey of the Theory and Evidence , 2011 .

[11]  G. Imbens,et al.  Social Networks and the Identification of Peer Effects , 2013 .

[12]  Bryan S. Graham,et al.  Identifying Social Interactions Through Conditional Variance Restrictions , 2008 .

[13]  Charles F. Manski,et al.  SEMIPARAMETRIC ANALYSIS OF RANDOM EFFECTS LINEAR MODELS FROM BINARY PANEL DATA , 1987 .

[14]  Jinyong Hahn,et al.  Understanding Bias in Nonlinear Panel Models: Some Recent Developments ∗ , 2005 .

[15]  M. Jackson,et al.  A Strategic Model of Social and Economic Networks , 1996 .

[16]  Gary Chamberlain,et al.  Longitudinal Analysis of Labor Market Data: Heterogeneity, omitted variable bias, and duration dependence , 1985 .

[17]  Margherita Comola,et al.  Testing Unilateral and Bilateral Link Formation , 2009 .

[18]  Bo E. Honoré,et al.  Panel Data Discrete Choice Models with Lagged Dependent Variables , 2000 .

[19]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[20]  Yuhei Miyauchi,et al.  Structural Estimation of Pairwise Stable Networks with Nonnegative Externality , 2014 .

[21]  N. Christakis,et al.  Social Networks and Cooperation in Hunter-Gatherers , 2011, Nature.

[22]  Angelo Mele,et al.  A Structural Model of Segregation in Social Networks , 2010 .

[23]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[24]  Paul A. Bekker,et al.  ALTERNATIVE APPROXIMATIONS TO THE DISTRIBUTIONS OF INSTRUMENTAL VARIABLE ESTIMATORS , 1994 .

[25]  Bryan S. Graham,et al.  Homophily and Transitivity in Dynamic Network Formation , 2016 .

[26]  C. Hillar,et al.  Inverses of symmetric, diagonally dominant positive matrices and applications , 2012, 1203.6812.

[27]  J. Montgomery Social Networks and Labor-Market Outcomes: Toward an Economic Analysis , 1991 .

[28]  P. Holland,et al.  An Exponential Family of Probability Distributions for Directed Graphs , 1981 .

[29]  Joachim De Weerdt,et al.  Risk-Sharing and Endogenous Network Formation , 2002 .

[30]  T. Yan,et al.  A central limit theorem in the β-model for undirected random graphs with a diverging number of vertices , 2012, 1202.3307.

[31]  B. Graham Methods of Identification in Social Networks , 2014 .

[32]  T. Snijders,et al.  p2: a random effects model with covariates for directed graphs , 2004 .

[33]  N. Christakis,et al.  SUPPLEMENTARY ONLINE MATERIAL FOR: The Collective Dynamics of Smoking in a Large Social Network , 2022 .

[34]  Marcel Fafchamps,et al.  Social Identity and the Formation of Health Insurance Networks , 2011 .

[35]  Matthew O. Jackson,et al.  The Formation of Networks with Transfers Among Players , 2004, J. Econ. Theory.

[36]  Adrian E. Raftery,et al.  Representing degree distributions, clustering, and homophily in social networks with latent cluster random effects models , 2009, Soc. Networks.

[37]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[38]  Jinyong Hahn,et al.  JACKKNIFE AND ANALYTICAL BIAS REDUCTION FOR NONLINEAR PANEL MODELS , 2003 .

[39]  Matthew O. Jackson,et al.  The Evolution of Social and Economic Networks , 2002, J. Econ. Theory.

[40]  Shelby J. Haberman,et al.  An Exponential Family of Probability Distributions for Directed Graphs: Comment , 1981 .

[41]  Arun G. Chandrasekhar,et al.  The Diffusion of Microfinance , 2012, Science.

[42]  Persi Diaconis,et al.  A Sequential Importance Sampling Algorithm for Generating Random Graphs with Prescribed Degrees , 2011, Internet Math..

[43]  E. Tamer,et al.  Market Structure and Multiple Equilibria in Airline Markets , 2009 .

[44]  Sanjeev Goyal,et al.  A Noncooperative Model of Network Formation , 2000 .

[45]  P. Bearman,et al.  Chains of Affection: The Structure of Adolescent Romantic and Sexual Networks1 , 2004, American Journal of Sociology.

[46]  E. B. Andersen,et al.  CONDITIONAL INFERENCE FOR MULTIPLE‐CHOICE QUESTIONNAIRES , 1973 .

[47]  Matthew O. Jackson,et al.  Relating Network Structure to Diffusion Properties through Stochastic Dominance , 2007 .

[48]  Charles F. Manski,et al.  Social Learning from Private Experiences: The Dynamics of the Selection Problem , 2004 .

[49]  Stanley Wasserman,et al.  Statistical Models for Social Networks , 2000 .

[50]  Shuyang Sheng,et al.  Identi…cation and Estimation of Network Formation Games , 2012 .

[51]  Giorgio Topa Labor Markets and Referrals , 2011 .