6. Exponential Family Models for Sampled and Census Network Data

Much progress has been made on the development of statistical methods for network analysis in the past ten years, building on the general class of exponential family random graph (ERG) network models first introduced by Holland and Leinhardt (1981). Recent examples include models for Markov graphs, “p*” models, and actor-oriented models. For empirical application, these ERG models take a logistic form, and require the equivalent of a network census: data on all dyads within the network. In a largely separate stream of research, conditional log-linear (CLL) models have been adapted for analyzing locally sampled (“egocentric”) network data. While the general relation between log-linear and logistic models is well known and has been exploited in the case of a priori blockmodels for networks, the relation for the CLL models is different due to the treatment of absent ties. For fully saturated tie independence models, CLL and ERG are equivalent and related via Bayes' rule. For other tie independence models, the two do not yield equivalent predicted values, but we show that in practice the differences are unlikely to be large. The alternate conditioning in the two models sheds light on the relationship between local and complete network data, and the role that models can play in bridging the gap between them.

[1]  Duncan J. Watts,et al.  Six Degrees: The Science of a Connected Age , 2003 .

[2]  P. Pattison,et al.  9. Neighborhood-Based Models for Social Networks , 2002 .

[3]  P. Holland,et al.  A Method for Detecting Structure in Sociometric Data , 1970, American Journal of Sociology.

[4]  P. V. Marsden,et al.  Core Discussion Networks of Americans , 1987 .

[5]  Garry Robins,et al.  Network models for social selection processes , 2001, Soc. Networks.

[6]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: II. Multivariate relations. , 1999, The British journal of mathematical and statistical psychology.

[7]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[8]  D. Massey The Social and Economic Origins of Immigration , 1990 .

[9]  S. Fienberg,et al.  Categorical Data Analysis of Single Sociometric Relations , 1981 .

[10]  L. Dean,et al.  Effect of sexual behavior change on long-term human immunodeficiency virus prevalence among homosexual men. , 1994, American journal of epidemiology.

[11]  Peter Shawn Bearman,et al.  New methods for new research on adolescent sexual behavior , 1998 .

[12]  Peter V. Marsden,et al.  Models and methods for characterizing the structural parameters of groups , 1981 .

[13]  J. E. Jackson The Analysis of Cross-Classified Data Having Ordered Categories , 1986 .

[14]  E. Laumann The Social Structure of Religious and Ethnoreligious Groups in a Metropolitan Community , 1969 .

[15]  M. Morris,et al.  A log-linear modeling framework for selective mixing. , 1991, Mathematical biosciences.

[16]  Stephen J. Majeski,et al.  How foreign policy recommendations are put together: A computational model with empirical applications , 1999 .

[17]  N. Milburn To Dwell Among Friends: Personal Networks in Town and City. , 1983 .

[18]  M. Feldman,et al.  The Co-evolution of Individual Behaviors and Social Institutions , 2022 .

[19]  David Knoke,et al.  The Organizational State: Social Choice in National Policy Domains , 1987 .

[20]  P. Holland,et al.  An Exponential Family of Probability Distributions for Directed Graphs , 1981 .

[21]  P. Holland,et al.  Local Structure in Social Networks , 1976 .

[22]  J. Besag Statistical Analysis of Non-Lattice Data , 1975 .

[23]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[24]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[25]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[26]  Ronald S. Burt,et al.  Network items and the general social survey , 1984 .

[27]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: III. Valued relations , 1999 .

[28]  Leo A. Goodman,et al.  On the Statistical Analysis of Mobility Tables , 1965, American Journal of Sociology.

[29]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[30]  O. Frank RANDOM SAMPLING AND SOCIAL NETWORKS A SURVEY OF VARIOUS APPROACHES , 1985 .

[31]  M. Macy,et al.  FROM FACTORS TO ACTORS: Computational Sociology and Agent-Based Modeling , 2002 .

[32]  C. Geyer,et al.  Constrained Monte Carlo Maximum Likelihood for Dependent Data , 1992 .

[33]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp , 1996 .

[34]  B. Wellman,et al.  Different Strokes from Different Folks: Community Ties and Social Support , 1990, American Journal of Sociology.

[35]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[36]  P. Pattison,et al.  Network models for social influence processes , 2001 .

[37]  Yu Xie,et al.  Temporal and Regional Variation in the Strength of Educational Homogamy , 2000, American Sociological Review.

[38]  R. Mare Five decades of educational assortative mating. , 1991 .

[39]  John F. Padgett,et al.  Robust Action and the Rise of the Medici, 1400-1434 , 1993, American Journal of Sociology.

[40]  S. Morgan,et al.  Intermarriage and Social Distance Among U.S. Immigrants at the Turn of the Century , 1990, American Journal of Sociology.

[41]  Trish,et al.  Protecting adolescents from harm. Findings from the National Longitudinal Study on Adolescent Health. , 1997, JAMA.

[42]  A. Portes,et al.  The educational and early occupational attainment process. , 1969 .

[43]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[44]  D. J. Strauss,et al.  Pseudolikelihood Estimation for Social Networks , 1990 .

[45]  M. Janssen,et al.  Multi-Agent Systems for the Simulation of Land-Use and Land-Cover Change: A Review , 2003 .

[46]  P. V. Marsden,et al.  Homogeneity in confiding relations , 1988 .

[47]  R J Hayes,et al.  Multicentre study on factors determining differences in rate of spread of HIV in sub-Saharan Africa: methods and prevalence of HIV infection , 2001, AIDS.

[48]  E O Laumann,et al.  Monitoring the AIDS epidemic in the United States: a network approach. , 1989, Science.

[49]  Dawn Iacobucci,et al.  Statistical Analysis of Single Relational Networks , 1994 .

[50]  T. Snijders The statistical evaluation of social network dynamics , 2001 .

[51]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .