Generalizations of Edge Overlap to Weighted and Directed Networks

With the increasing availability of behavioral data from diverse digital sources, such as social media sites and cell phones, it is now possible to obtain detailed information on the strength and directionality of social interactions in various settings. While most metrics used to characterize network structure have traditionally been defined for unweighted and undirected networks only, the richness of current network data calls for extending these metrics to weighted and directed networks. One fundamental metric, especially in social networks, is edge overlap, the proportion of friends shared by two connected individuals. Here we extend definitions of edge overlap to weighted and directed networks, and we present closed-form expressions for the mean and variance of each version for the classic \er random graph and its weighted and directed counterparts. We apply these results to social network data collected in rural villages, and we use our analytical results to quantify the extent to which the average edge overlap in the empirical social networks deviates from that of corresponding random graphs. Finally, we carry out comparisons across attribute categories including sex, caste, and age, finding that women tend to form more tightly clustered friendship circles than men, where the extent of overlap depends on the nature of social interaction in question.

[1]  N. Christakis,et al.  The Spread of Obesity in a Large Social Network Over 32 Years , 2007, The New England journal of medicine.

[2]  Mikko Kivelä,et al.  Generalizations of the clustering coefficient to weighted complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[4]  N. Papadatos Maximum variance of order statistics , 1995 .

[5]  Tore Opsahl Triadic closure in two-mode networks: Redefining the global and local clustering coefficients , 2013, Soc. Networks.

[6]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[7]  Jari Saramäki,et al.  Emergence of communities in weighted networks. , 2007, Physical review letters.

[8]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[9]  David A Kim,et al.  The Social Geography of American Medicine , 2016 .

[10]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[11]  D. Schrag,et al.  Analytical strategies for characterizing chemotherapy diffusion with patient-level population-based data , 2010 .

[12]  N. Christakis,et al.  Estimating Peer Effects on Health in Social Networks , 2008, Journal of health economics.

[13]  T. VanderWeele Sensitivity Analysis for Contagion Effects in Social Networks , 2011, Sociological methods & research.

[14]  Arun G. Chandrasekhar,et al.  The Diffusion of Microfinance , 2012, Science.

[15]  Mason A. Porter,et al.  Communities in Networks , 2009, ArXiv.

[16]  B. Bollobás The evolution of random graphs , 1984 .

[17]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[18]  Béla Bollobás,et al.  Random Graphs , 1985 .

[19]  Santo Fortunato,et al.  Triadic closure as a basic generating mechanism of the structure of complex networks , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  N. L. Johnson,et al.  Survival Models and Data Analysis , 1982 .

[21]  Jukka-Pekka Onnela,et al.  Impact of degree truncation on the spread of a contagious process on networks , 2016, Network Science.

[22]  October I Physical Review Letters , 2022 .

[23]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[24]  D. Garlaschelli The weighted random graph model , 2009, 0902.0897.

[25]  J. G. Skellam The Frequency Distribution of the Difference between Two Poisson Variates Belonging to Different Populations , 1946 .

[26]  Thomas W. Valente,et al.  Models and Methods in Social Network Analysis: Network Models and Methods for Studying the Diffusion of Innovations , 2005 .

[27]  M. Kendall,et al.  Kendall's advanced theory of statistics , 1995 .

[28]  N. Christakis,et al.  SUPPLEMENTARY ONLINE MATERIAL FOR: The Collective Dynamics of Smoking in a Large Social Network , 2022 .

[29]  Yi Pan,et al.  Identification of Essential Proteins Based on Edge Clustering Coefficient , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[30]  Jukka-Pekka Onnela,et al.  Incorporating Contact Network Structure in Cluster Randomized Trials , 2015, Scientific Reports.