Hypergraph reconstruction from network data

Networks can describe the structure of a wide variety of complex systems by specifying which pairs of entities in the system are connected. While such pairwise representations are flexible, they are not necessarily appropriate when the fundamental interactions involve more than two entities at the same time. Pairwise representations nonetheless remain ubiquitous, because higher-order interactions are often not recorded explicitly in network data. Here, we introduce a Bayesian approach to reconstruct latent higher-order interactions from ordinary pairwise network data. Our method is based on the principle of parsimony and only includes higher-order structures when there is sufficient statistical evidence for them. We demonstrate its applicability to a wide range of datasets, both synthetic and empirical. Higher-order interactions intervene in a large variety of networked phenomena, from shared interests known to influence the creation of social ties, to co-location shaping networks embedded in space, like power grids. This work introduces a Bayesian framework to infer higher-order interactions hidden in network data.

[1]  Frederic E. Clements,et al.  Experimental pollination; an outline of the ecology of flowers and insects , 2009 .

[2]  Burleigh B. Gardner,et al.  Deep South: A Social Anthropological Study of Caste and Class , 1942 .

[3]  L. Goddard Information Theory , 1962, Nature.

[4]  P. Erdös,et al.  The Representation of a Graph by Set Intersections , 1966, Canadian Journal of Mathematics.

[5]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[6]  Richard M. Karp,et al.  Reducibility Among Combinatorial Problems , 1972, 50 Years of Integer Programming.

[7]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[8]  Ron Atkin,et al.  Mathematical structure in human affairs , 1976 .

[9]  B. Bollobás,et al.  Cliques in random graphs , 1976, Mathematical Proceedings of the Cambridge Philosophical Society.

[10]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[11]  B. Bollobás The evolution of random graphs , 1984 .

[12]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[13]  S. Brenner,et al.  The structure of the nervous system of the nematode Caenorhabditis elegans. , 1986, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[14]  Makoto Kato,et al.  Insect-flower Relationship in the Primary Beech Forest of Ashu, Kyoto : An Overview of the Flowering Phenology and the Seasonal Pattern of Insect Visits , 1990 .

[15]  Dawn Iacobucci,et al.  Social networks with two sets of actors , 1990 .

[16]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[17]  Trish,et al.  Protecting adolescents from harm. Findings from the National Longitudinal Study on Adolescent Health. , 1997, JAMA.

[18]  P. Bearman,et al.  Protecting adolescents from harm: Findings from the National Longitudinal Study on Adolescent Health. , 1997 .

[19]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[20]  M E J Newman,et al.  Identity and Search in Social Networks , 2002, Science.

[21]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[22]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[23]  M. Newman Properties of highly clustered networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[26]  Juan J. Armesto,et al.  Community studies in pollination ecology in the high temperate Andes of central Chile II. effect of temperature on visitation rates and pollination possibilities , 1985, Plant Systematics and Evolution.

[27]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[28]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[29]  R.W.R. Darling,et al.  Structure of large random hypergraphs , 2005 .

[30]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Akira Tanaka,et al.  The worst-case time complexity for generating all maximal cliques and computational experiments , 2006, Theor. Comput. Sci..

[32]  Gergely Palla,et al.  Preferential attachment of communities: The same principle, but a higher level , 2006 .

[33]  Lada A. Adamic,et al.  Networks of strong ties , 2006, cond-mat/0605279.

[34]  E. Todeva Networks , 2007 .

[35]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[36]  P. Grünwald The Minimum Description Length Principle (Adaptive Computation and Machine Learning) , 2007 .

[37]  David Samuel DeHorse,et al.  The Why of It , 2008 .

[38]  David Barber Clique Matrices for Statistical Graph Decomposition and Parameterising Restricted Positive Definite Matrices , 2008, UAI.

[39]  Kathleen M. Carley,et al.  Clearing the FOG: Fuzzy, overlapping groups for social networks , 2008, Soc. Networks.

[40]  Matthieu Latapy,et al.  Basic notions for the analysis of large two-mode networks , 2008, Soc. Networks.

[41]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[42]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[43]  Tore Opsahl,et al.  For the few not the many? The effects of affirmative action on presence, prominence, and social capital of women directors in Norway , 2011 .

[44]  Tiago P. Peixoto,et al.  Trust Transitivity in Social Networks , 2010, PloS one.

[45]  Tiago P. Peixoto Hierarchical block structures and high-resolution model selection in large networks , 2013, ArXiv.

[46]  Boleslaw K. Szymanski,et al.  Overlapping community detection in networks: The state-of-the-art and comparative study , 2011, CSUR.

[47]  Daqing Zhang,et al.  Fine-grained preference-aware location search leveraging crowdsourced digital footprints from LBSNs , 2013, UbiComp.

[48]  Francesco Vaccarino,et al.  Networks and Cycles: A Persistent Homology Approach to Complex Networks , 2013 .

[49]  Francesco Vaccarino,et al.  Topological Strata of Weighted Complex Networks , 2013, PloS one.

[50]  E. Arias-Castro,et al.  Community Detection in Sparse Random Networks , 2013, 1308.2955.

[51]  Jérôme Kunegis,et al.  KONECT: the Koblenz network collection , 2013, WWW.

[52]  Ove Frank,et al.  http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[53]  Stephen E. Fienberg,et al.  Β Models for Random Hypergraphs with a given Degree Sequence , 2014, ArXiv.

[54]  Danai Koutra,et al.  VOG: Summarizing and Understanding Large Graphs , 2014, SDM.

[55]  Luke M. Gerdes,et al.  Assessing the Abu Sayyaf Group's Strategic and Learning Capacities , 2014 .

[56]  Tiago P. Peixoto,et al.  The graph-tool python library , 2014 .

[57]  Anatol E. Wegner,et al.  Subgraph covers - An information theoretic approach to motif analysis in networks , 2014, ArXiv.

[58]  Antoine Allard,et al.  Complex networks are an emerging property of hierarchical preferential attachment , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[59]  Dimitris Papadias,et al.  Uncertain Graph Processing through Representative Instances , 2015, TODS.

[60]  Ginestra Bianconi,et al.  Generalized network structures: The configuration model and the canonical ensemble of simplicial complexes. , 2016, Physical review. E.

[61]  Parviez R. Hosseini,et al.  Host and viral traits predict zoonotic spillover from mammals , 2017, Nature.

[62]  Seshadhri Comandur,et al.  A Fast and Provable Method for Estimating Clique Counts Using Turán's Theorem , 2016, WWW.

[63]  Jean-Gabriel Young,et al.  Construction of and efficient sampling from the simplicial configuration model. , 2017, Physical review. E.

[64]  Alice Patania,et al.  Topological analysis of data , 2017, EPJ Data Science.

[65]  J. Grilli,et al.  Higher-order interactions stabilize dynamics in competitive network models , 2017, Nature.

[66]  Roy C. Barnes,et al.  Structural Redundancy and Multiplicity within Networks of US Corporate Directors , 2017 .

[67]  Tiago P. Peixoto Reconstructing networks with unknown and heterogeneous errors , 2018, Physical Review X.

[68]  Tim Roughgarden,et al.  Finding Cliques in Social Networks: A New Distribution-Free Model , 2018, ICALP.

[69]  Danai Koutra,et al.  Reducing large graphs to small supergraphs: a unified approach , 2018, Social Network Analysis and Mining.

[70]  Mauricio Tec,et al.  Random Clique Covers for Graphs with Local Density and Global Sparsity , 2018, UAI.

[71]  Jean-Gabriel Young,et al.  Networks beyond pairwise interactions: structure and dynamics , 2020, ArXiv.

[72]  Tiago P. Peixoto Latent Poisson models for networks with heterogeneous density , 2020, Physical review. E.

[73]  Hai-Jun Zhou,et al.  Covering Problems and Core Percolations on Hypergraphs. , 2016, Physical review letters.

[74]  Philip S. Chodrow,et al.  Configuration Models of Random Hypergraphs and their Applications , 2019, J. Complex Networks.

[75]  Tim Roughgarden,et al.  Finding Cliques in Social Networks: A New Distribution-Free Model , 2020, SIAM J. Comput..

[76]  Cliff Joslyn,et al.  Hypernetwork science via high-order hypergraph walks , 2019, EPJ Data Science.

[77]  George T. Cantwell,et al.  Robust Bayesian inference of network structure from unreliable data , 2020, ArXiv.

[78]  Austin R. Benson,et al.  Generative hypergraph clustering: From blockmodels to modularity , 2021, Science Advances.

[79]  Tina Eliassi-Rad,et al.  The why, how, and when of representations for complex systems , 2020, SIAM Rev..

[80]  P. Alam ‘E’ , 2021, Composites Engineering: An A–Z Guide.