Co-occurrence simplicial complexes in mathematics: identifying the holes of knowledge

In the last years complex networks tools contributed to provide insights on the structure of research, through the study of collaboration, citation and co-occurrence networks. The network approach focuses on pairwise relationships, often compressing multidimensional data structures and inevitably losing information. In this paper we propose for the first time a simplicial complex approach to word co-occurrences, providing a natural framework for the study of higher-order relations in the space of scientific knowledge. Using topological methods we explore the conceptual landscape of mathematical research, focusing on homological holes, regions with low connectivity in the simplicial structure. We find that homological holes are ubiquitous, which suggests that they capture some essential feature of research practice in mathematics. k-dimensional holes die when every concept in the hole appears in an article together with other k+1 concepts in the hole, hence their death may be a sign of the creation of new knowledge, as we show with some examples. We find a positive relation between the size of a hole and the time it takes to be closed: larger holes may represent potential for important advances in the field because they separate conceptually distant areas. We provide further description of the conceptual space by looking for the simplicial analogs of stars and explore the likelihood of edges in a star to be also part of a homological cycle. We also show that authors’ conceptual entropy is positively related with their contribution to homological holes, suggesting that polymaths tend to be on the frontier of research.

[1]  Afra Zomorodian,et al.  Computing Persistent Homology , 2004, SCG '04.

[2]  Xianwen Wang,et al.  Patent co-citation networks of Fortune 500 companies , 2011, Scientometrics.

[3]  M Clara P Amorim,et al.  Painted Goby Larvae under High-CO2 Fail to Recognize Reef Sounds , 2017, PloS one.

[4]  Emanuela Merelli,et al.  Persistent Homology Analysis of RNA , 2016 .

[5]  Michel Waldschmidt Open Diophantine problems , 2001 .

[6]  Alice Patania,et al.  Topological analysis of data , 2017, EPJ Data Science.

[7]  I-Jen Chiang,et al.  Discover the semantic topology in high-dimensional data , 2007, Expert Syst. Appl..

[8]  Jean-Gabriel Young,et al.  Construction of and efficient sampling from the simplicial configuration model. , 2017, Physical review. E.

[9]  Afra Zomorodian,et al.  The Theory of Multidimensional Persistence , 2007, SCG '07.

[10]  H. Edelsbrunner,et al.  Persistent Homology — a Survey , 2022 .

[11]  Ysabel Clare,et al.  Stanislavsky’s system as an enactive guide to embodied cognition? , 2017, Connect. Sci..

[12]  Sanjay Garg,et al.  Evaluation of robenidine analog NCL195 as a novel broad-spectrum antibacterial agent , 2017, PloS one.

[13]  Pawel Dlotko,et al.  Computational Topology in Text Mining , 2012, CTIC.

[14]  F. Song,et al.  Mapping the Knowledge Structure of Research on Patient Adherence: Knowledge Domain Visualization Based Co-Word Analysis and Social Network Analysis , 2012, PloS one.

[15]  Diego R. Amancio,et al.  Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks , 2016, PloS one.

[16]  Mason A. Porter,et al.  A roadmap for the computation of persistent homology , 2015, EPJ Data Science.

[17]  Ram Ramanathan,et al.  Comparative Topological Signatures of Growing Collaboration Networks , 2017 .

[18]  Marián Boguñá,et al.  Extracting the multiscale backbone of complex weighted networks , 2009, Proceedings of the National Academy of Sciences.

[19]  Sagar Kamarthi,et al.  Correction: Novel keyword co-occurrence network-based methods to foster systematic reviews of scientific literature , 2017, PloS one.

[20]  Mark Daley,et al.  Gene co-citation networks associated with worker sterility in honey bees , 2014, BMC Systems Biology.

[21]  Gunnar E. Carlsson,et al.  Topology and data , 2009 .

[22]  Ginestra Bianconi,et al.  Generalized network structures: The configuration model and the canonical ensemble of simplicial complexes. , 2016, Physical review. E.

[23]  T. Jenssen,et al.  A literature network of human genes for high-throughput analysis of gene expression , 2001, Nature Genetics.

[24]  C. J. Carstens,et al.  Persistent Homology of Collaboration Networks , 2013 .

[25]  Radu Purice,et al.  Twisted Crossed Products and Magnetic Pseudodieren tial Operators , 2004 .

[26]  Carl Lagoze,et al.  The web of topics: discovering the topology of topic evolution in a corpus , 2011, WWW.

[27]  Danijela Horak,et al.  Persistent homology of complex networks , 2008, 0811.2203.

[28]  Heather A. Harrington,et al.  Persistent homology of time-dependent functional networks constructed from coupled time series. , 2016, Chaos.

[29]  Pei-Chun Lee,et al.  Mapping knowledge structure by keyword co-occurrence: a first look at journal papers in Technology Foresight , 2010, Scientometrics.

[30]  Jean-Pierre Eckmann,et al.  Curvature of co-links uncovers hidden thematic layers in the World Wide Web , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Ginestra Bianconi,et al.  Emergent Hyperbolic Network Geometry , 2016, Scientific Reports.

[32]  G. Petri,et al.  Homological scaffolds of brain functional networks , 2014, Journal of The Royal Society Interface.

[33]  Shing-Tung Yau,et al.  Perspectives on geometric analysis , 2005 .

[34]  Mikael Vejdemo-Johansson,et al.  javaPlex: A Research Software Package for Persistent (Co)Homology , 2014, ICMS.

[35]  Ginestra Bianconi,et al.  Emergent Complex Network Geometry , 2014, Scientific Reports.

[36]  A. Gottlieb Markov Transitions and the Propagation of Chaos , 2000, math/0001076.

[37]  T. Jenssen,et al.  A literature network of human genes for high-throughput analysis of gene expression , 2001 .

[38]  Ernesto Estrada,et al.  Centralities in Simplicial Complexes , 2017, ArXiv.

[39]  Ramon Ferrer i Cancho,et al.  The small world of human language , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[40]  Ernesto Estrada,et al.  Centralities in Simplicial Complexes , 2017, Journal of theoretical biology.

[41]  Muskan Garg,et al.  Identifying influential segments from word co-occurrence networks using AHP , 2018, Cognitive Systems Research.

[42]  Francesco Vaccarino,et al.  Topological Strata of Weighted Complex Networks , 2013, PloS one.

[43]  Herbert Edelsbrunner,et al.  Topological Persistence and Simplification , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[44]  Alexander Herzog,et al.  Complex Politics: A Quantitative Semantic and Topological Analysis of UK House of Commons Debates , 2015, ArXiv.

[45]  G. Carlsson,et al.  Topology of viral evolution , 2013, Proceedings of the National Academy of Sciences.

[46]  Leonidas J. Guibas,et al.  Persistence barcodes for shapes , 2004, SGP '04.

[47]  Katayoun Farrahi,et al.  A Simplified Topological Representation of Text for Local and Global Context , 2017, ACM Multimedia.

[48]  S. Kamarthi,et al.  Novel keyword co-occurrence network-based methods to foster systematic reviews of scientific literature , 2017, PloS one.

[49]  Paul B. Slater,et al.  A two-stage algorithm for extracting the multiscale backbone of complex weighted networks , 2009, Proceedings of the National Academy of Sciences.

[50]  Konstantin Mischaikow,et al.  Topological data analysis of contagion maps for examining spreading processes on networks , 2015, Nature communications.

[51]  Alice Patania,et al.  The shape of collaborations , 2017, EPJ Data Science.

[52]  Andrea Cerri,et al.  Computational Topology in Image Context , 2012, Lecture Notes in Computer Science.