Data Mechanics and Coupling Geometry on Binary Bipartite Networks

We quantify the notion of pattern and formalize the process of pattern discovery under the framework of binary bipartite networks. Patterns of particular focus are interrelated global interactions between clusters on its row and column axes. A binary bipartite network is built into a thermodynamic system embracing all up-and-down spin configurations defined by product-permutations on rows and columns. This system is equipped with its ferromagnetic energy ground state under Ising model potential. Such a ground state, also called a macrostate, is postulated to congregate all patterns of interest embedded within the network data in a multiscale fashion. A new computing paradigm for indirect searching for such a macrostate, called Data Mechanics, is devised by iteratively building a surrogate geometric system with a pair of nearly optimal marginal ultrametrics on row and column spaces. The coupling measure minimizing the Gromov-Wasserstein distance of these two marginal geometries is also seen to be in the vicinity of the macrostate. This resultant coupling geometry reveals multiscale block pattern information that characterizes multiple layers of interacting relationships between clusters on row and on column axes. It is the nonparametric information content of a binary bipartite network. This coupling geometry is then demonstrated to shed new light and bring resolution to interaction issues in community ecology and in gene-content-based phylogenetics. Its implied global inferences are expected to have high potential in many scientific areas.

[1]  Richard Cole,et al.  A unified access bound on comparison-based dynamic dictionaries , 2007, Theor. Comput. Sci..

[2]  R. Häggkvist,et al.  Bipartite graphs and their applications , 1998 .

[3]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[4]  Amin Saberi,et al.  A Sequential Algorithm for Generating Random Graphs , 2007, APPROX-RANDOM.

[5]  W. Sischo,et al.  A clinical trial evaluating prophylactic and therapeutic antibiotic use on health and performance of preweaned calves. , 2005, Journal of dairy science.

[6]  B. Snel,et al.  Genome phylogeny based on gene content , 1999, Nature Genetics.

[7]  P. Anderson More is different. , 1972, Science.

[8]  M. Mézard,et al.  Spin Glass Theory and Beyond , 1987 .

[9]  J. Stephen Brewer,et al.  Ecological Assembly Rules: Perspectives, Advances, Retreats , 2000 .

[10]  Werner Ulrich,et al.  A null model algorithm for presence-absence matrices based on proportional resampling , 2012 .

[11]  Frank Harary,et al.  Graph Theory , 2016 .

[12]  Amin Saberi,et al.  A Sequential Algorithm for Generating Random Graphs , 2007, Algorithmica.

[13]  J. Diamond,et al.  Origin of the New Hebridean avifauna , 1976 .

[14]  L. Stone,et al.  The checkerboard score and species distributions , 1990, Oecologia.

[15]  N. Gotelli How Do Communities Come Together? , 1999, Science.

[16]  G. Davis,et al.  Corporate Elite Networks and Governance Changes in the 1980s , 1997, American Journal of Sociology.

[17]  P. Koehl,et al.  Multi-Scale Clustering by Building a Robust and Self Correcting Ultrametric Topology on Data Points , 2013, PloS one.

[18]  T. Case Niche overlap and the assembly of island lizard communities , 1983 .

[19]  M. Mézard,et al.  Nature of the Spin-Glass Phase , 1984 .

[20]  HERBERT A. SIMON,et al.  The Architecture of Complexity , 1991 .

[21]  B. McCowan,et al.  The effect of rehabilitation of northern elephant seals (Mirounga angustirostris) on antimicrobial resistance of commensal Escherichia coli. , 2009, Veterinary microbiology.

[22]  Daniel Simberloff,et al.  The Assembly of Species Communities: Chance or Competition? , 1979 .

[23]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Bryan F. J. Manly,et al.  A Note on the Analysis of Species Co‐Occurrences , 1995 .

[25]  John M. Logsdon,et al.  Archaeal genomics: Do archaea have a mixed heritage? , 1998, Current Biology.

[26]  Facundo Mémoli,et al.  Spectral Gromov-Wasserstein distances for shape matching , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[27]  Chen Chen,et al.  Bootstrapping on Undirected Binary Networks Via Statistical Mechanics , 2014, Journal of Statistical Physics.

[28]  Feng Chen,et al.  Patterns and Implications of Gene Gain and Loss in the Evolution of Prochlorococcus , 2007, PLoS genetics.

[29]  Paul A. Keddy,et al.  Community Assembly Rules, Morphological Dispersion, and the Coexistence of Plant Species , 1998 .

[30]  Yuguo Chen,et al.  Sequential Monte Carlo Methods for Statistical Analysis of Tables , 2005 .

[31]  P. Bork,et al.  Measuring genome evolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[32]  A. Naor,et al.  Ultrametric skeletons , 2011, Proceedings of the National Academy of Sciences.

[33]  J. Crutchfield Between order and chaos , 2011, Nature Physics.

[34]  R. Whittaker,et al.  Assembly Rules Demonstrated in a Saltmarsh Community , 1995 .

[35]  W. Sischo,et al.  Assessing antibiotic resistance in fecal Escherichia coli in young calves using cluster analysis techniques. , 2003, Preventive veterinary medicine.

[36]  G. Toulouse,et al.  Ultrametricity for physicists , 1986 .