Simple null model analysis subsumes a new species co‐occurrence index: A comment on Mainali et al. (2022)

Recently, Mainali et al. (2022, termed MSSF henceforth) proposed a new index of pairwise species cooccurrence based on the log odds ratio α of conditional occurrence probability. Under the assumption that the total numbers of occurrences of both species are fixed in a number of sites, they demonstrate that the associated random distribution of paired occurrences follows a noncentral hypergeometric distribution, with α being the single unknown parameter. MSSF call the maximum likelihood estimator of α the ‘affinity’ of both species and suggest that it might serve as an index of species cooccurrence. Note that in most of the biodiversity literature, α refers to local (withinsite) species richness. The affinity index α is centred around zero (random association), with positive index values indicating increased numbers of joint species occurrences. For pairs of sites that share species, it behaves similarly to Jaccard's index, with positive values indicating a relatively large number of shared species (species aggregation) and negative values indicating a relatively small number shared species (species segregation). As recognized by MSSF, an analogous probabilistic approach was proposed almost a decade ago by Veech (2013). Recently and independently, the hypergeometric distribution was introduced by Carmona and Pärtel (2020) to estimate dark diversity and by Zhou et al. (2022) in medical bibliometrics. Arita (2016) first noted that Veech's (2013) probabilistic approach is identical to Fisher's exact test for a 2 × 2 matrix of species cooccurrence incidences. Importantly, the affinity index is defined only for individual pairs of species or pairs of sites. But cooccurrence data are usually organized as a matrix, with the occurrence of multiple species (=rows) recorded at multiple sites (=columns). For such matrices, indices of species cooccurrences quantify the level of βdiversity (betweensite differences in species composition). It is unclear how the affinity index would be used with such a matrix, which contains many pairs of species and many pairs of sites. Taking average affinity scores across all possible pairs of species or sites potentially obscures patterns from multiple species (and site) interactions (Chao et al., 2008) and loses information on the ordered loss of species among sites (the degree of nestedness, Baselga, 2010). Like other pairwise indices (e.g. Jaccard, Sørensen and Simpson), the affinity index may be difficult to use or interpret within a biogeographic and ecological context based on multiple sites. MSSF argue that the known sensitivity of common indices of pairwise cooccurrence to the number of occurrences invalidates the use of these traditional indices. They also make the bold claim that ‘half a century of development in analyses of cooccurrence has been marred by failures’ and that this new index will ‘resolve all the aforementioned challenges’. However, these sweeping claims are undermined by three problems:

[1]  Kumar P. Mainali,et al.  A better index for analysis of co-occurrence and similarity. , 2022, Science advances.

[2]  Xiaobei Zhou,et al.  A probabilistic model for co-occurrence analysis in bibliometrics , 2021, J. Biomed. Informatics.

[3]  M. Pärtel,et al.  Estimating probabilistic site‐specific species pools and dark diversity from co‐occurrence data , 2020 .

[4]  P. Keil Z ‐scores unite pairwise indices of ecological similarity and association for binary data , 2019, Ecosphere.

[5]  A. Magurran,et al.  A balance of winners and losers in the Anthropocene. , 2019, Ecology letters.

[6]  N. Gotelli,et al.  Species richness correlates of raw and standardized co‐occurrence metrics , 2018 .

[7]  D. Simberloff,et al.  Plant recording across two centuries reveals dramatic changes in species diversity of a Mediterranean archipelago , 2017, Scientific Reports.

[8]  H. T. Arita,et al.  Species co-occurrence analysis: pairwise versus matrix-level approaches , 2016 .

[9]  Joseph A. Veech,et al.  A probabilistic model for analysing species co-occurrence , 2013 .

[10]  Werner Ulrich,et al.  Statistical challenges in null model analysis , 2012 .

[11]  A. Chao,et al.  A Two‐Stage Probabilistic Approach to Multiple‐Community Similarity Indices , 2008, Biometrics.

[12]  Werner Ulrich,et al.  Null model analysis of species nestedness patterns. , 2007, Ecology.

[13]  N. Gotelli Null model analysis of species co-occurrence patterns , 2000 .

[14]  Wirt Atmar,et al.  A comparative analysis of nested subset patterns of species composition , 1997, Oecologia.

[15]  J. Wilson Null models for assembly rules: the Jack Horner effect is more insidious than the Narcissus effect , 1995 .

[16]  David W. Winkler,et al.  20. A Null Model for Null Models in Biogeography , 1984 .

[17]  Werner Ulrich,et al.  Bi-dimensional null model analysis of presence-absence binary matrices. , 2018, Ecology.

[18]  Hanna Tuomisto,et al.  The tangled link between β- and γ-diversity: a Narcissus effect weakens statistical inferences in null model analyses of diversity patterns , 2017 .

[19]  A. Baselga Partitioning the turnover and nestedness components of beta diversity , 2010 .