Linear filtering reveals false negatives in species interaction data

Species interaction datasets, often represented as sparse matrices, are usually collected through observation studies targeted at identifying species interactions. Due to the extensive required sampling effort, species interaction datasets usually contain many false negatives, often leading to bias in derived descriptors. We show that a simple linear filter can be used to detect false negatives by scoring interactions based on the structure of the interaction matrices. On 180 different datasets of various sizes, sparsities and ecological interaction types, we found that on average in about 75% of the cases, a false negative interaction got a higher score than a true negative interaction. Furthermore, we show that this filter is very robust, even when the interaction matrix contains a very large number of false negatives. Our results demonstrate that unobserved interactions can be detected in species interaction datasets, even without resorting to information about the species involved.

[1]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[2]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[3]  Owen L. Petchey,et al.  Interaction strengths in food webs: issues and opportunities , 2004 .

[4]  F. O. Isinkaye,et al.  Recommendation systems: Principles, methods and evaluation , 2015 .

[5]  Louis-Félix Bersier,et al.  Sampling effects and the robustness of quantitative and qualitative food-web descriptors. , 2004, Journal of theoretical biology.

[6]  Nico Blüthgen,et al.  Bottom‐up control and co‐occurrence in complex communities: honeydew and nectar determine a rainforest ant mosaic , 2004 .

[7]  Makoto Kato,et al.  Insect-flower Relationship in the Temperate Deciduous Forest of Kibune, Kyoto : An Overview of the Flowering Phenology and the Seasonal Pattern of Insect Visits , 1990 .

[8]  Wei Zeng,et al.  Uncovering the information core in recommender systems , 2014, Scientific Reports.

[9]  D. Roff,et al.  Fine-tuned Bee-Flower Coevolutionary State Hidden within Multiple Pollination Interactions , 2014, Scientific Reports.

[10]  Stefano Allesina,et al.  The ghost of nestedness in ecological networks , 2013, Nature Communications.

[11]  B. De Baets,et al.  Exploration and prediction of interactions between methanotrophs and heterotrophs. , 2013, Research in microbiology.

[12]  Robert R. Junker,et al.  Specialization on traits as basis for the niche‐breadth of flower visitors and as structuring mechanism of ecological networks , 2013 .

[13]  A. Dobson,et al.  Parasites dominate food web links. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Fillia Makedon,et al.  Using singular value decomposition approximation for collaborative filtering , 2005, Seventh IEEE International Conference on E-Commerce Technology (CEC'05).

[15]  M. Emmerson,et al.  MEASUREMENT OF INTERACTION STRENGTH IN NATURE , 2005 .

[16]  G. Wahba Spline models for observational data , 1990 .

[17]  Alex O. Dechtiar,et al.  Parasites of Fish from Lake of the Woods, Ontario , 1972 .

[18]  Jordi Bascompte,et al.  The architecture of mutualistic networks minimizes competition and increases biodiversity , 2009, Nature.

[19]  Ira R. Weiss,et al.  Issues and opportunities , 1988, DATB.

[20]  D. Vázquez,et al.  Evaluating sampling completeness in a desert plant-pollinator network. , 2012, The Journal of animal ecology.

[21]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[22]  Pedro Jordano,et al.  Sampling networks of ecological interactions , 2015, bioRxiv.

[23]  Miguel G. Matias,et al.  Inferring biotic interactions from proxies. , 2015, Trends in ecology & evolution.

[24]  William Stafford Noble,et al.  A new pairwise kernel for biological network inference with support vector machines , 2007, BMC Bioinformatics.

[25]  W. L. Jorgensen The Many Roles of Computation in Drug Discovery , 2004, Science.

[26]  A. Ives,et al.  Phylogenetic trait-based analyses of ecological networks. , 2013, Ecology.

[27]  J. Bascompte,et al.  The modularity of pollination networks , 2007, Proceedings of the National Academy of Sciences.

[28]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[29]  Shinichi Nakagawa,et al.  A Tale of Two Phylogenies: Comparative Analyses of Ecological Interactions , 2013, The American Naturalist.

[30]  Tamiji Inoue,et al.  Insect-flower Relationship in the Campus of Kyoto University, Kyoto : An Overview of the Flowering Phenology and the Seasonal Pattern of Insect Visits , 1990 .

[31]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[32]  Robert Tibshirani,et al.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices , 2010, J. Mach. Learn. Res..

[33]  Jochen Fründ,et al.  Sampling bias is a challenge for quantifying specialization and network structure: lessons from a quantitative niche model , 2016 .

[34]  Jordi Bascompte,et al.  Missing and forbidden links in mutualistic networks , 2011, Proceedings of the Royal Society B: Biological Sciences.

[35]  Stefano Allesina,et al.  The dimensionality of ecological networks. , 2013, Ecology letters.

[36]  Julie L. Yang,et al.  Affinity regression predicts the recognition code of nucleic acid binding proteins , 2015, Nature Biotechnology.

[37]  Carlos J. Melián,et al.  The nested assembly of plant–animal mutualistic networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Lloyd Goldwasser,et al.  SAMPLING EFFECTS AND THE ESTIMATION OF FOOD‐WEB PROPERTIES , 1997 .

[39]  N. Blüthgen,et al.  Preferences for sugars and amino acids and their conditionality in a diverse nectar‐feeding ant community , 2004 .

[40]  Nico Blüthgen,et al.  Why network analysis is often disconnected from community ecology: A critique and an ecologist's guide , 2010 .