Evaluating the popularity of R in ecology

The programming language R is widely used in many fields. We explored the extent of reported R use in the field of ecology using the Web of Science and text mining. We analyzed the frequencies of R packages reported in more than 60,000 peer‐reviewed articles published in 30 ecology journals during a 10‐yr period ending in 2017. The number of studies reported using R as their primary tool in data analysis increased linearly from 11.4% in 2008 to 58.0% in 2017. The top 10 packages reported were lme4, vegan, nlme, ape, MuMIn, MASS, mgcv, ade4, multcomp, and car. The increasing popularity of R has most likely furthered open science in ecological research because it can improve reproducibility of analyses and captures workflows when scripts and codes are included and shared. These findings may not be entirely unique to R because there are other programming languages used by ecologists, but they do strongly suggest that given the relatively high frequency of reported use of R, it is a significant component of contemporary analytics in the field of ecology.

[1]  Robert Lanfear,et al.  Public Data Archiving in Ecology and Evolution: How Well Are We Doing? , 2015, PLoS biology.

[2]  K. A. S. Mislan,et al.  Elevating the status of code in ecology , 2015, bioRxiv.

[3]  Justin C. Touchon,et al.  The mismatch between current statistical practice and doctoral training in ecology , 2016 .

[4]  Ethan P. White,et al.  Skills and Knowledge for Data-Intensive Environmental Research , 2017, Bioscience.

[5]  G. Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Permutation P -values Should Never Be Zero: Calculating Exact P -values When Permutations Are Randomly Drawn , 2011 .

[6]  W. Christopher Lenhardt,et al.  The Tao of open science for ecology , 2015 .

[7]  Mollie E. Brooks,et al.  Generalized linear mixed models: a practical guide for ecology and evolution. , 2009, Trends in ecology & evolution.

[8]  Sylvia Tippmann,et al.  Programming tools: Adventures with R , 2014, Nature.

[9]  L. Excoffier,et al.  Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. , 1992, Genetics.

[10]  P. Dixon VEGAN, a package of R functions for community ecology , 2003 .

[11]  Achim Zeileis,et al.  Motivation, values, and work design as drivers of participation in the R open source project for statistical computing , 2015, Proceedings of the National Academy of Sciences.

[12]  P. Peres‐Neto,et al.  Why phylogenies do not always predict ecological differences , 2017 .

[13]  Pierre Legendre,et al.  Numerical Ecology with R , 2011 .

[14]  Christopher J. Lortie Open Sesame: R for Data Science is Open Science , 2017 .

[15]  Luis Mauricio Bini,et al.  Mantel test in population genetics , 2013, Genetics and molecular biology.

[16]  Ning Jiang,et al.  Our path to better science in less time using open data science tools , 2017, Nature Ecology &Evolution.

[17]  Nick Barnes Publish your computer code: it is good enough , 2010, Nature.

[18]  Philip M. Dixon,et al.  Speeding Up Ecological and Evolutionary Computations in R; Essentials of High Performance Computing for Biologists , 2015, PLoS Comput. Biol..

[19]  Megan D. Higgs Ecology and Statistics: A Healthy Union? , 2015 .