taxize: taxonomic search and retrieval in R

All species are hierarchically related to one another, and we use taxonomic names to label the nodes in this hierarchy. Taxonomic data are increasingly available on the web, but scientists need a way to access them programmatically that is both easy and reproducible. We have developed taxize, an open-source software package for the R language (freely available from http://cran.r-project.org/web/packages/taxize/index.html). taxize provides simple, programmatic access to taxonomic data from 13 data sources around the web. We discuss the need for a taxonomic toolbelt in R and outline a suite of use cases for which taxize is well suited (a full workflow is included as an appendix). The taxize package facilitates open and reproducible science by allowing taxonomic data collection to be done in the open-source R platform.
