neotoma: A Programmatic Interface to the Neotoma Paleoecological Database

Paleoecological data are integral to ecological and evolutionary analyses. First, they provide an opportunity to study ecological and evolutionary interactions between communities and abiotic environments across time scales. Second, they allow us to study the long-term outcomes of processes that occur infrequently, such as megadroughts, hurricanes, and rapid climate change. Third, the past allows us to study ecological processes in the absence of widespread anthropogenic influence. The R package neotoma, described here, obtains and manipulates data from the NeotomaPaleoecological Database (Neotoma Database: http://www.neotomadb.org ). The Neotoma Database is a public-domain searchable repository for multiproxypaleoecological records spanning the past 5 million years. The Neotoma Database provides the data and cyberinfrastructure to study spatiotemporal dynamics of species and communities from the Pliocene to the present; neotoma provides a user interface to enable these studies. neotoma searches the Neotoma Database using terms that can include location, taxon name, or dataset type (e.g., pollen, vertebrate fauna, ostracode) using the Database’s Application Programming Interface (API). The package returns a set of nested metadata associated with the site, including the full assemblage record, geochronological data to enable the rebuilding of age models, dataset metadata (e.g. age range of samples, date of accession into Neotoma, principal investigator), and site metadata (e.g. location, site name and description). neotoma also provides tools to allow cross-site analysis, including the ability to standardize taxonomies using built-in taxonomies derived from the published literature or user-provided taxonomies. We show how key functions in the neotoma package can be used, by reproducing analytic examples from the published literature focusing on Pinus migration following deglaciation and shifts in mammal species distributions during the Pleistocene.

[1]  S. Peters,et al.  paleobioDB: an R package for downloading, visualizing and processing data from the Paleobiology Database , 2015 .

[2]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[3]  S. Harrison,et al.  Global biomass burning: a synthesis and review of Holocene paleofire records and their controls , 2013 .

[4]  M. Blaauw Methods and code for 'classical' age-modelling of radiocarbon sequences. , 2010 .

[5]  H. Gregory McDonald,et al.  Spatial Response of Mammals to Late Quaternary Environmental Fluctuations , 1996, Science.

[6]  D. Raup,et al.  Periodicity of extinctions in the geologic past. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[7]  R. Mathewes A palynological study of postglacial vegetation changes in the University Research Forest, southwestern British Columbia , 1973 .

[8]  P. Tzedakis Vegetation Change through Glacial-Interglacial Cycles: A Long Pollen Sequence Perspective , 1994 .

[9]  S. Juggins rioja: Analysis of Quaternary Science Data , 2012 .

[10]  P C Wainwright,et al.  rfishbase: exploring, manipulating and visualizing FishBase data from R. , 2012, Journal of fish biology.

[11]  G. Dietl,et al.  Conservation paleobiology: putting the dead to work. , 2011, Trends in ecology & evolution.

[12]  Olivier Blarquez,et al.  paleofire: An R package to analyse sedimentary charcoal records from the Global Charcoal Database to reconstruct past biomass burning , 2014, Comput. Geosci..

[13]  M. B. Davis,et al.  Quaternary history and the stability of forest communities , 1981 .

[14]  M. Pellatt,et al.  Holocene Tree Line and Climate Change on the Queen Charlotte Islands, Canada , 1997, Quaternary Research.

[15]  Gavin Simpson,et al.  Analogue and Weighted Averaging Methods for Palaeoecology , 2015 .

[16]  L. Hills,et al.  Holocene migration of lodgepole pine (Pinus contorta var. latifolia) in southern Yukon, Canada , 2013 .

[17]  A. Burton,et al.  Biodiversity: past, present and future , 2012, Biology Letters.

[18]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[19]  C. Barnosky Postglacial Vegetation and Climate in the Northwestern Great Plains of Montana , 1989, Quaternary Research.

[20]  D. Gavin,et al.  A statistical approach to evaluating distance metrics and analog assignments for pollen records , 2003, Quaternary Research.

[21]  Simon Brewer,et al.  Paleoecoinformatics: applying geohistorical data to ecological questions. , 2012, Trends in ecology & evolution.

[22]  Matthew B. Jones,et al.  Challenges and Opportunities of Open Data in Ecology , 2011, Science.

[23]  A. Lézine,et al.  Databases and their application , 2013 .

[24]  P. D. Polly,et al.  From Card Catalogs to Computers: Databases in Vertebrate Paleontology , 2013 .

[25]  L. Cwynar,et al.  Post-glacial population growth rates of Pinus contorta ssp. Latifolia in western Canada , 1991 .

[26]  K. Gajewski,et al.  Modern pollen data from North America and Greenland for multi-scale paleoenvironmental applications , 2005 .

[27]  A. Lézine,et al.  POLLEN METHODS AND STUDIES | Databases and Their Application , 2013 .

[28]  James Regetz,et al.  Advances in global change research require open science by individual researchers , 2012 .

[29]  C. Marshall,et al.  Has the Earth’s sixth mass extinction already arrived? , 2011, Nature.

[30]  S. Goring,et al.  Pollen assemblage richness does not reflect regional plant species richness: a cautionary tale , 2013 .

[31]  C. Paciorek,et al.  Deposition times in the northeastern United States during the Holocene: establishing valid priors for Bayesian age models , 2012 .

[32]  Karen M. Layou,et al.  Phanerozoic Trends in the Global Diversity of Marine Invertebrates , 2008, Science.

[33]  Y. Bergeron,et al.  Disentangling the trajectories of alpha, beta and gamma plant diversity of North American boreal ecoregions since 15,500 years , 2014, Front. Ecol. Evol..

[34]  P. Bartlein,et al.  LATE‐QUATERNARY VEGETATION DYNAMICS IN NORTH AMERICA: SCALING FROM TAXA TO BIOMES , 2004 .

[35]  H. TUXEN,et al.  Vegetation history , 1988, Handbook of vegetation science.

[36]  John W. Williams,et al.  Obtaining accurate and precise environmental reconstructions from the modern analog technique and North American surface pollen dataset , 2008 .

[37]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[38]  Gavin Simpson,et al.  Analogue Methods in Palaeoecology: Using the analogue Package , 2007 .

[39]  Susanne A. Fritz,et al.  Diversity in time and space: wanted dead and alive. , 2013, Trends in ecology & evolution.

[40]  Hadley Wickham,et al.  Reshaping Data with the reshape Package , 2007 .

[41]  Eduard Szöcs,et al.  taxize: taxonomic search and retrieval in R , 2013, F1000Research.

[42]  J. Christen,et al.  Flexible paleoclimate age-depth models using an autoregressive gamma process , 2011 .

[43]  A. Behrensmeyer,et al.  Building Links Between Ecology and Paleontology Using Taphonomic Studies of Recent Vertebrate Communities , 2012 .

[44]  S. Ferrier,et al.  Modeling the climatic drivers of spatial patterns in vegetation composition since the Last Glacial Maximum , 2013 .