Effect of inventory method on niche models: Random versus systematic error

Abstract Data from large-scale biological inventories are essential for understanding and managing Earth's ecosystems. The Forest Inventory and Analysis Program (FIA) of the U.S. Forest Service is the largest biological inventory in North America; however, the FIA inventory recently changed from an amalgam of different approaches to a nationally-standardized approach in 2000. Full use of both data sets is clearly warranted to target many pressing research questions including those related to climate change and forest resources. However, full use requires lumping FIA data from different regionally-based designs (pre-2000) and/or lumping the data across the temporal changeover. Combining data from different inventory types must be approached with caution as inventory types represent different probabilities of detecting trees per sample unit, which can ultimately confound temporal and spatial patterns found in the data. Consequently, the main goal of this study is to evaluate the effect of inventory on a common analysis in ecology, modeling of climatic niches (or species-climate relations). We use non-parametric multiplicative regression (NPMR) to build and compare niche models for 41 tree species from the old and new FIA design in the Pacific coastal United States. We discover two likely effects of inventory on niche models and their predictions. First, there is an increase from 4 to 6% in random error for modeled predictions from the different inventories when compared to modeled predictions from two samples of the same inventory. Second, systematic error (or directional disagreement among modeled predictions) is detectable for 4 out of 41 species among the different inventories: Calocedrus decurrens , Pseudotsuga menziesii , and Pinus ponderosa , and Abies concolor . Hence, at least 90% of niche models and predictions of probability of occurrence demonstrate no obvious effect from the change in inventory design. Further, niche models built from sub-samples of the same data set can yield systematic error that rivals systematic error in predictions for models built from two separate data sets. This work corroborates the pervasive and pressing need to quantify different types of error in niche modeling to address issues associated with data quality and large-scale data integration.

[1]  C. Graham,et al.  INTEGRATING PHYLOGENETICS AND ENVIRONMENTAL NICHE MODELS TO EXPLORE SPECIATION MECHANISMS IN DENDROBATID FROGS , 2004, Evolution; international journal of organic evolution.

[2]  Terrestrial Environments,et al.  Ecological indicators for the nation , 2013 .

[3]  Peter E. Thornton,et al.  Generating surfaces of daily meteorological variables over large regions of complex terrain , 1997 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning Theory , 2001 .

[5]  G. Ferrell,et al.  Indicators of Fomes annosus Root Decay and Bark Beetle Susceptibility in Sapling White Fir , 1976 .

[6]  Jennifer A. Miller,et al.  Incorporating spatial dependence in predictive vegetation models , 2007 .

[7]  Ralf Wieland,et al.  Classification in conservation biology: A comparison of five machine-learning methods , 2010, Ecol. Informatics.

[8]  W. B. Critchfield Hybridization of the California Firs , 1988, Forest Science.

[9]  G. E. Hutchinson,et al.  A treatise on limnology. , 1957 .

[10]  David R. B. Stockwell,et al.  Effects of sample size on accuracy of species distribution models , 2002 .

[11]  R. Kadmon,et al.  A SYSTEMATIC ANALYSIS OF FACTORS AFFECTING THE PERFORMANCE OF CLIMATIC ENVELOPE MODELS , 2003 .

[12]  Andrew N. Gray,et al.  Monitoring stand structure in mature coastal Douglas-fir forests: effect of plot size , 2003 .

[13]  William A. Bechtold,et al.  The enhanced forest inventory and analysis program - national sampling design and estimation procedures , 2005 .

[14]  J. Hoeting,et al.  FACTORS AFFECTING SPECIES DISTRIBUTION PREDICTIONS: A SIMULATION MODELING EXPERIMENT , 2005 .

[15]  A. Peterson,et al.  Predicting distributions of known and unknown reptile species in Madagascar , 2003, Nature.

[16]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[17]  J. Svenning,et al.  Limited filling of the potential range in European tree species , 2004 .

[18]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[19]  Christopher M. Oswalt,et al.  Tree migration detection through comparisons of historic and current forest inventories , 2009 .

[20]  Antoine Guisan,et al.  Predictive habitat distribution models in ecology , 2000 .

[21]  M. Araújo,et al.  Five (or so) challenges for species distribution modelling , 2006 .

[22]  A. Prasad,et al.  Estimating potential habitat for 134 eastern US tree species under six climate scenarios , 2008 .

[23]  Rafael Pino-Mejías,et al.  Predicting the potential habitat of oaks with data mining models and the R system , 2010, Environ. Model. Softw..

[24]  Victor A. Rudis,et al.  Comprehensive regional resource assessments and multipurpose uses of forest inventory and analysis data, 1976 to 2001: a review. , 2003 .

[25]  Graham Floater,et al.  Spatial distribution and habitat preference of the endangered tarantula, Brachypelma klaasi (Araneae: Theraphosidae) in Mexico , 2000, Biodiversity & Conservation.

[26]  Carol A. Gotway,et al.  Statistical Methods for Spatial Data Analysis , 2004 .

[27]  A. Hampe Bioclimate envelope models: what they detect and what they hide , 2004 .

[28]  M. Araújo,et al.  Validation of species–climate impact models under climate change , 2005 .

[29]  Marie-Josée Fortin,et al.  Spatial Autocorrelation in Ecological Studies: A Legacy of Solutions and Myths , 2009 .

[30]  John Stanisic,et al.  Reconciling paleodistribution models and comparative phylogeography in the Wet Tropics rainforest land snail Gnarosophia bellendenkerensis (Brazier 1875) , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[31]  R. G. Davies,et al.  Methods to account for spatial autocorrelation in the analysis of species distributional data : a review , 2007 .

[32]  M. Sykes,et al.  Methods and uncertainties in bioclimatic envelope modelling under climate change , 2006 .

[33]  O. Pybus,et al.  Phylogeny, niches, and relative abundance in natural communities. , 2008, Ecology.

[34]  K. J. Wessels,et al.  An evaluation of the gradsect biological survey method , 1998, Biodiversity & Conservation.

[35]  Neville D. Crossman,et al.  An invasive plant and climate change threat index for weed risk management: Integrating habitat distribution pattern and dispersal process , 2011 .

[36]  Simon Ferrier,et al.  Evaluating the predictive performance of habitat models developed using logistic regression , 2000 .

[37]  A. Jarvis,et al.  Use of GIS for Optimizing a Collecting Mission for a Rare Wild Pepper (Capsicum flexuosum Sendtn.) in Paraguay , 2005, Genetic Resources and Crop Evolution.

[38]  J. Oksanen,et al.  Continuum theory revisited: what shape are species responses along ecological gradients? , 2002 .

[39]  M. Hutchinson,et al.  Splines — more than just a smooth interpolator , 1994 .

[40]  N. Crookston,et al.  Quantifying the abundance of co-occurring conifers along Inland Northwest (USA) climate gradients. , 2008, Ecology.

[41]  Trevor Hastie,et al.  Making better biogeographical predictions of species’ distributions , 2006 .

[42]  Jacques Hebert,et al.  Study on European forestry information and communication system - Reports on forestry inventory and survey systems - Volume 1 - Country report for Belgium , 1997 .

[43]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[44]  B. Nelson,et al.  Endemism centres, refugia and botanical collection density in Brazilian Amazonia , 1990, Nature.

[45]  F. Jiguet,et al.  Predicted Climate‐Driven Bird Distribution Changes and Forecasted Conservation Conflicts in a Neotropical Savanna , 2009, Conservation biology : the journal of the Society for Conservation Biology.

[46]  Bruce McCune,et al.  Quantifying ecological thresholds from response surfaces , 2011 .

[47]  David W. Peterson,et al.  Modelling conifer species distributions in mountain forests of Washington State, USA , 2003 .

[48]  Steven J. Phillips,et al.  WHAT MATTERS FOR PREDICTING THE OCCURRENCES OF TREES: TECHNIQUES, DATA, OR SPECIES' CHARACTERISTICS? , 2007 .

[49]  R. Kadmon,et al.  EFFECT OF ROADSIDE BIAS ON THE ACCURACY OF PREDICTIVE MAPS PRODUCED BY BIOCLIMATIC MODELS , 2004 .

[50]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[51]  Vicente J. Monleon,et al.  Differences in forest area classification based on tree tally from variable- and fixed-radius plots , 2011 .

[52]  Jane Elith,et al.  Error and uncertainty in habitat models , 2006 .

[53]  Cástor Guisande,et al.  Factors affecting species richness of marine elasmobranchs , 2013, Biodiversity and Conservation.

[54]  P. Hernandez,et al.  The effect of sample size and species characteristics on performance of different species distribution modeling methods , 2006 .

[55]  J. Eadie,et al.  USING SPECIES–HABITAT MODELS TO TARGET CONSERVATION: A CASE STUDY WITH BREEDING MALLARDS , 2004 .

[56]  J. Evans,et al.  Gradient modeling of conifer species using random forests , 2009, Landscape Ecology.

[57]  A. Townsend Peterson,et al.  Novel methods improve prediction of species' distributions from occurrence data , 2006 .

[58]  Helen M. Regan,et al.  Mapping epistemic uncertainties and vague concepts in predictions of species distribution , 2002 .

[59]  Victor A. Rudis Fresh Ideas, Perspectives, and Protocols Associated with Forest Inventory and Analysis Surveys: Graduate Reports, 1974 to July 2001 , 2003 .

[60]  George M. Furnival,et al.  Forest Survey Sampling Designs: A History , 1999, Journal of Forestry.

[61]  Janet L. Ohmann,et al.  Predictive mapping of forest composition and structure with direct gradient analysis and nearest- neighbor imputation in coastal Oregon, U.S.A. , 2002 .

[62]  R. Carroll,et al.  Nonparametric Function Estimation for Clustered Data When the Predictor is Measured without/with Error , 2000 .

[63]  S. Yakowitz,et al.  A comparison of kriging with nonparametric regression methods , 1985 .

[64]  M. Austin Spatial prediction of species distribution: an interface between ecological theory and statistical modelling , 2002 .

[65]  N. Crookston,et al.  Empirical Analyses of Plant‐Climate Relationships for the Western United States , 2006, International Journal of Plant Sciences.

[66]  L. R. Grosenbaugh Plotless Timber Estimates—New, Fast, Easy , 1952 .

[67]  T. Barrett,et al.  Estimation procedures for the combined 1990s periodic forest inventories of California, Oregon, and Washington. , 2004 .

[68]  R. Carroll,et al.  Nonparametric Function Estimation of the Relationship between two Repeatedly Measured Variables , 1997 .

[69]  C. Sasser Ecological Indicators for the Nation , 2009 .

[70]  Stephen P. Hubbell,et al.  Drought sensitivity shapes species distribution patterns in tropical forests , 2007, Nature.

[71]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[72]  Donald B. Zobel,et al.  Local Variation in Intergrading Abies grandis-A. concolor Populations in the Central Oregon Cascades. II. Stomatal Reaction to Moisture Stress , 1974, Botanical Gazette.

[73]  L. Hannah,et al.  Climate change‐integrated conservation strategies , 2002 .

[74]  John M. Chambers,et al.  Graphical Methods for Data Analysis , 1983 .

[75]  Bruce McCune,et al.  Non-parametric habitat models with automatic interactions , 2006 .

[76]  A. Prasad,et al.  PREDICTING ABUNDANCE OF 80 TREE SPECIES FOLLOWING CLIMATE CHANGE IN THE EASTERN UNITED STATES , 1998 .

[77]  M. Araújo,et al.  Consequences of spatial autocorrelation for niche‐based models , 2006 .

[78]  Aaron M.Ellison PC‐ORD: Multivariate Analysis of Ecological Data , 1998, The Bulletin of the Ecological Society of America.

[79]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[80]  Peter E. Thornton,et al.  Climatic and biophysical controls on conifer species distributions in mountain forests of Washington State, USA , 2003 .

[81]  I. Jolliffe A Note on the Use of Principal Components in Regression , 1982 .

[82]  Z. Huaman,et al.  Assessing the Geographic Representativeness of Genebank Collections: the Case of Bolivian Wild Potatoes , 2000, Conservation biology : the journal of the Society for Conservation Biology.