Sufficient sampling for asymptotic minimum species richness estimators.

Biodiversity sampling is labor intensive, and a substantial fraction of a biota is often represented by species of very low abundance, which typically remain undetected by biodiversity surveys. Statistical methods are widely used to estimate the asymptotic number of species present, including species not yet detected. Additional sampling is required to detect and identify these species, but richness estimators do not indicate how much sampling effort (additional individuals or samples) would be necessary to reach the asymptote of the species accumulation curve. Here we develop the first statistically rigorous nonparametric method for estimating the minimum number of additional individuals, samples, or sampling area required to detect any arbitrary proportion (including 100%) of the estimated asymptotic species richness. The method uses the Chao1 and Chao2 nonparametric estimators of asymptotic richness, which are based on the frequencies of rare species in the original sampling data. To evaluate the performance of the proposed method, we randomly subsampled individuals or quadrats from two large biodiversity inventories (light trap captures of Lepidoptera in Great Britain and censuses of woody plants on Barro Colorado Island [BCI], Panama). The simulation results suggest that the method performs well but is slightly conservative for small sample sizes. Analyses of the BCI results suggest that the method is robust to nonindependence arising from small-scale spatial aggregation of species occurrences. When the method was applied to seven published biodiversity data sets, the additional sampling effort necessary to capture all the estimated species ranged from 1.05 to 10.67 times the original sample (median approximately 2.23). Substantially less effort is needed to detect 90% of the species (0.33 to 1.10 times the original effort; median approximately 0.80).
An Excel spreadsheet tool is provided for calculating necessary sampling effort for either abundance data or replicated incidence data.
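
As a rough illustration of how a rare-species-based estimator works, the sketch below computes the classic form of Chao1 from a list of per-species abundances. The abstract does not give the formula, so the expression used here (observed richness plus f1^2 / (2 f2), with the usual bias-corrected fallback when there are no doubletons) is an assumption taken from the standard definition of Chao1, and the function name `chao1` and the sample counts are hypothetical. It does not implement the additional-sampling-effort calculation itself.

```python
from collections import Counter


def chao1(abundances):
    """Classic Chao1 estimate of asymptotic species richness.

    abundances: iterable of per-species individual counts in the sample.
    Assumes the standard form S_obs + f1^2 / (2 * f2), where f1 and f2 are
    the numbers of species seen exactly once and exactly twice.
    """
    counts = [c for c in abundances if c > 0]  # ignore undetected species
    s_obs = len(counts)                        # observed richness
    freq = Counter(counts)                     # abundance -> number of species
    f1, f2 = freq[1], freq[2]                  # singletons, doubletons
    if f2 > 0:
        return s_obs + f1 ** 2 / (2 * f2)
    # Bias-corrected fallback when no doubletons are present.
    return s_obs + f1 * (f1 - 1) / 2


# Hypothetical sample: 8 observed species, 3 singletons, 2 doubletons.
sample = [10, 5, 1, 1, 1, 2, 2, 3]
estimate = chao1(sample)
detected_fraction = len(sample) / estimate  # share of estimated richness seen
```

The ratio of observed to estimated richness in the last line is the starting proportion from which any target proportion of the estimated asymptote would have to be reached by further sampling.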
