Abstract
In this paper we present a total species estimator based on modelling the rate of change of a species accumulation curve (SAC). The proposed approach calculates an accumulation rate curve (ARC) for new species conditional on observed data and extrapolates it using parametric functions with varying rates of decay. The curve fits are integrated to obtain estimates for undetected species and a weighted estimate is calculated by optimizing a loss function subject to a set of restrictions. Confidence intervals are evaluated using a parametric bootstrap of aggregate counts, with the underlying count covariances estimated from a regularized mixture distribution fit to observed count data. A data smoothing technique and adjusting for bias are also discussed. The method is tested using a simulation study and applied to two example datasets. The results indicate that the proposed method is robust in a majority of cases and outperforms existing methods in bias and mean squared error. Performance is especially improved when the proportion of unobserved species is high. Confidence interval coverage is noticeably better compared to existing methods and conservative interval widths are maintained. The smoothing technique is also shown to be effective in reducing mean squared error under certain conditions.
| Original language | English |
|---|---|
| Pages (from-to) | 311-345 |
| Number of pages | 35 |
| Journal | Environmental and Ecological Statistics |
| Volume | 32 |
| Issue number | 1 |
| Early online date | 18 Feb 2025 |
| DOIs | |
| Publication status | Published - Mar 2025 |
Bibliographical note
Acknowledgements: Professors Michael Escobar and Wei Xu for discussions on this topic. Professor Alexander Shestopaloff for feedback and reviewing this manuscript.Data Availability Statement
Plant microbiome data from Applied Example 1 is available on Dryad (https://datadryad.org/stash/dataset/doi:https://doi.org/10.5061/dryad.g60r3). Code and data frames to replicate ARC estimator results for the simulation study and applied examples are available on GitHub (https://github.com/kshestop/ARC).Funding
No funding was received to assist with the preparation of this manuscript.
Keywords
- Accumulation rate
- Statistical ecology
- Tail estimation
- Total species
- Variance estimation
- Weighted estimator
Fingerprint
Dive into the research topics of 'An accumulation rate curve estimator for total species'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS