An introduction to state-space modeling of ecological time series

State-space models (SSMs) are an important modeling framework for analyzing ecological time series. These hierarchical models are commonly used to model population dynamics and animal movement, and are now increasingly being used to model other ecological processes. SSMs are popular because they are flexible and they model the natural variation in ecological processes separately from observation error. Their flexibility allows ecologists to model continuous, count, binary, and categorical data with linear or nonlinear processes that evolve in discrete or continuous time. Modeling the two sources of stochasticity separately allows researchers to differentiate between biological stochasticity (e.g., in birth processes) and imprecision in the sampling methodology, and generally provides better estimates of the ecological quantities of interest than if only one source of stochasticity is directly modeled. Since the introduction of SSMs, a broad range of fitting procedures have been proposed. However, the variety and complexity of these procedures can limit the ability of ecologists to formulate and fit their own SSMs. In addition, many SSM users are unaware of the potential estimation problems they could encounter, and of the model selection and validation tools that can help them assess how well their models fit their data. In this paper, we present a review of SSMs that will provide a strong foundation to ecologists interested in learning about SSMs, introduce new tools to veteran SSM users, and highlight promising research directions for statisticians interested in ecological applications. The review is accompanied by an in-depth tutorial that demonstrates how SSMs models can be fitted and validated in R. Together, the review and tutorial present an introduction to SSMs that will help ecologists to formulate, fit, and validate their models.

[1]  Russell B. Millar,et al.  Conditional vs marginal estimation of the predictive loss of hierarchical models using WAIC and cross-validation , 2018, Stat. Comput..

[2]  Brian Dennis,et al.  Replicated sampling increases efficiency in monitoring biological populations. , 2010, Ecology.

[3]  Michael Betancourt,et al.  A Conceptual Introduction to Hamiltonian Monte Carlo , 2017, 1701.02434.

[4]  Edward L. Ionides,et al.  Statistical Inference for Partially Observed Markov Processes , 2015 .

[5]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[6]  Jonas Knape,et al.  ESTIMABILITY OF DENSITY DEPENDENCE IN MODELS OF TIME SERIES DATA. , 2008, Ecology.

[7]  Andrew Gelman,et al.  The Prior Can Often Only Be Understood in the Context of the Likelihood , 2017, Entropy.

[8]  A. Andrews,et al.  Applications of Kalman Filtering to Aerospace: 1960 to Present , 2010 .

[9]  B. Block,et al.  Shark baselines and the conservation role of remote coral reef ecosystems , 2018, Science Advances.

[10]  Dootika Vats,et al.  Revisiting the Gelman–Rubin Diagnostic , 2018, Statistical Science.

[11]  R Choquet,et al.  A hybrid symbolic-numerical method for determining model structure. , 2012, Mathematical biosciences.

[12]  Abstr Am SOME PROBLEMS IN ESTIMATING POPULATION SIZES FROM CATCH-ATAGE DATA , 1988 .

[13]  O. François,et al.  Approximate Bayesian Computation (ABC) in practice. , 2010, Trends in ecology & evolution.

[14]  John T. Finn,et al.  Bayesian state-space models reveal unobserved off-shore nocturnal migration from Motus data , 2018, Ecological Modelling.

[15]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[16]  J. Cavanaugh,et al.  A BOOTSTRAP VARIANT OF AIC FOR STATE-SPACE MODEL SELECTION , 1997 .

[17]  Stephen P. Brooks,et al.  Density dependence in North American ducks , 2004 .

[18]  Murdoch K. McAllister,et al.  A Bayesian state-space mark-recapture model to estimate exploitation rates in mixed-stock fisheries , 2006 .

[19]  M. Pitt,et al.  Filtering via Simulation: Auxiliary Particle Filters , 1999 .

[20]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[21]  Ken Aho,et al.  Model selection for ecologists: the worldviews of AIC and BIC. , 2014, Ecology.

[22]  Robert H. Shumway,et al.  Time series analysis and its applications : with R examples , 2017 .

[23]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[24]  Daniel P. Costa,et al.  Accuracy of ARGOS Locations of Pinnipeds at-Sea Estimated Using Fastloc GPS , 2010, PloS one.

[25]  Mikihiko Kai,et al.  Performance evaluation of information criteria for estimating a shape parameter in a Bayesian state-space biomass dynamics model , 2019, Fisheries Research.

[26]  M. Sköld,et al.  On observation distributions for state space models of population survey data. , 2011, The Journal of animal ecology.

[27]  Nando de Freitas,et al.  An Introduction to Sequential Monte Carlo Methods , 2001, Sequential Monte Carlo Methods in Practice.

[28]  Mollie E. Brooks,et al.  Generalized linear mixed models: a practical guide for ecology and evolution. , 2009, Trends in ecology & evolution.

[29]  Melvin J. Hinich,et al.  Time Series Analysis by State Space Methods , 2001 .

[30]  William A. Link,et al.  Model selection for the North American Breeding Bird Survey: A comparison of methods , 2017, The Condor.

[31]  Olivier Gimenez,et al.  Estimation of immigration rate using integrated population models , 2010 .

[32]  Richard A. Parker,et al.  WinBUGS for population ecologists: bayesian modeling using markov chain Monte Carlo methods , 2009 .

[33]  R. E. Kalman,et al.  New Results in Linear Filtering and Prediction Theory , 1961 .

[34]  Mevin B. Hooten,et al.  State‐space modeling to support management of brucellosis in the Yellowstone bison population , 2015 .

[35]  Martin Wæver Pedersen,et al.  State-space models for bio-loggers: A methodological road map , 2013 .

[36]  Matteo Fasiolo,et al.  A comparison of inferential methods for highly nonlinear state space models in ecology and epidemiology , 2014 .

[37]  James S. Clark,et al.  Using Hierarchical Bayes to Understand Movement, Health, and Survival in the Endangered North Atlantic Right Whale , 2013, PloS one.

[38]  José Manuel Benítez,et al.  On the use of cross-validation for time series predictor evaluation , 2012, Inf. Sci..

[39]  Ian Jonsen,et al.  Joint estimation over multiple individuals improves behavioural state inference from animal movement data , 2016, Scientific Reports.

[40]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[41]  Alan Hastings,et al.  FITTING POPULATION MODELS INCORPORATING PROCESS NOISE AND OBSERVATION ERROR , 2002 .

[42]  Nicholas G. Polson,et al.  Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model , 2012, Journal of the American Statistical Association.

[43]  L. Wasserman,et al.  The Selection of Prior Distributions by Formal Rules , 1996 .

[44]  T. Rothenberg Identification in Parametric Models , 1971 .

[45]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[46]  Aki Vehtari,et al.  Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC , 2015, Statistics and Computing.

[47]  David Campbell,et al.  An ANOVA test for parameter estimability using data cloning with application to statistical inference for dynamic systems , 2014, Comput. Stat. Data Anal..

[48]  Leo Polansky,et al.  Improving inference for nonlinear state‐space models of animal population dynamics given biased sequential life stage data , 2019, Biometrics.

[49]  P Besbeas,et al.  Exact inference for integrated population modelling , 2019, Biometrics.

[50]  Anders Nielsen,et al.  Estimation of time-varying selectivity in stock assessments using state-space models , 2014 .

[51]  A. Dobson,et al.  A Disease-Mediated Trophic Cascade in the Serengeti and its Implications for Ecosystem C , 2009, PLoS biology.

[52]  Byron J. T. Morgan,et al.  Weak Identifiability in Models for Mark-Recapture-Recovery Data , 2009 .

[53]  Anders Nielsen,et al.  Validation of ecological state space models using the Laplace approximation , 2017, Environmental and Ecological Statistics.

[54]  R. O’Hara,et al.  A review of Bayesian variable selection methods: what, how and which , 2009 .

[55]  D. Gamerman,et al.  A NON‐GAUSSIAN FAMILY OF STATE‐SPACE MODELS WITH EXACT MARGINAL LIKELIHOOD , 2013 .

[56]  J. Grand,et al.  Effects of model complexity and priors on estimation using sequential importance sampling/resampling for species conservation , 2016 .

[57]  Pejman Rohani,et al.  Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola , 2014, Proceedings of the Royal Society B: Biological Sciences.

[58]  Henrik Madsen,et al.  Estimation methods for nonlinear state-space models in ecology , 2011 .

[59]  S. Müller,et al.  Model Selection in Linear Mixed Models , 2013, 1306.2427.

[60]  Jim Q. Smith,et al.  Diagnostic checks of non‐standard time series models , 1985 .

[61]  Devin S Johnson,et al.  Continuous-time correlated random walk model for animal telemetry data. , 2008, Ecology.

[62]  Scott C. Schmidler,et al.  Monitoring Joint Convergence of MCMC Samplers , 2017 .

[63]  Jamie S Sanderlin,et al.  Precision gain versus effort with joint models using detection/non‐detection and banding data , 2019, Ecology and evolution.

[64]  George H. Balazs,et al.  Using Bayesian state-space modelling to assess the recovery and harvest potential of the Hawaiian green sea turtle stock , 2007 .

[65]  Sylvia Früiiwirth-Schnatter,et al.  Recursive residuals and model diagnostics for normal and non-normal state space models , 1996, Environmental and Ecological Statistics.

[66]  Jay Barlow,et al.  Bayesian state-space model of fin whale abundance trends from a 1991–2008 time series of line-transect surveys in the California Current , 2011 .

[67]  Mevin B. Hooten,et al.  A guide to Bayesian model selection for ecologists , 2015 .

[68]  P. Valpine Monte Carlo State-Space Likelihoods by Weighted Posterior Kernel Density Estimation , 2004 .

[69]  Robert Kohn,et al.  Efficient generalized cross-validation for state space models , 1987 .

[70]  Carsten F. Dormann,et al.  Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure , 2017 .

[71]  Anders Nielsen,et al.  TMB: Automatic Differentiation and Laplace Approximation , 2015, 1509.00660.

[72]  Ian D. Jonsen,et al.  Spatiotemporal modelling of marine movement data using Template Model Builder (TMB) , 2017 .

[73]  William H. Aeberhard,et al.  Review of State-Space Models for Fisheries Science , 2018 .

[74]  Aki Vehtari,et al.  Understanding predictive information criteria for Bayesian models , 2013, Statistics and Computing.

[75]  O. Ovaskainen,et al.  State-space models of individual animal movement. , 2008, Trends in ecology & evolution.

[76]  P Besbeas,et al.  Integrating Mark–Recapture–Recovery and Census Data to Estimate Animal Abundance and Demographic Parameters , 2002, Biometrics.

[77]  Michael A. West,et al.  Time Series: Modeling, Computation, and Inference , 2010 .

[78]  Brett T McClintock,et al.  When to be discrete: the importance of time formulation in understanding animal movement , 2014, Movement Ecology.

[79]  Johannes Ledolter,et al.  State-Space Analysis of Wildlife Telemetry Data , 1991 .

[80]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[81]  E. Ionides,et al.  Inference for dynamic and latent variable models via iterated, perturbed Bayes maps , 2015, Proceedings of the National Academy of Sciences.

[82]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[83]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[84]  Joseph M. O’Malley,et al.  Model selection and multi-model inference for Bayesian surplus production models: A case study for Pacific blue and striped marlin , 2015 .

[85]  Carsten F. Dormann,et al.  Model averaging in ecology: a review of Bayesian, information-theoretic, and tactical approaches for predictive inference , 2018, Ecological Monographs.

[86]  Anders Nielsen,et al.  Fast fitting of non-Gaussian state-space models to animal movement data via Template Model Builder. , 2015, Ecology.

[87]  Jason R. W. Merrick,et al.  A Hellinger distance approach to MCMC diagnostics , 2014 .

[88]  Mark A. Lewis,et al.  State-space models’ dirty little secrets: even simple linear Gaussian models can have estimation problems , 2015, Scientific Reports.

[89]  Byron J. T. Morgan,et al.  Detecting parameter redundancy , 1997 .

[90]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[91]  Subhash R. Lele,et al.  Estimability and Likelihood Inference for Generalized Linear Mixed Models Using Data Cloning , 2010 .

[92]  Byron J. T. Morgan,et al.  Methods for investigating parameter redundancy , 2004 .

[93]  C. Gouriéroux,et al.  Non-Gaussian State-Space Modeling of Nonstationary Time Series , 2008 .

[94]  Brett T. McClintock,et al.  A general discrete‐time modeling framework for animal movement using multistate random walks , 2012 .

[95]  Duncan Temple Lang,et al.  Programming With Models: Writing Statistical Algorithms for General Model Structures With NIMBLE , 2015, 1505.05093.

[96]  Ian D. Jonsen,et al.  Hierarchical State-Space Estimation of Leatherback Turtle Navigation Ability , 2010, PloS one.

[97]  Catherine A Calder,et al.  Accounting for uncertainty in ecological analysis: the strengths and limitations of hierarchical statistical modeling. , 2009, Ecological applications : a publication of the Ecological Society of America.

[98]  W. Zucchini,et al.  Hidden Markov Models for Time Series: An Introduction Using R , 2009 .

[99]  Andrew Thomas,et al.  MultiBUGS: A Parallel Implementation of the BUGS Modeling Framework for Faster Bayesian Inference , 2017, J. Stat. Softw..

[100]  Marc Mangel,et al.  Overcoming the Data Crisis in Biodiversity Conservation. , 2018, Trends in ecology & evolution.

[101]  William A. Link,et al.  Bayesian cross-validation for model evaluation and selection, with application to the North American Breeding Bird Survey. , 2015, Ecology.

[102]  Brian Dennis,et al.  DENSITY DEPENDENCE IN TIME SERIES OBSERVATIONS OF NATURAL POPULATIONS: ESTIMATION AND TESTING' , 1994 .

[103]  Byron J. T. Morgan,et al.  Modelling population dynamics : model formulation, fitting and assessment using state-space methods , 2014 .

[104]  E. A. Catchpole,et al.  Parameter redundancy in mark‐recovery models , 2012, Biometrical journal. Biometrische Zeitschrift.

[105]  A. Gelfand,et al.  Identifiability, Improper Priors, and Gibbs Sampling for Generalized Linear Models , 1999 .

[106]  Piet de Jong,et al.  A cross-validation filter for time series models , 1988 .

[107]  Andrew Gelman,et al.  Inference from Simulations and Monitoring Convergence , 2011 .

[108]  Giovanni Petris,et al.  An R Package for Dynamic Linear Models , 2010 .

[109]  Elizabeth E. Holmes,et al.  MARSS: Multivariate Autoregressive State-space Models for Analyzing Time-series Data , 2012, R J..

[110]  E. A. Catchpole,et al.  On the Near‐Singularity of Models for Animal Recovery Data , 2001, Biometrics.

[111]  William H. Aeberhard,et al.  Identifiable state‐space models: A case study of the Bay of Fundy sea scallop fishery , 2018, Canadian Journal of Statistics.

[112]  Mark N. Maunder,et al.  A state-space multistage life cycle model to evaluate population impacts in the presence of density dependence: illustrated with application to delta smelt (Hyposmesus transpacificus) , 2011 .

[113]  Ian D. Jonsen,et al.  ROBUST STATE-SPACE MODELING OF ANIMAL MOVEMENT DATA , 2005 .

[114]  S. Lele,et al.  ESTIMATING DENSITY DEPENDENCE, PROCESS NOISE, AND OBSERVATION ERROR , 2006 .

[115]  Margaret C. Siple,et al.  Population diversity in Pacific herring of the Puget Sound, USA , 2015, Oecologia.

[116]  Claire M. Postlethwaite,et al.  Effects of Temporal Resolution on an Inferential Model of Animal Movement , 2013, PloS one.

[117]  Subhash R. Lele Is non-informative Bayesian analysis appropriate for wildlife management: survival of San Joaquin Kit Fox and declines in amphibian populations , 2015 .

[118]  Jennifer Pohle,et al.  Selecting the Number of States in Hidden Markov Models: Pragmatic Solutions Illustrated Using Animal Movement , 2017 .

[119]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[120]  Andrew Harvey,et al.  Forecasting, Structural Time Series Models and the Kalman Filter , 1990 .

[121]  Ursula Klingmüller,et al.  Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood , 2009, Bioinform..

[122]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[123]  Andrew Gelman,et al.  Handbook of Markov Chain Monte Carlo , 2011 .

[124]  K. Burnham,et al.  Model selection: An integral part of inference , 1997 .

[125]  Jean-Dominique Lebreton,et al.  Parameter Identifiability and Model Selection in Capture‐Recapture Models: A Numerical Approach , 1998 .

[126]  Brendan A. Wintle,et al.  The Use of Bayesian Model Averaging to Better Represent Uncertainty in Ecological Models , 2003 .

[127]  Elizabeth E. Holmes,et al.  Inferring spatial structure from time‐series data: using multivariate state‐space models to detect metapopulation structure of California sea lions in the Gulf of California, Mexico , 2010 .

[128]  Scott A. Shaffer,et al.  State‐space framework for estimating measurement error from double‐tagging telemetry experiments , 2012 .

[129]  Rachel S. McCrea,et al.  Kent Academic Repository Versions of Research Enquiries Citation for Published Version Link to Record in Kar Parameter Redundancy in Discrete State-space and Integrated Models , 2022 .

[130]  David I Warton,et al.  The PIT-trap—A “model-free” bootstrap procedure for inference about regression models with discrete, multivariate responses , 2017, PloS one.

[131]  Mevin B. Hooten,et al.  A guide to Bayesian model checking for ecologists , 2018, Ecological Monographs.

[132]  Martin Krkošek,et al.  Study design and parameter estimability for spatial and temporal ecological models , 2017, Ecology and evolution.

[133]  R. Kohn,et al.  On Gibbs sampling for state space models , 1994 .

[134]  Christian P. Robert,et al.  The Bayesian choice : from decision-theoretic foundations to computational implementation , 2007 .

[135]  James R. Bence,et al.  Performance of deviance information criterion model selection in statistical catch-at-age analysis , 2008 .

[136]  Andrew Thomas,et al.  The BUGS project: Evolution, critique and future directions , 2009, Statistics in medicine.

[137]  Giovanni Petris,et al.  Dynamic Linear Models with R , 2009 .

[138]  S. Zeger,et al.  Latent Class Model Diagnosis , 2000, Biometrics.

[139]  Joseph E. Cavanaugh,et al.  An improved Akaike information criterion for state-space model selection , 2006, Comput. Stat. Data Anal..

[140]  Patrick J. Sullivan,et al.  A Kalman filter approach to catch-at-length analysis , 1992 .

[141]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[142]  Roland Langrock,et al.  Flexible and practical modeling of animal telemetry data: hidden Markov models and extensions. , 2012, Ecology.

[143]  Russell B. Millar,et al.  BUGS in Bayesian stock assessments , 1999 .

[144]  M. Plummer,et al.  CODA: convergence diagnosis and output analysis for MCMC , 2006 .

[145]  Anders Nielsen,et al.  Accounting for correlated observations in an age-based state-space stock assessment model , 2015, ICES 2015.

[146]  Perry de Valpine,et al.  Population dynamics of an Arctiid caterpillar-tachinid parasitoid system using state-space models. , 2010, The Journal of animal ecology.

[147]  Nicolai Schipper Jespersen,et al.  An Introduction to Markov Chain Monte Carlo , 2010 .

[148]  C. Cobelli,et al.  Parameter and structural identifiability concepts and ambiguities: a critical review and analysis. , 1980, The American journal of physiology.

[149]  John Sibert,et al.  AD Model Builder: using automatic differentiation for statistical inference of highly parameterized complex nonlinear models , 2012, Optim. Methods Softw..

[150]  Radford M. Neal MCMC Using Hamiltonian Dynamics , 2011, 1206.1901.

[151]  James T. Thorson,et al.  Faster estimation of Bayesian models in ecology using Hamiltonian Monte Carlo , 2017 .

[152]  Matteo Fasiolo,et al.  ABC in Ecological Modelling , 2018 .

[153]  Benjamin M. Bolker,et al.  Ecological Models and Data in R , 2008 .

[154]  Russell B. Millar Reference priors for Bayesian fisheries models , 2002 .

[155]  Sophia Rabe-Hesketh,et al.  Bayesian Comparison of Latent Variable Models: Conditional Versus Marginal Likelihoods , 2018, Psychometrika.