Discretized and Aggregated: Modeling Dive Depth of Harbor Seals from Ordered Categorical Data with Temporal Autocorrelation

Ordered categorical data are pervasive in environmental and ecological data, and often arise from constraints that require discretizing a continuous variable into ordered categories. A great deal of data have been collected toward the study of marine mammal dive behavior using satellite depth recorders (SDRs), which often discretize a continuous variable such as depth. Additionally, data storage or transmission constraints may also necessitate the aggregation of data over time intervals of a specified length. The categorization and aggregation create a time series of ordered multicategory counts for each animal, which present challenges in terms of statistical modeling and practical interpretation. We describe an intuitive strategy for modeling such aggregated, ordered categorical data allowing for inference regarding the category probabilities and a measure of central tendency on the original scale of the data (e.g., meters), along with incorporation of temporal correlation and overdispersion. The strategy extends covariate-specific cutpoint models for ordinal data. We demonstrate the method in an analysis of SDR dive-depth data collected on harbor seals in Alaska. The primary goal of the analysis is to assess the relationship of covariates, such as time of day, with number of dives and maximum depth of dives. We also predict missing values and introduce novel graphical summaries of the data and results.

[1]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[2]  J. Anderson Regression and Ordered Categorical Variables , 1984 .

[3]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[4]  O. Schabenberger The use of ordinal response methodology in forestry , 1995 .

[5]  L. Fahrmeir,et al.  Multivariate statistical modelling based on generalized linear models , 1994 .

[6]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[7]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[8]  G. Tutz,et al.  Semi- and Nonparametric Modeling of Ordinal Data , 2003 .

[9]  Eric R. Ziegel,et al.  Multivariate Statistical Modelling Based on Generalized Linear Models , 2002, Technometrics.

[10]  Jennifer A. Hoeting,et al.  A clipped latent variable model for spatially correlated ordered categorical data , 2010, Comput. Stat. Data Anal..

[11]  A. Agresti,et al.  Analysis of Ordinal Categorical Data. , 1985 .

[12]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[13]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[14]  A. Agresti Analysis of Ordinal Categorical Data: Agresti/Analysis , 2010 .

[15]  Ann A. O'Connell,et al.  Longitudinal Data Analysis , 2013 .

[16]  D. Hedeker,et al.  A Multilevel Thresholds of Change Model for Analysis of Stages of Change Data , 1998 .

[17]  Alan Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[18]  H. Ishwaran Univariate and multirater ordinal cumulative link regression with covariate specific cutpoints , 2000 .

[19]  Joseph V. Ierza Ordinal probit: A generalization , 1985 .

[20]  F. Harrell,et al.  Partial Proportional Odds Models for Ordinal Response Variables , 1990 .

[21]  Devin S Johnson,et al.  Continuous-time correlated random walk model for animal telemetry data. , 2008, Ecology.

[22]  Peter Congdon Applied Bayesian Hierarchical Methods , 2010 .

[23]  Jay M. Ver Hoef,et al.  Fast computing of some generalized linear mixed pseudo-models with temporal autocorrelation , 2010, Comput. Stat..

[24]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[25]  Yonghai Li,et al.  Likelihood analysis of the multivariate ordinal probit regression model for repeated ordinal responses , 2008, Comput. Stat. Data Anal..