Semiparametric bivariate zero‐inflated Poisson models with application to studies of abundance for multiple species

Ecological studies involving counts of abundance, presence–absence or occupancy rates often produce data having a substantial proportion of zeros. Furthermore, these types of processes are typically multivariate and only adequately described by complex nonlinear relationships involving externally measured covariates. Ignoring these aspects of the data and implementing standard approaches can lead to models that fail to provide adequate scientific understanding of the underlying ecological processes, possibly resulting in a loss of inferential power. One method of dealing with data having excess zeros is to consider the class of univariate zero-inflated generalized linear models. However, this class of models fails to address the multivariate and nonlinear aspects associated with the data usually encountered in practice. Therefore, we propose a semiparametric bivariate zero-inflated Poisson model that takes into account both of these data attributes. The general modeling framework is hierarchical Bayes and is suitable for a broad range of applications. We demonstrate the effectiveness of our model through a motivating example on modeling catch per unit area for multiple species using data from the Missouri River Benthic Fishes Study, implemented by the United States Geological Survey. Copyright © 2011 John Wiley & Sons, Ltd.

[1]  Jye-Chyi Lu,et al.  Multivariate zero-inflated Poisson models and their applications , 1999 .

[2]  D. Hall Zero‐Inflated Poisson and Binomial Regression with Random Effects: A Case Study , 2000, Biometrics.

[3]  Scott H. Holan,et al.  Semiparametric geographically weighted response curves with application to site-specific agriculture , 2008 .

[4]  Daniel Filipe Carvalho Miranda Pires,et al.  Fish distribution and abundance in mediterranean streams:the role of habitat quality, spatial context, and movement patterns , 2012 .

[5]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[6]  L. Fahrmeir,et al.  Structured additive regression for overdispersed and zero‐inflated count data , 2006 .

[7]  M. Wand,et al.  Semiparametric Regression: Parametric Regression , 2003 .

[8]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[9]  C. Wikle Hierarchical Models in Environmental Science , 2003 .

[10]  A. M. Schmidt,et al.  Modelling Multivariate Counts Varying Continuously in Space , 2011 .

[11]  Christopher K. Wikle,et al.  Zero-Inflated Modeling of Fish Catch per Unit Area Resulting from Multiple Gears: Application to Channel Catfish and Shovelnose Sturgeon in the Missouri River , 2008 .

[12]  J. Andrew Royle,et al.  Hierarchical Modeling and Inference in Ecology: The Analysis of Data from Populations, Metapopulations and Communities , 2008 .

[13]  Monica Musio,et al.  Bayesian semi-parametric ZIP models with space-time interactions: an application to cancer registry data. , 2010, Mathematical medicine and biology : a journal of the IMA.

[14]  Monica Chiogna,et al.  Semiparametric zero-inflated Poisson models with application to animal abundance studies , 2007 .

[15]  M. Wand,et al.  General design Bayesian generalized linear mixed models , 2006, math/0606491.

[16]  K. R. Clarke,et al.  Statistical Design And Analysis For A Biological Effects Study , 1988 .

[17]  Hongqi Xue,et al.  Semiparametric Analysis of Zero‐Inflated Count Data , 2006, Biometrics.

[18]  Jay M. Ver Hoef,et al.  Space—time zero‐inflated count models of Harbor seals , 2007 .

[19]  D. W. Gladish,et al.  DISTRIBUTION AND HABITAT USE OF THE MISSOURI RIVER AND LOWER YELLOWSTONE RIVER BENTHIC FISHES FROM 1996 TO 1998: A BASELINE FOR FISH COMMUNITY RECOVERY , 2012 .

[20]  J. G. Skellam STUDIES IN STATISTICAL ECOLOGY SPATIAL PATTERN , 1952 .

[21]  Christopher K. Wikle,et al.  Climatological analysis of tornado report counts using a hierarchical Bayesian spatiotemporal model , 2003 .

[22]  Mihoko Minami,et al.  Modeling shark bycatch: The zero-inflated negative binomial regression model with smoothing , 2007, Fisheries Research.

[23]  Ciprian M. Crainiceanu,et al.  Bayesian Analysis for Penalized Spline Regression Using WinBUGS , 2005 .

[24]  Kung-Sik Chan,et al.  Nonparametric Threshold Model of Zero-Inflated Spatio-Temporal Data with Application to Shifts in Jellyfish Distribution , 2011 .

[25]  C. Robert,et al.  Bayesian Modeling Using WinBUGS , 2009 .

[26]  Hugh P Possingham,et al.  Zero tolerance ecology: improving ecological inference by modelling the source of zero observations. , 2005, Ecology letters.

[27]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[28]  David Ruppert,et al.  Semiparametric regression during 2003-2007. , 2009, Electronic journal of statistics.

[29]  Eric R. Ziegel,et al.  Multivariate Statistical Modelling Based on Generalized Linear Models , 2002, Technometrics.

[30]  Getachew A Dagne,et al.  Bayesian semiparametric zero-inflated Poisson model for longitudinal count data. , 2010, Mathematical biosciences.

[31]  N. L. Johnson,et al.  Discrete Multivariate Distributions , 1998 .

[32]  D. Lindenmayer,et al.  Modelling the abundance of rare species: statistical models for counts with extra zeros , 1996 .

[33]  Peter Müller,et al.  Variable Selection and Function Estimation in Additive Nonparametric Regression Using a Data-Based Prior: Comment , 1999 .

[34]  Kui Wang,et al.  A zero-inflated Poisson mixed model to analyze diagnosis related groups with majority of same-day hospital stays , 2002, Comput. Methods Programs Biomed..

[35]  Diane Lambert,et al.  Zero-inflacted Poisson regression, with an application to defects in manufacturing , 1992 .

[36]  Noel A Cressie,et al.  Statistics for Spatio-Temporal Data , 2011 .

[37]  C. Crainiceanu,et al.  Semiparametric Regression in Capture–Recapture Modeling , 2006, Biometrics.

[38]  Joseph Anthony Navarro,et al.  STUDIES IN STATISTICAL ECOLOGY , 1955 .

[39]  Efthymios G. Tsionas,et al.  BAYESIAN MULTIVARIATE POISSON REGRESSION , 2001 .

[40]  Corinna Gries,et al.  Bivariate Zero-Inflated Regression for Count Data: A Bayesian Approach with Application to Plant Counts , 2010, The international journal of biostatistics.

[41]  S. Kocherlakota,et al.  Bivariate discrete distributions , 1992 .

[42]  A. Arab,et al.  Hierarchical spatio-temporal models for environmental processes , 2007 .