Regression Models for Count Data in R

The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After a review of the conceptual and computational features of these methods, a new implementation of zero-inflated and hurdle regression models in the functions zeroinfl() and hurdle() from the package pscl is introduced. It reuses the design and functionality of the basic R functions, just as the underlying conceptual tools extend the classical models. Both model classes are able to incorporate over-dispersion and excess zeros, two problems that typically occur in count data sets in economics and the social and political sciences, better than their classical counterparts. Cross-section data on the demand for medical care are used to illustrate how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice.
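As a rough illustration of the workflow the abstract describes, the sketch below fits a classical Poisson and negative binomial model alongside the zero-augmented models from pscl. It is a minimal sketch under stated assumptions: the data are simulated here (the variable names y and x, the coefficients and the 30% share of forced zeros are invented for illustration and are not the medical care data used in the paper), and glm.nb() from MASS is assumed as the negative binomial fitter.

```r
# Minimal sketch on simulated data; nothing here reproduces the paper's analysis.
library(MASS)   # glm.nb() for negative binomial regression
library(pscl)   # zeroinfl() and hurdle()

set.seed(1)
n <- 500
x <- rnorm(n)

# Over-dispersed counts from a negative binomial with a log-linear mean ...
mu <- exp(0.5 + 0.8 * x)
y <- rnbinom(n, size = 1.5, mu = mu)
# ... with roughly 30% of observations then forced to zero (excess zeros).
y[runif(n) < 0.3] <- 0L
d <- data.frame(y = y, x = x)

# Classical count models
fm_pois <- glm(y ~ x, data = d, family = poisson)
fm_nb   <- glm.nb(y ~ x, data = d)

# Zero-augmented models: the part after "|" specifies the zero component
fm_zinb   <- zeroinfl(y ~ x | 1, data = d, dist = "negbin")
fm_hurdle <- hurdle(y ~ x | x, data = d, dist = "negbin")

# Inspect one fit and compare all four by AIC
summary(fm_zinb)
AIC(fm_pois, fm_nb, fm_zinb, fm_hurdle)
```

The two-part formula mirrors the two-component structure of the models: the left part describes the count process and the right part the zero process. On real data, which regressors enter each part is a modelling decision rather than something this sketch can settle.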
