A Flexible Regression Model for Count Data

Poisson regression is a popular tool for modeling count data and is applied in a vast array of applications from the social to the physical sciences and beyond. Real data, however, are often over- or under-dispersed and, thus, not conducive to Poisson regression. We propose a regression model based on the Conway-Maxwell-Poisson (CMP) distribution to address this problem. The CMP regression generalizes the well-known Poisson and logistic regression models, and is suitable for fitting count data with a wide range of dispersion levels. With a GLM approach that takes advantage of exponential family properties, we discuss model estimation, inference, diagnostics, and interpretation, and present a test for determining the need for a CMP regression over a standard Poisson regression. We compare the CMP to several alternatives and illustrate its advantages and usefulness using four datasets with varying dispersion.

[1]  Joseph C. Nunes,et al.  The Effect of Product Assortment Changes on Customer Retention , 2005 .

[2]  Sharad Borle,et al.  Computing with the COM-Poisson distribution , 2003 .

[3]  Joseph B. Kadane,et al.  The Timing of Bid Placement and Extent of Multiple Bidding: An Empirical Investigation Using eBay Online Auctions , 2006, math/0609194.

[4]  T. Minka,et al.  A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution , 2005 .

[5]  Ramayya Krishnan,et al.  A Data Disclosure Policy for Count Data Based on the COM-Poisson Distribution , 2006, Manag. Sci..

[6]  Seth D Guikema,et al.  A Flexible Count Data Regression Model for Risk Analysis , 2008, Risk analysis : an official publication of the Society for Risk Analysis.

[7]  J. A. Calvin Regression Models for Categorical and Limited Dependent Variables , 1998 .

[8]  Galit Shmueli,et al.  Conjugate Analysis of the Conway-Maxwell-Poisson Distribution , 2006 .

[9]  Jun Zhu,et al.  On the Generalized Poisson Regression Mixture Model for Mapping Quantitative Trait Loci With Count Data , 2006, Genetics.

[10]  Sharad Borle,et al.  A Model of the Joint Distribution of Purchase Quantity and Timing , 2003 .

[11]  John Hinde,et al.  Compound Poisson Regression Models , 1982 .

[12]  Srinivas Reddy Geedipally,et al.  Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes. , 2008, Accident; analysis and prevention.

[13]  Felix Famoye,et al.  On the Generalized Poisson Regression Model with an Application to Accident Data , 2004, Journal of Data Science.

[14]  Eric R. Ziegel,et al.  An Introduction to Generalized Linear Models , 2002, Technometrics.

[15]  F. Famoye Restricted generalized poisson regression model , 1993 .

[16]  Kirthi Kalyanam,et al.  Deconstructing Each Item's Category Contribution , 2007 .

[17]  Anthony C. Davison,et al.  Regression model diagnostics , 1992 .