Model selection criteria in beta regression with varying dispersion

ABSTRACT We address the issue of model selection in beta regressions with varying dispersion. The model consists of two submodels, namely: for the mean and for the dispersion. Our focus is on the selection of the covariates for each submodel. Our Monte Carlo evidence reveals that the joint selection of covariates for the two submodels is not accurate in finite samples. We introduce two new model selection criteria that explicitly account for varying dispersion and propose a fast two step model selection scheme which is considerably more accurate and is computationally less costly than usual joint model selection. Monte Carlo evidence is presented and discussed. We also present the results of an empirical application.

[1]  Francisco Cribari-Neto,et al.  Detecting Model Misspecification in Inflated Beta Regressions , 2014, Commun. Stat. Simul. Comput..

[2]  Francisco Cribari-Neto,et al.  On testing inference in beta regressions , 2014 .

[3]  Michael Smithson,et al.  Evaluating Predictors of Dispersion: A Comparison of Dominance Analysis and Bayesian Model Averaging , 2013, Psychometrika.

[4]  Saskia A Rutjes,et al.  Virus transfer proportions between gloved fingertips, soft berries, and lettuce, and associated health risks. , 2013, International journal of food microbiology.

[5]  Tatiene C. Souza,et al.  Testing inference in variable dispersion beta regressions , 2012 .

[6]  Huiqiong Li,et al.  Variable selection for joint mean and dispersion models of the inverse Gaussian distribution , 2012 .

[7]  Francisco Cribari-Neto,et al.  Diagnostic tools in beta regression with varying dispersion , 2011 .

[8]  Liu-Cang Wu,et al.  Variable selection for joint mean and dispersion models of the inverse Gaussian distribution , 2011, Metrika.

[9]  L. Frazer,et al.  Toward Parsimony in Shoreline Change Prediction (I): Basis Function Methods , 2009 .

[10]  Jun Shao,et al.  Generalized linear model selection using R2 , 2008 .

[11]  Wagner Barreto-Souza,et al.  Improved estimators for a general class of beta regression models , 2008, Comput. Stat. Data Anal..

[12]  Cesar Zucco,et al.  The President's ‘New’ Constituency: Lula and the Pragmatic Vote in Brazil's 2006 Presidential Elections* , 2008, Journal of Latin American Studies.

[13]  R. Rigby,et al.  Generalized Additive Models for Location Scale and Shape (GAMLSS) in R , 2007 .

[14]  Kristen Pammer,et al.  The Contribution of Visual Sensitivity, Phonological Processing, and Nonverbal IQ to Children's Reading , 2007 .

[15]  Joseph E. Cavanaugh,et al.  An improved Akaike information criterion for state-space model selection , 2006, Comput. Stat. Data Anal..

[16]  Michael Smithson,et al.  A better lemon squeezer? Maximum-likelihood regression with beta-distributed dependent variables. , 2006, Psychological methods.

[17]  S. Ferrari,et al.  Beta Regression for Modelling Rates and Proportions , 2004 .

[18]  B. McCullough,et al.  Regression analysis of variates observed on (0, 1): percentages, proportions and fractions , 2003 .

[19]  E. Ziegel Generalized Linear Models , 2002, Technometrics.

[20]  Chih-Ling Tsai,et al.  Regression model selection—a residual likelihood approach , 2002 .

[21]  Martina Mittlböck,et al.  Explained Variation for Logistic Regression - Small Sample Adjustments, Confidence Intervals and Predictive Precision , 2002 .

[22]  Errol C. Caby Regressiona nd Time SeriesM odel Selectionb , 2000, Technometrics.

[23]  C. Mallows Some Comments on Cp , 2000, Technometrics.

[24]  Gordon K. Smyth,et al.  Adjusted likelihood methods for modelling dispersion in generalized linear models , 1999 .

[25]  Allan D R McQuarrie,et al.  A small-sample correction for the Schwarz SIC model selection criterion , 1999 .

[26]  A. McQuarrie,et al.  Regression and Time Series Model Selection , 1998 .

[27]  Robert H. Shumway,et al.  The model selection criterion AICu , 1997 .

[28]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[29]  J. S. Long,et al.  Regression Models for Categorical and Limited Dependent Variables , 1997 .

[30]  J. Shao Bootstrap Model Selection , 1996 .

[31]  Carlos Alberto de Bragança Pereira,et al.  ON IDENTIFIABILITY OF PARAMETRIC STATISTICAL MODELS , 1994 .

[32]  Scott Gates,et al.  Donut Shops and Speed Traps: Evaluating Models of Supervision on Police Behavior , 1993 .

[33]  N. Nagelkerke,et al.  A note on a general definition of the coefficient of determination , 1991 .

[34]  John A. Nelder,et al.  Generalized linear models for the analysis of Taguchi-type experiments , 1991 .

[35]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .

[36]  Gordon K. Smyth,et al.  Generalized linear models with varying dispersion , 1989 .

[37]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[38]  H. Akaike A Bayesian analysis of the minimum AIC procedure , 1978 .

[39]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[40]  H. Akaike A new look at the statistical model identification , 1974 .

[41]  T. Rothenberg Identification in Parametric Models , 1971 .

[42]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[43]  S. Pierce,et al.  Electromyographic activity of rectus capitis posterior minor muscles associated with voluntary retraction of the head. , 2014, The spine journal : official journal of the North American Spine Society.

[44]  Joseph E. Cavanaugh,et al.  Bootstrap variants of the Akaike information criterion for mixed model selection , 2008, Comput. Stat. Data Anal..

[45]  Hua Liang,et al.  Improved AIC selection strategy for survival analysis , 2008, Comput. Stat. Data Anal..

[46]  Wei Pan,et al.  Bootstrapping Likelihood for Model Selection with Small Samples , 1998 .

[47]  L. Kwek,et al.  Some comments on , 1998 .

[48]  R. Shibata Asymptotically Efficient Selection of the Order of the Model for Estimating Parameters of a Linear Process , 1980 .

[49]  B. G. Quinn,et al.  The determination of the order of an autoregression , 1979 .

[50]  N. Sugiura Further analysts of the data by akaike' s information criterion and the finite corrections , 1978 .

[51]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[52]  C. L. Mallows Some comments on C_p , 1973 .