1. Overcoming the Reference Category Problem in the Presentation of Statistical Models

Effects of categorical variables in statistical models typically are reported in terms of comparison either with a reference category or with a suitably defined “mean effect,” for reasons of parameter identification. A conventional presentation of estimates and standard errors, but without the full variance-covariance matrix, does not allow subsequent readers either to make inference on a comparison of interest that is not presented or to compare or combine results from different studies where the same variables but different reference levels are used. It is shown how an alternative presentation, in terms of “quasi standard errors,” overcomes this problem in an economical and intuitive way. A primary application is the reporting of effects of categorical predictors, often called factors, in linear and generalized linear models, hazard models, multinomial-response models, generalized additive models, etc. Other applications include the comparison of coefficients between related regression equations—for example, log-odds ratios in a multinomial logit model—and the presentation of multipliers or “scores” in models with multiplicative interaction structure.

[1]  Yvonne Freeh,et al.  An R and S–PLUS Companion to Applied Regression , 2004 .

[2]  David Firth,et al.  Quasi-variances in Xlisp-Stat and on the web , 2000 .

[3]  Rick L. Edgeman,et al.  LISP-STAT: An Object-Oriented Environment for Statistical Computing and Dynamic Graphics , 1992 .

[4]  R. Wong Research in Social Stratification and Mobility , 1993 .

[5]  Summarizing the Results of Fitting Generalized Linear Models to Data from Designed Experiments , 1989 .

[6]  J. Goldthorpe,et al.  Class, Mobility and Merit The Experience of Two British Birth Cohorts , 2001 .

[7]  A G Babiker,et al.  Floating absolute risk: an alternative to relative risk in survival and case-control analysis avoiding an arbitrary reference group. , 1991, Statistics in medicine.

[8]  L. Vallet Quarante années de mobilité sociale en France. L’évolution de la fluidité sociale à la lumière de modèles récents , 1999 .

[9]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[10]  G. Marsaglia,et al.  The Ziggurat Method for Generating Random Variables , 2000 .

[11]  Yu-wei Xie,et al.  THE LOG-MULTIPLICATIVE LAYER EFFECT MODEL FOR COMPARING MOBILITY TABLES* , 1992 .

[12]  R. A. Bradley,et al.  RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS , 1952 .

[13]  K. Riley,et al.  Forty years of social mobility in France : change in social fluidity in the light of recent models , 2001 .

[14]  J. Goldthorpe,et al.  The Constant Flux: A Study of Class Mobility in Industrial Societies , 1993 .

[15]  J. Chimka Categorical Data Analysis, Second Edition , 2003 .