Centering categorical predictors in multilevel models: Best practices and interpretation.

The topic of centering in multilevel modeling (MLM) has received substantial attention from methodologists, as different centering choices for lower-level predictors present important ramifications for the estimation and interpretation of model parameters. However, the centering literature has focused almost exclusively on continuous predictors, with little attention paid to whether and how categorical predictors should be centered, despite their ubiquity across applied fields. Alongside this gap in the methodological literature, a review of applied articles showed that researchers center categorical predictors infrequently and inconsistently. Algebraically and statistically, continuous and categorical predictors behave the same, but researchers using them do not, and for many, interpreting the effects of categorical predictors is not intuitive. Thus, the goals of this tutorial article are twofold: to clarify why and how categorical predictors should be centered in MLM, and to explain how multilevel regression coefficients resulting from centered categorical predictors should be interpreted. We first provide algebraic support showing that uncentered coding variables result in a conflated blend of the within- and between-cluster effects of a multicategorical predictor, whereas appropriate centering techniques yield level-specific effects. Next, we provide algebraic derivations to illuminate precisely how the within- and between-cluster effects of a multicategorical predictor should be interpreted under dummy, contrast, and effect coding schemes. Finally, we provide a detailed demonstration of our conclusions with an empirical example. Implications for practice, including relevance of our findings to categorical control variables (i.e., covariates), interaction terms with categorical focal predictors, and multilevel latent variable models, are discussed. (PsycInfo Database Record (c) 2021 APA, all rights reserved).

[1]  J. Faraway Categorical Predictors , 2021, Linear Models with Python.

[2]  Atanu Bhattacharjee Longitudinal Data Analysis , 2020, Bayesian Approaches in Oncology Using R and OpenBUGS.

[3]  Jason D Rights,et al.  The danger of conflating level-specific effects of control variables when primary interest lies in level-2 effects. , 2019, The British journal of mathematical and statistical psychology.

[4]  Jason D Rights,et al.  Quantifying explained variance in multilevel models: An integrative framework for defining R-squared measures. , 2019, Psychological methods.

[5]  B. Muthén,et al.  Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models , 2018, Structural Equation Modeling: A Multidisciplinary Journal.

[6]  Per B. Brockhoff,et al.  lmerTest Package: Tests in Linear Mixed Effects Models , 2017 .

[7]  Craig K. Enders,et al.  Centering Predictor Variables in Three-Level Contextual Models , 2017, Multivariate behavioral research.

[8]  Kenneth C Land,et al.  Multicollinearity in hierarchical linear models. , 2015, Social science research.

[9]  Lesa Hoffman,et al.  Longitudinal Analysis: Modeling Within-Person Fluctuation and Change , 2014 .

[10]  Ulrich Frank,et al.  Multilevel Modeling , 2014, Business & Information Systems Engineering.

[11]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[12]  Jeffrey S. Simonoff,et al.  The SAGE Handbook of Multilevel Modeling , 2013 .

[13]  Yaping Gong,et al.  A Multilevel Model of Team Goal Orientation, Information Exchange, and Creativity , 2013 .

[14]  T. Little,et al.  Effectiveness of the KiVa Antibullying Program: Grades 1-3 and 7-9 , 2013 .

[15]  Marc A. Brackett,et al.  Classroom emotional climate, student engagement, and academic achievement , 2012 .

[16]  A. Kärnä,et al.  Effectiveness of the KiVa Antibullying Program , 2012 .

[17]  Eileen G. Merritt,et al.  The Contribution of Teachers' Emotional Support to Children's Social Behaviors and Self-Regulatory Skills in First Grade , 2012 .

[18]  Wilhelm Hofmann,et al.  Everyday temptations: an experience sampling study of desire, conflict, and self-control. , 2012, Journal of personality and social psychology.

[19]  D. A. Kenny,et al.  Treating stimuli as a random factor in social psychology: a new and comprehensive solution to a pervasive but largely ignored problem. , 2012, Journal of personality and social psychology.

[20]  Samuel Aryee,et al.  Impact of high-performance work systems on individual- and branch-level performance: test of a multilevel model of intermediate linkages. , 2012, The Journal of applied psychology.

[21]  Craig K. Enders,et al.  Effects of Cognitive Strategy Instruction on Math Problem Solving of Middle School Students With Learning Disabilities* , 2011 .

[22]  Alex J. Bowers,et al.  Does High School Facility Quality Affect Student Achievement?: A Two-Level Hierarchical Linear Model , 2011 .

[23]  C. Vandenberghe,et al.  A Multilevel Model of Transformational Leadership and Adaptive Performance and the Moderating Role of Climate for Innovation , 2010 .

[24]  Marcia C. Linn,et al.  An investigation of teacher impact on student inquiry science performance using a hierarchical linear model , 2010 .

[25]  Kristopher J Preacher,et al.  A general multilevel SEM framework for assessing multilevel mediation. , 2010, Psychological methods.

[26]  S. Son,et al.  Parent-school relationships and children's academic and social outcomes in public school pre-kindergarten. , 2010, Journal of school psychology.

[27]  Elisa Poskiparta,et al.  Vulnerable Children in Varying Classroom Contexts: Bystanders' Behaviors Moderate the Effects of Risk Factors on Victimization , 2010 .

[28]  James L Peugh,et al.  A practical guide to multilevel modeling. , 2010, Journal of school psychology.

[29]  Stephen W. Raudenbush,et al.  Adaptive Centering with Random Effects: An Alternative to the Fixed Effects Model for Studying Time-Varying Treatments in School Settings , 2009, Education Finance and Policy.

[30]  Ulrich Trautwein,et al.  Predicting homework motivation and homework effort in six school subjects: The role of person and family characteristics, classroom factors, and school track , 2009 .

[31]  A. Elliot,et al.  The joint influence of personal achievement goals and classroom goal structures on achievement-relevant outcomes. , 2009 .

[32]  Ulrich Trautwein,et al.  Assessing the impact of learning environments: How to use student ratings of classroom or school characteristics in multilevel modeling , 2009 .

[33]  Martijn van de Pol,et al.  A simple method for distinguishing within- versus between-subject effects using mixed models , 2009, Animal Behaviour.

[34]  Reginald S. Lee,et al.  Multilevel Modeling: A Review of Methodological Issues and Applications , 2009 .

[35]  Thomas D. Fletcher,et al.  The influence of work‐family culture and workplace relationships on work interference with family: a multilevel model , 2008 .

[36]  B. Muthén,et al.  The multilevel latent covariate model: a new, more reliable approach to group-level effects in contextual studies. , 2008, Psychological methods.

[37]  Craig K. Enders,et al.  Centering predictor variables in cross-sectional multilevel models: a new look at an old issue. , 2007, Psychological methods.

[38]  Daniel J. Bauer,et al.  Computational Tools for Probing Interactions in Multiple Linear Regression, Multilevel Modeling, and Latent Curve Analysis , 2006 .

[39]  Charles E. McCulloch,et al.  Separating between‐ and within‐cluster covariate effects by using conditional and partitioning methods , 2006 .

[40]  A. O'connell,et al.  Growing readers : A hierarchical linear model of children's reading growth during the first 2 years of school , 2006 .

[41]  R. Dodhia A Review of Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences (3rd ed.) , 2005 .

[42]  Neal Schmitt,et al.  A dynamic multilevel model of demographic diversity and misfit effects. , 2005, The Journal of applied psychology.

[43]  R. Fouladi,et al.  The Effect of Multicollinearity on Multilevel Modeling Parameter Estimates and Standard Errors , 2003 .

[44]  Michael K Parides,et al.  Separation of individual‐level and cluster‐level covariate effects in regression analysis of correlated data , 2003, Statistics in medicine.

[45]  A. Roux,et al.  A glossary for multilevel analysis , 2002, Journal of epidemiology and community health.

[46]  Vincent J. Carey,et al.  Mixed-Effects Models in S and S-Plus , 2001 .

[47]  F. Earls,et al.  Assessing exposure to violence using multiple informants: application of hierarchical linear model. , 2000, Journal of child psychology and psychiatry, and allied disciplines.

[48]  Julia H. Littell,et al.  A Multilevel Model of Client Participation in Intensive Family Preservation Services , 2000, Social Service Review.

[49]  P. Onghena,et al.  The effect of different centering methods in multilevel analysis , 1999 .

[50]  D. Hofmann,et al.  Centering Decisions in Hierarchical Linear Models: Implications for Research in Organizations , 1998 .

[51]  J. Kalbfleisch,et al.  Between- and within-cluster covariate effects in the analysis of clustered data. , 1998, Biometrics.

[52]  Gary H. McClelland,et al.  Optimal design in psychological research. , 1997 .

[53]  L. Skovgaard NONLINEAR MODELS FOR REPEATED MEASUREMENT DATA. , 1996 .

[54]  Stephen W. Raudenbush,et al.  The Estimation of School Effects , 1995 .

[55]  S. Raudenbush,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[56]  D. A. Kenny,et al.  Separating individual and group effects , 1985 .

[57]  K. Sirotnik PSYCHOMETRIC IMPLICATIONS OF THE UNIT‐OF‐ANALYSIS PROBLEM (WITH EXAMPLES FROM THE MEASUREMENT OF ORGANIZATIONAL CLIMATE) , 1980 .

[58]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[59]  Kjell Harnqvist,et al.  Primary Mental Abilities at Collective and Individual Levels. , 1978 .

[60]  Lee J. Cronbach,et al.  Research on Classrooms and Schools: Formulation of Questions, Design and Analysis. , 1976 .

[61]  Noreen M. Webb,et al.  Between-class and within-class effects in a reported aptitude * treatment interaction: Reanalysis of a study by G. L. Anderson. , 1975 .

[62]  W. S. Robinson,et al.  Ecological correlations and the behavior of individuals. , 1950, International journal of epidemiology.

[63]  Multilevel Model , 2021, Encyclopedia of Gerontology and Population Aging.

[64]  Leonardo Grilli,et al.  A handful of critical choices in multilevel modelling , 2018 .

[65]  Tobias Aufenanger Treatment Allocation for Linear Models with Covariate Information , 2017 .

[66]  Jennifer L. Lowman,et al.  Group-mean-centering independent variables in multi-level models is dangerous , 2017 .

[67]  Zhen Zhang,et al.  Multilevel structural equation models for assessing moderation within and across levels of analysis. , 2016, Psychological methods.

[68]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[69]  Patrick Carl,et al.  The Effects of Multicollinearity in Multilevel Models , 2013 .

[70]  Craig K. Enders,et al.  Centering predictors and contextual effects , 2013 .

[71]  Steven B. Sheldon,et al.  School and home connections and children's kindergarten achievement gains: The mediating role of family involvement , 2012 .

[72]  J. Nezlek Multilevel modeling analyses of diary-style data. , 2012 .

[73]  J. Nezlek Multilevel modeling for psychologists. , 2012 .

[74]  M. Mehl,et al.  Handbook of research methods for studying daily life , 2012 .

[75]  Ulrich Trautwein,et al.  Students emotions during homework in mathematics: Testing a theoretical model of antecedents and ac , 2011 .

[76]  Daniel J. Bauer,et al.  The disaggregation of within-person and between-person effects in longitudinal models of change. , 2011, Annual review of psychology.

[77]  Dishan Kamdar,et al.  Speaking up in groups: a cross-level study of group voice climate and voice. , 2011, The Journal of applied psychology.

[78]  Martijn P. F. Berger,et al.  Optimal Designs for Multilevel Studies , 2008 .

[79]  Bengt Muthén,et al.  Constructing Covariates in Multilevel Regression , 2007 .

[80]  Roel Bosker,et al.  Multilevel analysis : an introduction to basic and advanced multilevel modeling , 1999 .

[81]  Jan de Leeuw,et al.  Introducing Multilevel Modeling , 1998 .

[82]  J. Leeuw,et al.  The Effect of Different Forms of Centering in Hierarchical Linear Models. , 1995, Multivariate behavioral research.

[83]  Lindsay Paterson,et al.  Socio-economic status and educational attainment: a multi-dimensional and multi-level study , 1991 .

[84]  G R Grice,et al.  Dependence of empirical laws upon the source of experimental variation. , 1966, Psychological bulletin.