Robust Inference with Multi-Way Clustering

In this paper we propose a new variance estimator for OLS as well as for nonlinear estimators such as logit, probit and GMM, that provcides cluster-robust inference when there is two-way or multi-way clustering that is non-nested. The variance estimator extends the standard cluster-robust variance estimator or sandwich estimator for one-way clustering (e.g. Liang and Zeger (1986), Arellano (1987)) and relies on similar relatively weak distributional assumptions. Our method is easily implemented in statistical packages, such as Stata and SAS, that already offer cluster-robust standard errors when there is one-way clustering. The method is demonstrated by a Monte Carlo analysis for a two-way random effects model; a Monte Carlo analysis of a placebo law that extends the state-year effects example of Bertrand et al. (2004) to two dimensions; and by application to two studies in the empirical public/labor literature where two-way clustering is present.

[1]  Teun Kloek,et al.  OLS Estimation in a Model Where a Microvariable Is Explained by Aggregates and Contemporaneous Disturbances Are Equicorrelated , 1979 .

[2]  Bruce C. Greenwald,et al.  A general analysis of bias in the estimated standard errors of least squares coefficients , 1980 .

[3]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[4]  D. Pfeffermann,et al.  Regression Analysis of Data from a Cluster Sample , 1981 .

[5]  A. Scott,et al.  The Effect of Two-Stage Sampling on Ordinary Least Squares Methods , 1982 .

[6]  H. White,et al.  Nonlinear Regression with Dependent Observations , 1984 .

[7]  H. White Asymptotic theory for econometricians , 1985 .

[8]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[9]  Brent R. Moulton Random group effects and the precision of regression estimates , 1986 .

[10]  Brent R. Moulton An Illustration of a Pitfall in Estimating the Effects of Aggregate Variables on Micro Unit , 1990 .

[11]  Brigitte C. Madrian,et al.  Health Insurance Availability and the Retirement Decision , 1993 .

[12]  W. Rogers Regression standard errors in clustered samples , 1994 .

[13]  R. Shimer The Impact of Young Workers on the Aggregate Labor Market , 1999 .

[14]  Daron Acemoglu,et al.  Minimum Wages and On-the-Job Training , 1999 .

[15]  Timothy G. Conley GMM estimation with cross sectional dependence , 1999 .

[16]  E. Duflo,et al.  How Much Should We Trust Differences-in-Differences Estimates? , 2001 .

[17]  E. Duflo,et al.  How Much Should We Trust Differences-in-Differences Estimates? , 2001 .

[18]  Daniel F. McCaffrey,et al.  Bias reduction in standard errors for linear regression with multi-stage samples , 2002 .

[19]  Jeffrey M. Woodbridge Econometric Analysis of Cross Section and Panel Data , 2002 .

[20]  P. Davis,et al.  Estimating multi-way error components models with unbalanced data structures , 2002 .

[21]  John V. Pepper,et al.  Robust inferences from random clustered samples: an application using data from the panel study of income dynamics , 2002 .

[22]  Jeffrey M. Wooldridge,et al.  Cluster-Sample Methods in Applied Econometrics , 2003 .

[23]  G. Kézdi Robust Standard Error Estimation in Fixed-Effects Panel Models , 2003 .

[24]  Asli Demirgüç-Kunt,et al.  Finance, Firm Size, and Growth , 2004 .

[25]  Debopam Bhattacharya,et al.  Asymptotic inference from multi-stage samples , 2005 .

[26]  Philippe Martin,et al.  Make Trade Not War? , 2005 .

[27]  A. Cameron,et al.  Microeconometrics: Methods and Applications , 2005 .

[28]  A. Colin Cameron,et al.  Estimation of Country-Pair Data Models Controlling for Clustered Errors: with International Trade Applications , 2005 .

[29]  Donald Hedeker,et al.  Longitudinal Data Analysis , 2006 .

[30]  C. Hansen Asymptotic properties of a robust variance matrix estimator for panel data when T is large , 2007 .

[31]  Sophie Shive,et al.  The Impact of Venture Capital Investments On Industry Performance , 2007 .

[32]  Christopher L. Foote Space and Time in Macroeconomic Panel Data: Young Workers and State-Level Unemployment Revisited , 2007 .

[33]  Marcel Fafchamps,et al.  The formation of risk sharing networks , 2007 .

[34]  Kristin E. Smith,et al.  The labor market for direct care workers , 2007 .

[35]  Stephen G. Donald,et al.  Inference with Difference-in-Differences and Other Panel Data , 2007, The Review of Economics and Statistics.

[36]  Patrick J Heagerty,et al.  Marginal modeling of nonnested multilevel data using standard software. , 2006, American journal of epidemiology.

[37]  Joel Peress Product Market Competition, Insider Trading and Stock Market Efficiency , 2008 .

[38]  James P. Weston,et al.  Do Investors Value Smooth Performance? , 2008 .

[39]  Jerry A. Hausman,et al.  Difference in Difference Meets Generalized Least Squares: Higher Order Properties of Hypotheses Tests , 2008 .

[40]  Lamar Pierce,et al.  Ethical Spillovers in Firms: Evidence from Vehicle Emissions Testing , 2008, Manag. Sci..

[41]  K. Mitchener,et al.  The Baring Crisis and the Great Latin American Meltdown of the 1890s , 2008, The Journal of Economic History.