Fast and wild: Bootstrap inference in Stata using boottest

The wild bootstrap was originally developed for regression models with heteroskedasticity of unknown form. Over the past 30 years, it has been extended to models estimated by instrumental variables and maximum likelihood and to ones where the error terms are (perhaps multiway) clustered. Like bootstrap methods in general, the wild bootstrap is especially useful when conventional inference methods are unreliable because large-sample assumptions do not hold. For example, there may be few clusters, few treated clusters, or weak instruments. The package boottest can perform a wide variety of wild bootstrap tests, often at remarkable speed. It can also invert these tests to construct confidence sets. As a postestimation command, boottest works after linear estimation commands, including regress, cnsreg, ivregress, ivreg2, areg, and reghdfe, as well as many estimation commands based on maximum likelihood. Although it is designed to perform the wild cluster bootstrap, boottest can also perform the ordinary (nonclustered) version. Wrappers offer classical Wald, score/Lagrange multiplier, and Anderson–Rubin tests, optionally with (multiway) clustering. We review the main ideas of the wild cluster bootstrap, offer tips for use, explain why it is particularly amenable to computational optimization, state the syntax of boottest, artest, scoretest, and waldtest, and present several empirical examples.

[1]  C. Hansen Asymptotic properties of a robust variance matrix estimator for panel data when T is large , 2007 .

[2]  James G. MacKinnon,et al.  Wild Bootstrap Tests for IV Regression , 2010 .

[3]  James G. MacKinnon,et al.  Wild Bootstrap Inference for Wildly Different Cluster Sizes , 2017 .

[4]  Jean-Marie Dufour,et al.  Some Impossibility Theorems in Econometrics with Applications to Structural and Dynamic Models , 1997 .

[5]  Andrew V. Carter,et al.  Asymptotic Behavior of a t-Test Robust to Cluster Heterogeneity , 2017, Review of Economics and Statistics.

[6]  Kim Christensen,et al.  The Realized Empirical Distribution Function of Stochastic Variance with Application to Goodness-of-Fit Testing , 2018, Journal of Econometrics.

[7]  J. MacKinnon,et al.  Estimation and inference in econometrics , 1994 .

[8]  J. Stock,et al.  Heteroskedasticity-Robust Standard Errors for Fixed Effects Panel Data Regression , 2006 .

[9]  Christopher F. Baum,et al.  Enhanced routines for instrumental variables/GMM estimation and testing , 2007 .

[10]  L. Magnusson,et al.  Bootstrap Methods for Inference with Cluster Sample IV Models , 2014 .

[11]  E. Mammen,et al.  Comparing Nonparametric Versus Parametric Regression Fits , 1993 .

[12]  J. MacKinnon,et al.  Asymptotic theory and wild bootstrap inference with clustered errors , 2019, Journal of Econometrics.

[13]  B. Peng,et al.  Modelling Time-Varying Income Elasticities of Health Care Expenditure for the OECD , 2018 .

[14]  James G. MacKinnon,et al.  Thirty Years of Heteroskedasticity-Robust Inference , 2013 .

[15]  C. Field,et al.  Bootstrapping clustered data , 2007 .

[16]  Yukai Yang,et al.  A mixed-frequency Bayesian vector autoregression with a steady-state prior , 2018 .

[17]  Tue Gørgens,et al.  Threshold Regression with Endogeneity for Short Panels , 2019, Econometrics.

[18]  Ulrich Hounyo,et al.  Inference for Local Distributions at High Sampling Frequencies: A Bootstrap Approach , 2018, Journal of Econometrics.

[19]  Debashis Kushary,et al.  Bootstrap Methods and Their Application , 2000, Technometrics.

[20]  Douglas L. Miller,et al.  Robust Inference With Multiway Clustering , 2011 .

[21]  James Davidson,et al.  Implementing the wild bootstrap using a two-point distribution☆ , 2007 .

[22]  J. MacKinnon Bootstrap and Asymptotic Inference with Multiway Clustering ∗ , 2017 .

[23]  Mikko S. Pakkanen,et al.  State-dependent Hawkes processes and their application to limit order book modelling , 2018, Quantitative Finance.

[24]  M. Schaffer,et al.  WEAKIV: Stata module to perform weak-instrument-robust tests and confidence intervals for instrumental-variable (IV) estimation of linear, probit and tobit models , 2013 .

[25]  Matthew D. Webb Reworking wild bootstrap‐based inference for clustered errors , 2014, Canadian Journal of Economics/Revue canadienne d'économique.

[26]  Jonathan Gruber,et al.  Tax Incentives and the Decision to Purchase Health Insurance: Evidence from the Self-Employed , 1993 .

[27]  S. B. Thompson Simple Formulas for Standard Errors that Cluster by Both Firm and Time , 2009 .

[28]  S. Johansen,et al.  Nonstationary Cointegration in the Fractionally Cointegrated VAR Model , 2018, Journal of Time Series Analysis.

[29]  Steven D. Levitt,et al.  The Effect of Prison Population Size on Crime Rates: Evidence from Prison Overcrowding Litigation , 1995 .

[30]  Erik Christian Montes Schütte In Search of a Job: Forecasting Employment Growth in the US using Google Trends , 2018 .

[31]  J. Kalbfleisch,et al.  The estimating function bootstrap , 2000 .

[32]  J. MacKinnon Bootstrap Hypothesis Testing , 2007 .

[33]  G. Mirone Cross-Sectional Noise Reduction and More Efficient Estimation of Integrated Variance , 2018 .

[34]  Tom Engsted,et al.  Disappearing Money Illusion , 2018 .

[35]  M. Schaffer XTIVREG2: Stata module to perform extended IV/2SLS, GMM and AC/HAC, LIML and k-class regression for panel data models , 2012 .

[36]  F. Eicker Asymptotic Normality and Consistency of the Least Squares Estimators for Families of Linear Regressions , 1963 .

[37]  J. Zidek,et al.  A bootstrap based on the estimating equations of the linear model , 1995 .

[38]  D. Roodman Fitting Fully Observed Recursive Mixed-process Models with cmp , 2011 .

[39]  Timothy G. Conley,et al.  Inference with “Difference in Differences” with a Small Number of Policy Changes , 2005, The Review of Economics and Statistics.

[40]  Changbao Wu,et al.  Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[41]  Matthew D. Webb,et al.  Pitfalls when Estimating Treatment Effects Using Clustered Data , 2017 .

[42]  J. Morduch,et al.  The Impact of Microcredit on the Poor in Bangladesh: Revisiting the Evidence , 2009 .

[43]  Jeffrey M. Woodbridge Econometric Analysis of Cross Section and Panel Data , 2002 .

[44]  J. MacKinnon WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS , 2016, L'Actualité économique.

[45]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[46]  Emilio Zanetti Chini,et al.  Forecasters’ utility and forecast coherence , 2018 .

[47]  K. Hadri,et al.  Diffusion copulas: Identification and estimation , 2020, Journal of Econometrics.

[48]  Konrad Menzel,et al.  Bootstrap with Clustering in Two or More Dimensions , 2017, 1703.03043.

[49]  S. Michalopoulos,et al.  Pre-Colonial Ethnic Institutions and Contemporary African Development , 2012, Econometrica : journal of the Econometric Society.

[50]  Massimiliano Caporin,et al.  A multilevel factor approach for the analysis of CDS commonality and risk contribution , 2019, Journal of International Financial Markets, Institutions and Money.

[51]  Christopher F. Baum,et al.  Enhanced Routines for Instrumental Variables/Generalized Method of Moments Estimation and Testing , 2007 .

[52]  Emmanuel Flachaire,et al.  The wild bootstrap, tamed at last , 2001 .

[53]  L. Bauwens,et al.  State-Space Models on the Stiefel Manifold with a New Approach to Nonlinear Filtering , 2018, Econometrics.

[54]  E. Duflo,et al.  How Much Should We Trust Differences-in-Differences Estimates? , 2001 .

[55]  Matthew D. Webb,et al.  The Wild Bootstrap for Few (Treated) Clusters , 2018 .

[56]  Patrick M. Kline,et al.  A Score Based Approach to Wild Bootstrap Inference , 2010 .

[57]  S. Khandker,et al.  The impact of Group‐Based Credit Programs on Poor Households in Bangladesh: Does the Gender of Participants Matter? , 1998, Journal of Political Economy.

[58]  William Gould Mata Matters: Stata in Mata , 2010 .

[59]  Regina Y. Liu Bootstrap Procedures under some Non-I.I.D. Models , 1988 .

[60]  James G. MacKinnon,et al.  Confidence Sets Based on Inverting Anderson–Rubin Tests , 2014 .

[61]  E. Mammen Bootstrap and Wild Bootstrap for High Dimensional Linear Models , 1993 .

[62]  Douglas G. Steigerwald,et al.  Inference for Clustered Data , 2018, The Stata Journal: Promoting communications on statistics and Stata.

[63]  James G. MacKinnon,et al.  THE SIZE DISTORTION OF BOOTSTRAP TESTS , 1999, Econometric Theory.

[64]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[65]  Rudolf Beran Discussion: Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[66]  Timothy G. Conley,et al.  Inference with dependent data using cluster covariance estimators , 2011 .