Inference in High-Dimensional Panel Models With an Application to Gun Control

We consider estimation and inference in panel data models with additive unobserved individual specific heterogeneity in a high-dimensional setting. The setting allows the number of time-varying regressors to be larger than the sample size. To make informative estimation and inference feasible, we require that the overall contribution of the time-varying variables after eliminating the individual specific heterogeneity can be captured by a relatively small number of the available variables whose identities are unknown. This restriction allows the problem of estimation to proceed as a variable selection problem. Importantly, we treat the individual specific heterogeneity as fixed effects which allows this heterogeneity to be related to the observed time-varying variables in an unspecified way and allows that this heterogeneity may differ for all individuals. Within this framework, we provide procedures that give uniformly valid inference over a fixed subset of parameters in the canonical linear fixed effects model and over coefficients on a fixed vector of endogenous variables in panel data instrumental variable models with fixed effects and many instruments. We present simulation results in support of the theoretical developments and illustrate the use of the methods in an application aimed at estimating the effect of gun prevalence on crime rates.

[1]  M. Farrell Robust Inference on Average Treatment Effects with Possibly More Covariates than Observations , 2013, 1309.4686.

[2]  Christian Hansen,et al.  Identification of Marginal Effects in a Nonparametric Correlated Random Effects Model , 2006 .

[3]  G. Chamberlain Asymptotic efficiency in estimation with conditional moment restrictions , 1987 .

[4]  B. M. Pötscher,et al.  CAN ONE ESTIMATE THE UNCONDITIONAL DISTRIBUTION OF POST-MODEL-SELECTION ESTIMATORS? , 2007, Econometric Theory.

[5]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[6]  J. Friedman,et al.  [A Statistical View of Some Chemometrics Regression Tools]: Response , 1993 .

[7]  Christian Hansen,et al.  Grouped effects estimators in fixed effects models , 2016 .

[8]  Matias D. Cattaneo,et al.  ALTERNATIVE ASYMPTOTICS AND THE PARTIALLY LINEAR MODEL WITH MANY REGRESSORS , 2015, Econometric Theory.

[9]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[10]  Victor Chernozhukov,et al.  Central limit theorems and multiplier bootstrap when p is much larger than n , 2012 .

[11]  Norman R. Swanson,et al.  ASYMPTOTIC DISTRIBUTION OF JIVE IN A HETEROSKEDASTIC IV REGRESSION WITH MANY INSTRUMENTS , 2009, Econometric Theory.

[12]  A. Belloni,et al.  SPARSE MODELS AND METHODS FOR OPTIMAL INSTRUMENTS WITH AN APPLICATION TO EMINENT DOMAIN , 2012 .

[13]  E. Duflo,et al.  How Much Should We Trust Differences-in-Differences Estimates? , 2001 .

[14]  C. Hansen Asymptotic properties of a robust variance matrix estimator for panel data when T is large , 2007 .

[15]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[16]  A. Belloni,et al.  Program evaluation with high-dimensional data , 2013 .

[17]  Shuheng Zhou,et al.  25th Annual Conference on Learning Theory Reconstruction from Anisotropic Random Measurements , 2022 .

[18]  Cun-Hui Zhang,et al.  The sparsity and bias of the Lasso selection in high-dimensional linear regression , 2008, 0808.0967.

[19]  Victor Chernozhukov,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011 .

[20]  M. Duggan,et al.  More Guns, More Crime , 2000, Journal of Political Economy.

[21]  Cun-Hui Zhang,et al.  Confidence intervals for low dimensional parameters in high dimensional linear models , 2011, 1110.2563.

[22]  Christian Hansen,et al.  Lasso Methods for Gaussian Instrumental Variables Models , 2010, 1012.1297.

[23]  Tomohiro Ando,et al.  Panel Data Models with Grouped Factor Structure Under Unknown Group Membership , 2013 .

[24]  Benoit Liquet,et al.  Confidence Intervals and Hypothesis Testing , 2013 .

[25]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[26]  M. Rudelson,et al.  On sparse reconstruction from Fourier and Gaussian measurements , 2008 .

[27]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[28]  Lott Jr. John R.,et al.  More Guns, Less Crime , 2010 .

[29]  Christian Hansen,et al.  Instrumental variables estimation with many weak instruments using regularized JIVE , 2014 .

[30]  George Norman,et al.  Instrumental Variables Estimation , 2014 .

[31]  P. Cook,et al.  The Social Costs of Gun Ownership , 2004 .

[32]  Elena Manresa,et al.  Grouped Patterns of Heterogeneity in Panel Data , 2015 .

[33]  Stephen L. Morgan,et al.  Instrumental Variables Regression , 2014 .

[34]  J. Friedman,et al.  A Statistical View of Some Chemometrics Regression Tools , 1993 .

[35]  J. Horowitz,et al.  VARIABLE SELECTION IN NONPARAMETRIC ADDITIVE MODELS. , 2010, Annals of statistics.

[36]  A. Belloni,et al.  Least Squares After Model Selection in High-Dimensional Sparse Models , 2009, 1001.0188.

[37]  Anders Bredahl Kock,et al.  Oracle Inequalities for High-Dimensional Panel Data Models , 2013, 1310.8207.

[38]  Tomohiro Ando,et al.  Asset Pricing with a General Multifactor Structure , 2015 .

[39]  Rosa L. Matzkin,et al.  Cross Section and Panel Data Estimators for Nonseparable Models with Endogenous Regressors , 2005 .

[40]  Adel Javanmard,et al.  Confidence intervals and hypothesis testing for high-dimensional regression , 2013, J. Mach. Learn. Res..

[41]  A. Belloni,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011, 1201.0224.

[42]  Benedikt M. Potscher,et al.  Confidence Sets Based on Sparse Estimators Are Necessarily Large , 2007, 0711.1036.

[43]  Whitney K. Newey,et al.  EFFICIENT INSTRUMENTAL VARIABLES ESTIMATION OF NONLINEAR MODELS , 1990 .

[44]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[45]  A. Belloni,et al.  Inference for High-Dimensional Sparse Econometric Models , 2011, 1201.0220.

[46]  Bing-Yi Jing,et al.  Self-normalized Cramér-type large deviations for independent random variables , 2003 .

[47]  N. Meinshausen,et al.  LASSO-TYPE RECOVERY OF SPARSE REPRESENTATIONS FOR HIGH-DIMENSIONAL DATA , 2008, 0806.0145.

[48]  P. Bickel,et al.  SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR , 2008, 0801.1095.

[49]  J. Stock,et al.  Instrumental Variables Regression with Weak Instruments , 1994 .

[50]  Takeshi Amemiya,et al.  The nonlinear two-stage least-squares estimator , 1974 .

[51]  Michel Verleysen,et al.  High-Dimensional Data , 2007 .

[52]  M. Arellano,et al.  Computing Robust Standard Errors for Within-Groups Estimators , 2009 .