Robust Generalized Method of Moments: A Finite Sample Viewpoint

For many inference problems in statistics and econometrics, the unknown parameter is identified by a set of moment conditions. A generic method of solving moment conditions is the Generalized Method of Moments (GMM). However, classical GMM estimation is potentially very sensitive to outliers. Robustified GMM estimators have been developed in the past, but suffer from several drawbacks: computational intractability, poor dimension-dependence, and no quantitative recovery guarantees in the presence of a constant fraction of outliers. In this work, we develop the first computationally efficient GMM estimator (under intuitive assumptions) that can tolerate a constant fraction of adversarially corrupted samples, and that has an `2 recovery guarantee of O( √ ). To achieve this, we draw upon and extend a recent line of work on algorithmic robust statistics for related but simpler problems such as mean estimation, linear regression and stochastic optimization. As two examples of the generality of our algorithm, we show how our estimation algorithm and assumptions apply to instrumental variables linear and logistic regression. Moreover, we experimentally validate that our estimator outperforms classical IV regression and two-stage Huber regression on synthetic and semi-synthetic datasets with corruption.

[1]  F. Hampel A General Qualitative Definition of Robustness , 1971 .

[2]  Ilias Diakonikolas,et al.  Efficient Algorithms and Lower Bounds for Robust Linear Regression , 2018, SODA.

[3]  Daniel M. Kane,et al.  Robust Estimators in High Dimensions without the Computational Intractability , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[4]  Tamara Broderick,et al.  An Automatic Finite-Sample Robustness Metric: Can Dropping a Little Data Change Conclusions? , 2020 .

[5]  W. S. Krasker Two-Stage Bounded-Influence Estimators for Simultaneous-Equations Models , 1986 .

[6]  E. Ronchetti,et al.  Robust inference with GMM estimators , 2001 .

[7]  F. Hampel The Influence Curve and Its Role in Robust Estimation , 1974 .

[8]  Joel A. Tropp,et al.  An Introduction to Matrix Concentration Inequalities , 2015, Found. Trends Mach. Learn..

[9]  Tselil Schramm,et al.  Robust Regression Revisited: Acceleration and Improved Estimation Rates , 2021, NeurIPS.

[10]  L. Hansen Large Sample Properties of Generalized Method of Moments Estimators , 1982 .

[11]  Jerry Li,et al.  Sever: A Robust Meta-Algorithm for Stochastic Optimization , 2018, ICML.

[12]  Takeshi Amemiya,et al.  Two Stage Least Absolute Deviations Estimators , 1982 .

[13]  R. Durrett Probability: Theory and Examples , 1993 .

[14]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[15]  Jerry Li,et al.  Being Robust (in High Dimensions) Can Be Practical , 2017, ICML.

[16]  R. Zamar,et al.  A Natural Robustification of the Ordinary Instrumental Variables Estimator , 2013, Biometrics.

[17]  David Card,et al.  Using Geographic Variation in College Proximity to Estimate the Return to Schooling , 1993 .

[18]  Ainesh Bakshi,et al.  Robust linear regression: optimal rates in polynomial time , 2020, STOC.