Robust and Heavy-Tailed Mean Estimation Made Simple, via Regret Minimization

We study the problem of estimating the mean of a distribution in high dimensions when either the samples are adversarially corrupted or the distribution is heavy-tailed. Recent developments in robust statistics have established efficient and (near) optimal procedures for both settings. However, the algorithms developed on each side tend to be sophisticated and do not directly transfer to the other, with many of them having ad-hoc or complicated analyses. In this paper, we provide a meta-problem and a duality theorem that lead to a new unified view on robust and heavy-tailed mean estimation in high dimensions. We show that the meta-problem can be solved either by a variant of the Filter algorithm from the recent literature on robust estimation or by the quantum entropy scoring scheme (QUE), due to Dong, Hopkins and Li (NeurIPS '19). By leveraging our duality theorem, these results translate into simple and efficient algorithms for both robust and heavy-tailed settings. Furthermore, the QUE-based procedure has run-time that matches the fastest known algorithms on both fronts. Our analysis of Filter is through the classic regret bound of the multiplicative weights update method. This connection allows us to avoid the technical complications in previous works and improve upon the run-time analysis of a gradient-descent-based algorithm for robust mean estimation by Cheng, Diakonikolas, Ge and Soltanolkotabi (ICML '20).
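As background for the regret-minimization viewpoint, the following is a minimal illustrative sketch (ours, not the paper's algorithm) of the multiplicative weights update method and the classic regret bound the analysis invokes: after T rounds against loss vectors in [0, 1]^n, the total weighted loss exceeds that of the best single expert by at most roughly ηT + ln(n)/η.

```python
import numpy as np

def multiplicative_weights(losses, eta=0.1):
    """Run the multiplicative weights update over T rounds.

    losses: (T, n) array of per-round expert losses in [0, 1].
    Returns the (T, n) array of weight distributions played each round.
    """
    T, n = losses.shape
    w = np.ones(n)
    history = []
    for t in range(T):
        p = w / w.sum()          # normalize weights to a distribution
        history.append(p)
        w *= np.exp(-eta * losses[t])  # exponential-weights update
    return np.array(history)

# Synthetic losses, just to exercise the bound.
rng = np.random.default_rng(0)
losses = rng.random((200, 5))
P = multiplicative_weights(losses, eta=0.1)

algo_loss = (P * losses).sum()          # total loss of the algorithm
best_expert = losses.sum(axis=0).min()  # total loss of the best fixed expert
# Standard regret guarantee (up to constants):
#   algo_loss - best_expert <= eta * T + ln(n) / eta
```

In the robust-estimation setting, the "experts" correspond to sample points and the weights to how much each point contributes to the empirical mean; the paper's duality theorem makes that correspondence precise.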

[1]  Elad Hazan,et al.  Introduction to Online Convex Optimization , 2016, Found. Trends Optim..

[2]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[3]  Samuel B. Hopkins Mean estimation with sub-Gaussian rates in polynomial time , 2018, The Annals of Statistics.

[4]  Jerry Li,et al.  Sever: A Robust Meta-Algorithm for Stochastic Optimization , 2018, ICML.

[5]  Gregory Valiant,et al.  Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers , 2017, ITCS.

[6]  Henryk Woźniakowski,et al.  Estimating the Largest Eigenvalue by the Power and Lanczos Algorithms with a Random Start , 1992, SIAM J. Matrix Anal. Appl..

[7]  Noga Alon,et al.  The space complexity of approximating the frequency moments , 1996, STOC '96.

[8]  Jacob Steinhardt.  Robust Learning: Information Theory and Algorithms , PhD thesis, Stanford University, 2018 .

[9]  Zeyuan Allen-Zhu,et al.  Spectral Sparsification and Regret Minimization Beyond Matrix Multiplicative Updates , 2015, STOC.

[10]  G. Lugosi,et al.  Robust multivariate mean estimation: The optimality of trimmed mean , 2019, The Annals of Statistics.

[11]  Yu Cheng,et al.  High-Dimensional Robust Mean Estimation via Gradient Descent , 2020, ICML.

[12]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[13]  Santosh S. Vempala,et al.  Agnostic Estimation of Mean and Covariance , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[14]  G. Lugosi,et al.  Sub-Gaussian mean estimators , 2015, 1509.05845.

[15]  Pradeep Ravikumar,et al.  A Unified Approach to Robust Mean Estimation , 2019, ArXiv.

[16]  G. Lugosi,et al.  Sub-Gaussian estimators of the mean of a random vector , 2017, The Annals of Statistics.

[17]  Sanjeev Arora,et al.  The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[18]  A. S. Nemirovsky,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[19]  David P. Woodruff,et al.  Faster Algorithms for High-Dimensional Robust Covariance Estimation , 2019, COLT.

[20]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[21]  Samuel B. Hopkins,et al.  Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection , 2019, NeurIPS.

[22]  M. Ledoux The concentration of measure phenomenon , 2001 .

[23]  Zhixian Lei,et al.  A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates , 2019, COLT.

[24]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[25]  Gerold Alsmeyer,et al.  Chebyshev's Inequality , 2011, International Encyclopedia of Statistical Science.

[26]  Leslie G. Valiant,et al.  Random Generation of Combinatorial Structures from a Uniform Distribution , 1986, Theor. Comput. Sci..

[27]  Jerry Li,et al.  Computationally Efficient Robust Sparse Estimation in High Dimensions , 2017, COLT.

[28]  Jerry Zheng Li,et al.  Principled approaches to robust machine learning and beyond , 2018 .

[29]  Jerry Li,et al.  Being Robust (in High Dimensions) Can Be Practical , 2017, ICML.

[30]  Manfred K. Warmuth,et al.  Randomized Online PCA Algorithms with Regret Bounds that are Logarithmic in the Dimension , 2008 .

[31]  Percy Liang,et al.  Adaptivity and Optimism: An Improved Exponentiated Gradient Algorithm , 2014, ICML.

[32]  Banghua Zhu,et al.  Robust estimation via generalized quasi-gradients , 2020, Information and Inference: A Journal of the IMA.

[33]  Daniel M. Kane,et al.  List-decodable robust mean estimation and learning mixtures of spherical gaussians , 2017, STOC.

[34]  Jerry Li,et al.  Robustly Learning a Gaussian: Getting Optimal Error, Efficiently , 2017, SODA.

[35]  Daniel M. Kane,et al.  Robust Estimators in High Dimensions without the Computational Intractability , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[36]  Jerry Li,et al.  Mixture models, robustness, and sum of squares proofs , 2017, STOC.

[37]  O. Catoni Challenging the empirical mean and empirical variance: a deviation study , 2010, 1009.2048.

[38]  Daniel M. Kane,et al.  Recent Advances in Algorithmic High-Dimensional Robust Statistics , 2019, ArXiv.

[39]  Mark Herbster,et al.  Tracking the Best Linear Predictor , 2001, J. Mach. Learn. Res..

[40]  Weiran Wang,et al.  Projection onto the capped simplex , 2015, ArXiv.

[41]  Ilias Diakonikolas,et al.  Efficient Algorithms and Lower Bounds for Robust Linear Regression , 2018, SODA.

[42]  Yu Cheng,et al.  High-Dimensional Robust Mean Estimation in Nearly-Linear Time , 2018, SODA.

[43]  Peter L. Bartlett,et al.  Fast Mean Estimation with Sub-Gaussian Rates , 2019, COLT.

[44]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[46]  G. Lecué,et al.  Robust sub-Gaussian estimation of a mean vector in nearly linear time , 2019, The Annals of Statistics.

[47]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.