Learning Structured Distributions From Untrusted Batches: Faster and Simpler

We revisit the problem of learning from untrusted batches introduced by Qiao and Valiant [QV17]. Recently, Jain and Orlitsky [JO19] gave a simple semidefinite programming approach based on the cut-norm that achieves essentially information-theoretically optimal error in polynomial time. Concurrently, Chen et al. [CLM19] considered a variant of the problem where $\mu$ is assumed to be structured, e.g. log-concave, monotone hazard rate, $t$-modal, etc. In this case, it is possible to achieve the same error with sample complexity sublinear in $n$, and they exhibited a quasi-polynomial time algorithm for doing so using Haar wavelets. In this paper, we find an appealing way to synthesize the techniques of [JO19] and [CLM19] to give the best of both worlds: an algorithm which runs in polynomial time and can exploit structure in the underlying distribution to achieve sublinear sample complexity. Along the way, we simplify the approach of [JO19] by avoiding the need for SDP rounding and giving a more direct interpretation of it through the lens of soft filtering, a powerful recent technique in high-dimensional robust estimation. We validate the usefulness of our algorithms in preliminary experimental evaluations.

[1]  Alon Orlitsky,et al.  A General Method for Robust Learning from Batches , 2020, NeurIPS.

[2]  A. Orlitsky,et al.  Robust Learning of Discrete Distributions from Batches , 2019, ArXiv.

[3]  Daniel M. Kane,et al.  Recent Advances in Algorithmic High-Dimensional Robust Statistics , 2019, ArXiv.

[4]  Ankur Moitra,et al.  Efficiently learning structured distributions from untrusted batches , 2019, STOC.

[5]  Samuel B. Hopkins,et al.  Quantum Entropy Scoring for Fast Robust Mean Estimation and Improved Outlier Detection , 2019, NeurIPS.

[6]  Pravesh Kothari,et al.  Robust moment estimation and improved clustering via sum of squares , 2018, STOC.

[7]  Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing , 2018, STOC.

[8]  Jerry Li,et al.  Sever: A Robust Meta-Algorithm for Stochastic Optimization , 2018, ICML.

[9]  Gregory Valiant,et al.  Learning Discrete Distributions from Untrusted Batches , 2017, ITCS.

[10]  Jerry Li,et al.  Mixture models, robustness, and sum of squares proofs , 2017, STOC.

[11]  Gregory Valiant,et al.  Learning Populations of Parameters , 2017, NIPS.

[12]  Pierre McKenzie,et al.  Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing , 2017, STOC.

[13]  Gregory Valiant,et al.  Resilience: A Criterion for Learning in the Presence of Arbitrary Outliers , 2017, ITCS.

[14]  Jerry Li,et al.  Being Robust (in High Dimensions) Can Be Practical , 2017, ICML.

[15]  Pasin Manurangsi,et al.  Almost-polynomial ratio ETH-hardness of approximating densest k-subgraph , 2016, STOC.

[16]  Daniel M. Kane,et al.  Statistical Query Lower Bounds for Robust Estimation of High-Dimensional Gaussians and Gaussian Mixtures , 2016, 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS).

[17]  Gregory Valiant,et al.  Learning from untrusted data , 2016, STOC.

[18]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[19]  Carl M. O’Brien,et al.  Nonparametric Estimation under Shape Constraints: Estimators, Algorithms and Asymptotics , 2016 .

[20]  Santosh S. Vempala,et al.  Agnostic Estimation of Mean and Covariance , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[21]  Daniel M. Kane,et al.  Robust Estimators in High Dimensions without the Computational Intractability , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[22]  Mark J. van der Laan,et al.  Handbook of Big Data , 2016 .

[23]  Ilias Diakonikolas,et al.  Learning Structured Distributions , 2016, Handbook of Big Data.

[24]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[25]  Daniel M. Kane,et al.  A New Approach for Testing Properties of Discrete Distributions , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[26]  Ilias Diakonikolas,et al.  Sample-Optimal Density Estimation in Nearly-Linear Time , 2015, SODA.

[27]  Chinmay Hegde,et al.  Fast and Near-Optimal Algorithms for Approximating Distributions by Histograms , 2015, PODS.

[28]  Rocco A. Servedio,et al.  Near-Optimal Density Estimation in Near-Linear Time Using Variable-Width Histograms , 2014, NIPS.

[29]  Ilias Diakonikolas,et al.  Efficient density estimation via piecewise polynomial approximation , 2013, STOC.

[30]  Ronitt Rubinfeld,et al.  Testing Properties of Collections of Distributions , 2013, Theory Comput..

[31]  Rocco A. Servedio,et al.  Learning mixtures of structured distributions over discrete domains , 2012, SODA.

[32]  D. Steinberg,et al.  Technometrics , 2008 .

[33]  Robert D. Nowak,et al.  Multiscale Poisson Intensity and Density Estimation , 2007, IEEE Transactions on Information Theory.

[34]  Noga Alon,et al.  Approximating the cut-norm via Grothendieck's inequality , 2004, STOC '04.

[35]  Avi Wigderson,et al.  Theory of computing , 1997, SIGACT News.

[36]  Young K. Truong,et al.  Polynomial splines and their tensor products in extended linear modeling: 1994 Wald memorial lecture , 1997 .

[37]  Avi Wigderson,et al.  Theory of computing: a scientific perspective , 1996, CSUR.

[38]  C. J. Stone,et al.  The Use of Polynomial Splines and Their Tensor Products in Multivariate Function Estimation , 1994 .

[39]  R. Lathe Phd by thesis , 1988, Nature.

[40]  I. W. Wright Splines in Statistics , 1983 .

[41]  H. D. Brunk,et al.  Statistical inference under order restrictions : the theory and application of isotonic regression , 1973 .

[42]  P. J. Huber Robust Estimation of a Location Parameter , 1964 .

[43]  W. R. Buckland,et al.  Contributions to Probability and Statistics , 1960 .

[44]  F. J. Anscombe,et al.  Rejection of Outliers , 1960 .

[45]  Carroll Morgan,et al.  Robustness , 2020, Encyclopedia of the UN Sustainable Development Goals.

[46]  Jerry Zheng Li,et al.  Principled approaches to robust machine learning and beyond , 2018 .

[47]  J. Steinhardt Robust learning: information theory and algorithms , 2018 .

[48]  Luc Devroye,et al.  Combinatorial methods in density estimation , 2001, Springer series in statistics.

[49]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[50]  J. Tukey Mathematics and the Picturing of Data , 1975 .

[51]  Thomas S. Ferguson,et al.  On the Rejection of Outliers , 1961 .

[52]  J. Tukey A survey of sampling from contaminated distributions , 1960 .