A Statistical Taylor Theorem and Extrapolation of Truncated Densities

We show a statistical version of Taylor’s theorem and apply this result to non-parametric density estimation from truncated samples, which is a classical challenge in Statistics [Woo85, Stu93]. The single-dimensional version of our theorem has the following implication: “For any distribution P on [0, 1] with a smooth log-density function, given samples from the conditional distribution of P on [a, a + ε] ⊂ [0, 1], we can efficiently identify an approximation to P over the whole interval [0, 1], with quality of approximation that improves with the smoothness of P.” To the best of knowledge, our result is the first in the area of non-parametric density estimation from truncated samples, which works under the hard truncation model, where the samples outside some survival set S are never observed, and applies to multiple dimensions. In contrast, previous works assume single dimensional data where each sample has a different survival set S so that samples from the whole support will ultimately be collected. 1 1Accepted for presentation at the Conference on Learning Theory (COLT) 2021. 1 ar X iv :2 10 6. 15 90 8v 1 [ m at h. ST ] 3 0 Ju n 20 21

[1]  John C. W. Rayner,et al.  Smooth test of goodness of fit , 1989 .

[2]  G. S. Maddala,et al.  Limited Dependent Variable Models Using Panel Data , 1987 .

[3]  Qi Li,et al.  Nonparametric Econometrics: Theory and Practice , 2006 .

[4]  V. Hajivassiliou,et al.  Smooth unbiased multivariate probability simulators for maximum likelihood estimation of limited dependent variable models , 1993 .

[5]  Suleyman S. Kozat,et al.  Online Density Estimation of Nonstationary Sources Using Exponential Family of Distributions , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Xiaohong Chen,et al.  Efficient Estimation of Semiparametric Multivariate Copula Models Efficient Estimation of Semiparametric Multivariate Copula Models * , 2004 .

[7]  A. Barron,et al.  APPROXIMATION OF DENSITY FUNCTIONS BY SEQUENCES OF EXPONENTIAL FAMILIES , 1991 .

[8]  Dirk P. Kroese,et al.  Kernel density estimation via diffusion , 2010, 1011.2602.

[9]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[10]  Piotr Kokoszka,et al.  Non–Parametric Econometrics , 2012 .

[11]  Alexander J. Smola,et al.  Kernel methods and the exponential family , 2006, ESANN.

[12]  J. G. Pierce,et al.  Geometric Algorithms and Combinatorial Optimization , 2016 .

[13]  James Renegar,et al.  On the Computational Complexity of Approximating Solutions for Real Algebraic Formulae , 1992, SIAM J. Comput..

[14]  A. Carbery,et al.  Distributional and L-q norm inequalities for polynomials over convex bodies in R-n , 2001 .

[15]  James Renegar,et al.  On the Computational Complexity and Geometry of the First-Order Theory of the Reals, Part III: Quantifier Elimination , 1992, J. Symb. Comput..

[16]  Christos Tzamos,et al.  Efficient Truncated Statistics with Unknown Truncation , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[17]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[18]  Zhiliang Ying,et al.  Estimating a distribution function with truncated and censored data , 1991 .

[19]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[20]  László Györfi,et al.  Distribution estimation consistent in total variation and in two types of information divergence , 1992, IEEE Trans. Inf. Theory.

[21]  I. Good Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[22]  Alexandre B. Tsybakov,et al.  Introduction to Nonparametric Estimation , 2008, Springer series in statistics.

[23]  J. Simonoff Smoothing Methods in Statistics , 1998 .

[24]  Russell Greiner,et al.  Consistency and Generalization Bounds for Maximum Entropy Density Estimation , 2013, Entropy.

[25]  Winfried Stute,et al.  Almost Sure Representations of the Product-Limit Estimator for Truncated Data , 1993 .

[26]  Ximing Wu Exponential Series Estimator of Multivariate Densities , 2007 .

[27]  Alfred O. Hero,et al.  Entropy estimation using the principle of maximum entropy , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  C. B. Morgan Truncated and Censored Samples, Theory and Applications , 1993 .

[29]  Aapo Hyvärinen,et al.  Density Estimation in Infinite Dimensional Exponential Families , 2013, J. Mach. Learn. Res..

[30]  Ankit Garg,et al.  Classical Lower Bounds from Quantum Upper Bounds , 2018, 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).

[31]  Christos Tzamos,et al.  Computationally and Statistically Efficient Truncated Regression , 2020, COLT.

[32]  何书元 ESTIMATING A DISTRIBUTION FUNCTION WITH TRUNCATED DATA , 1994 .

[33]  Rocco A. Servedio,et al.  Learning from satisfying assignments under continuous distributions , 2020, SODA.

[34]  W. J. Padgett,et al.  Nonparametric density estimation from censored data , 1984 .

[35]  Christos Tzamos,et al.  Efficient Statistics, in High Dimensions, from Truncated Samples , 2018, 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).

[36]  L. Gajek On the Minimax Value in the Scale Model with Truncated Data , 1988 .

[37]  C. D. Boor,et al.  Polynomial interpolation in several variables , 1994 .

[38]  Constantinos Daskalakis,et al.  A Theoretical and Practical Framework for Regression and Classification from Truncated Samples , 2020, AISTATS.

[39]  Stergios B. Fotopoulos,et al.  All of Nonparametric Statistics , 2007, Technometrics.

[40]  Daniel McDonald,et al.  Minimax Density Estimation for Growing Dimension , 2017, AISTATS.

[41]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .