Pointwise adaptive kernel density estimation under local approximate differential privacy

We consider non-parametric density estimation in the framework of local approximate differential privacy. In contrast to centralized privacy scenarios with a trusted curator, in the local setup anonymization must be guaranteed already on the individual data owners' side and therefore must precede any data mining tasks. Thus, the published anonymized data should be compatible with as many statistical procedures as possible. We suggest adding Laplace noise and Gaussian processes (both appropriately scaled) to kernel density estimators to obtain approximate differential private versions of the latter ones. We obtain minimax type results over Sobolev classes indexed by a smoothness parameter $s>1/2$ for the mean squared error at a fixed point. In particular, we show that taking the average of private kernel density estimators from $n$ different data owners attains the optimal rate of convergence if the bandwidth parameter is correctly specified. Notably, the optimal convergence rate in terms of the sample size $n$ is $n^{-(2s-1)/(2s+1)}$ under local differential privacy and thus deteriorated to the rate $n^{-(2s-1)/(2s)}$ which holds without privacy restrictions. Since the optimal choice of the bandwidth parameter depends on the smoothness $s$ and is thus not accessible in practice, adaptive methods for bandwidth selection are necessary and must, in the local privacy framework, be performed directly on the anonymized data. We address this problem by means of a variant of Lepski's method tailored to the privacy setup and obtain general oracle inequalities for private kernel density estimators. In the Sobolev case, the resulting adaptive estimator attains the optimal rate of convergence at least up to extra logarithmic factors.

[1]  Cynthia Dwork,et al.  Differential Privacy: A Survey of Results , 2008, TAMC.

[2]  C. Butucea Exact adaptive pointwise estimation on Sobolev classes of densities , 2001 .

[3]  A. Tsybakov,et al.  Introduction à l'estimation non-paramétrique , 2003 .

[4]  Cynthia Dwork,et al.  Differential Privacy , 2006, ICALP.

[5]  V. Spokoiny,et al.  Optimal pointwise adaptive methods in nonparametric estimation , 1997 .

[6]  Angelika Rohde,et al.  Geometrizing rates of convergence under differential privacy constraints , 2018, 1805.01422.

[7]  Alexandre B. Tsybakov,et al.  Introduction to Nonparametric Estimation , 2008, Springer series in statistics.

[8]  Laurent Cavalier,et al.  On the problem of local adaptive estimation in tomography , 2001 .

[9]  Douglas J. Leith,et al.  Differential privacy in metric spaces: Numerical, categorical and functional data under the one roof , 2015, Inf. Sci..

[10]  O. Lepskii On a Problem of Adaptive Estimation in Gaussian White Noise , 1991 .

[11]  Amandine Dubois,et al.  Local differential privacy: Elbow effect in optimal density estimation and adaptation over Besov ellipsoids , 2019, Bernoulli.

[12]  A. Goldenshluger,et al.  Adaptive estimation of linear functionals in Hilbert scales from indirect white noise observations , 2000 .

[13]  A. Tsybakov Pointwise and sup-norm sharp adaptive estimation of functions on the Sobolev classes , 1998 .

[14]  L. Brown,et al.  A constrained risk inequality with applications to nonparametric functional estimation , 1996 .

[15]  John C. Duchi,et al.  Privacy and Statistical Risk: Formalisms and Minimax Bounds , 2014, ArXiv.

[16]  A. Berlinet,et al.  Reproducing kernel Hilbert spaces in probability and statistics , 2004 .

[17]  Martin J. Wainwright,et al.  Minimax Optimal Procedures for Locally Private Estimation , 2016, ArXiv.

[18]  D. Donoho,et al.  Renormalization Exponents and Optimal Pointwise Rates of Convergence , 1992 .

[19]  L. Wasserman,et al.  A Statistical Framework for Differential Privacy , 2008, 0811.2501.

[20]  Larry A. Wasserman,et al.  Differential privacy for functions and functional data , 2012, J. Mach. Learn. Res..

[21]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.