Density Estimation by Randomized Quasi-Monte Carlo

We consider the problem of estimating the density of a random variable $X$ that can be sampled exactly by Monte Carlo (MC). We investigate the effectiveness of replacing MC by randomized quasi-Monte Carlo (RQMC) or by stratified sampling over the unit cube, to reduce the integrated variance (IV) and the mean integrated square error (MISE) of kernel density estimators. We show theoretically and empirically that the RQMC and stratified estimators can achieve substantial reductions of the IV and the MISE, and even faster convergence rates than MC in some situations, while leaving the bias unchanged. We also show that the variance bounds obtained via a traditional Koksma-Hlawka-type inequality for RQMC are much too loose to be useful once the dimension of the problem is larger than a few. We describe an alternative way to estimate the IV, a good bandwidth, and the MISE under RQMC or stratification, and we show empirically that in some situations the MISE can be reduced significantly even in high-dimensional settings.
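To make the comparison concrete, the following is a minimal sketch, not the authors' implementation, of how the IV of a Gaussian kernel density estimator might be estimated empirically under plain MC and under stratified sampling of the unit interval. Everything specific in it is an assumption chosen for illustration: the one-dimensional logistic target (obtained by inversion, $X = \log(u/(1-u))$), the bandwidth `h = 0.3`, the evaluation interval $[-4, 4]$, the sample size, the number of replications, and all function names.

```python
import numpy as np

def sample_x(u):
    # Hypothetical model: X is logistic, generated by inversion from u in (0, 1).
    return np.log(u / (1.0 - u))

def kde(x_eval, data, h):
    # Gaussian kernel density estimator evaluated at each point of x_eval.
    z = (x_eval[:, None] - data[None, :]) / h
    return np.exp(-0.5 * z**2).sum(axis=1) / (len(data) * h * np.sqrt(2.0 * np.pi))

def uniforms(n, rng, stratified):
    # MC: n independent uniforms. Stratified: one uniform in each of the
    # n equal-width strata [i/n, (i+1)/n).
    v = rng.random(n)
    return (np.arange(n) + v) / n if stratified else v

def empirical_iv(n, h, reps, stratified, rng, a=-4.0, b=4.0, n_eval=64):
    # Pointwise sample variance over independent replications, integrated
    # over [a, b] by averaging over an evenly spaced grid.
    x_eval = np.linspace(a, b, n_eval)
    est = np.empty((reps, n_eval))
    for r in range(reps):
        u = np.clip(uniforms(n, rng, stratified), 1e-12, 1.0 - 1e-12)
        est[r] = kde(x_eval, sample_x(u), h)
    return (b - a) * est.var(axis=0, ddof=1).mean()

rng = np.random.default_rng(12345)
iv_mc = empirical_iv(n=256, h=0.3, reps=50, stratified=False, rng=rng)
iv_st = empirical_iv(n=256, h=0.3, reps=50, stratified=True, rng=rng)
print(f"empirical IV, MC:         {iv_mc:.3e}")
print(f"empirical IV, stratified: {iv_st:.3e}")
```

Both sampling schemes produce points with the same marginal distribution, so the expected value of the estimator at each point, and hence the bias, is the same in both cases; only the variance term differs. This one-dimensional setting is the most favorable one for stratification, so the gap seen here should not be read as indicative of higher-dimensional problems.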
