Asymptotic theory for density ridges

The large sample theory of estimators for density modes is well understood. In this paper we consider density ridges, which are a higher-dimensional extension of modes. Modes correspond to zero-dimensional, local high-density regions in point clouds. Density ridges correspond to $s$-dimensional, local high-density regions in point clouds. We establish three main results. First we show that under appropriate regularity conditions, the local variation of the estimated ridge can be approximated by an empirical process. Second, we show that the distribution of the estimated ridge converges to a Gaussian process. Third, we establish that the bootstrap leads to valid confidence sets for density ridges.

[1]  Paul Tseng,et al.  Analysis of Nonsmooth Symmetric-Matrix-Valued Functions with Applications to Semidefinite Complementarity Problems , 2003, SIAM J. Optim..

[2]  Larry A. Wasserman,et al.  Statistical analysis of metric graph reconstruction , 2013, J. Mach. Learn. Res..

[3]  Larry A. Wasserman,et al.  Nonparametric Ridge Estimation , 2012, ArXiv.

[4]  James N. Damon,et al.  Properties of Ridges and Cores for Two-Dimensional Images , 1999, Journal of Mathematical Imaging and Vision.

[5]  Stefan Funke,et al.  Curve reconstruction from noisy samples , 2003, SCG '03.

[6]  A. Rinaldo,et al.  Generalized density clustering , 2009, 0907.3454.

[7]  G. Tutz,et al.  Modelling beyond regression functions: an application of multimodal regression to speed–flow data , 2006 .

[8]  Victor Chernozhukov,et al.  Anti-concentration and honest, adaptive confidence bands , 2013 .

[9]  Steve Oudot,et al.  The Structure and Stability of Persistence Modules , 2012, Springer Briefs in Mathematics.

[10]  Joseph P. Romano On weak convergence and optimality of kernel density estimates of the mode , 1988 .

[11]  Jussi Klemelä,et al.  Visualization of Multivariate Density Estimates With Level Set Trees , 2004 .

[12]  Ilya Zaliapin,et al.  Tokunaga and Horton self-similarity for level set trees of Markov chains , 2011, 1101.2429.

[13]  Kengo Kato,et al.  Gaussian approximation of suprema of empirical processes , 2014 .

[14]  Ming-Yen Cheng,et al.  Estimating gradient trees , 2004 .

[15]  Surajit Ray,et al.  The topography of multivariate normal mixtures , 2005 .

[16]  Larry A. Wasserman,et al.  Manifold Estimation and Singular Deconvolution Under Hausdorff Loss , 2011, ArXiv.

[17]  Liang Peng,et al.  Local likelihood tracking of fault lines and boundaries , 2001 .

[18]  Uwe Einmahl,et al.  An Empirical Process Approach to the Uniform Consistency of Kernel-Type Function Estimators , 2000 .

[19]  Yen-Chi Chen,et al.  Density Level Sets: Asymptotics, Inference, and Visualization , 2015, 1504.05438.

[20]  Deniz Erdogmus,et al.  Locally Defined Principal Curves and Surfaces , 2011, J. Mach. Learn. Res..

[21]  E. Giné,et al.  Rates of strong uniform consistency for multivariate kernel density estimators , 2002 .

[22]  L. Wasserman,et al.  Nonparametric modal regression , 2014, 1412.1716.

[23]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[24]  I. Molchanov Theory of Random Sets , 2005 .

[25]  David H. Eberly,et al.  Ridges in Image and Data Analysis , 1996, Computational Imaging and Vision.

[26]  Frédéric Chazal,et al.  Normal-Map between Normal-Compatible Manifolds , 2007, Int. J. Comput. Geom. Appl..

[27]  G. Stewart On the Perturbation of Pseudo-Inverses, Projections and Linear Least Squares Problems , 1977 .