Provable Dimension Detection Using Principal Component Analysis

We analyze an algorithm based on principal component analysis (PCA) for detecting the dimension k of a smooth manifold from a set P of point samples. The best running time so far is O(d 2O(k7log k)) by Giesen and Wagner after the adaptive neighborhood graph is constructed. Given the adaptive neighborhood graph, the PCA-based algorithm outputs the true dimension in O(d2O(k)) time, provided that P satisfies a standard sampling condition as in previous results. Our experimental results validate the effectiveness of the approach. A further advantage is that both the algorithm and its analysis can be generalized to the noisy case, in which small perturbations of the samples and a small portion of outliers are allowed.

[1]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[2]  Tamal K. Dey,et al.  Curve and Surface Reconstruction , 2004, Handbook of Discrete and Computational Geometry, 2nd Ed..

[3]  Joachim Giesen,et al.  Shape dimension and intrinsic metric from samples of manifolds with high co-dimension , 2003, SCG '03.

[4]  Jean-Daniel Boissonnat,et al.  Complexity of the delaunay triangulation of points on surfaces the smooth case , 2003, SCG '03.

[5]  B. Parlett The Symmetric Eigenvalue Problem , 1981 .

[6]  J. Friedman,et al.  Computing Betti Numbers via Combinatorial Laplacians , 1996, STOC '96.

[7]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[8]  Sunghee Choi,et al.  The power crust, unions of balls, and the medial axis transform , 2001, Comput. Geom..

[9]  Sunghee Choi,et al.  A Simple Algorithm for Homeomorphic Surface Reconstruction , 2002, Int. J. Comput. Geom. Appl..

[10]  Shing-Tung Yau,et al.  Reducing the Symmetric Matrix Eigenvalue Problem to Matrix Multiplications , 1993, SIAM J. Sci. Comput..

[11]  Keinosuke Fukunaga,et al.  An Algorithm for Finding Intrinsic Dimensionality of Data , 1971, IEEE Transactions on Computers.

[12]  Gene H. Golub,et al.  Matrix computations , 1983 .

[13]  Tamal K. Dey,et al.  Shape Dimension and Approximation from Samples , 2002, SODA '02.

[14]  Tamal K. Dey,et al.  Manifold reconstruction from point samples , 2005, SODA '05.

[15]  Herbert Edelsbrunner,et al.  Algorithms in Combinatorial Geometry , 1987, EATCS Monographs in Theoretical Computer Science.

[16]  Tamal K. Dey,et al.  Provable surface reconstruction from noisy samples , 2006, Comput. Geom..

[17]  Stefan Funke,et al.  Curve reconstruction from noisy samples , 2003, SCG '03.

[18]  Yajun Wang,et al.  Provable dimension detection using principal component analysis , 2005, Symposium on Computational Geometry.

[19]  W. Press Numerical recipes in Fortran 77 : the art of scientific computing : volume 1 of fortran numerical recipes , 1996 .

[20]  Sunghee Choi,et al.  The power crust , 2001, SMA '01.