Minimax-optimal semi-supervised regression on unknown manifolds

We consider semi-supervised regression when the predictor variables are drawn from an unknown manifold. A simple two step approach to this problem is to: (i) estimate the manifold geodesic distance between any pair of points using both the labeled and unlabeled instances; and (ii) apply a k nearest neighbor regressor based on these distance estimates. We prove that given sufficiently many unlabeled points, this simple method of geodesic kNN regression achieves the optimal finite-sample minimax bound on the mean squared error, as if the manifold were known. Furthermore, we show how this approach can be efficiently implemented, requiring only O(k N log N) operations to estimate the regression function at all N labeled and unlabeled points. We illustrate this approach on two datasets with a manifold structure: indoor localization using WiFi fingerprints and facial pose estimation. In both cases, geodesic kNN is more accurate and much faster than the popular Laplacian eigenvector regressor.

[1]  Adam Krzyzak,et al.  A Distribution-Free Theory of Nonparametric Regression , 2002, Springer series in statistics.

[2]  Partha Niyogi,et al.  Manifold regularization and semi-supervised learning: some theoretical analyses , 2013, J. Mach. Learn. Res..

[3]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[4]  John M. Lee Introduction to Smooth Manifolds , 2002 .

[5]  P. Bickel,et al.  Local polynomial regression on unknown manifolds , 2007, 0708.0983.

[6]  Robert D. Nowak,et al.  Unlabeled data: Now it helps, now it doesn't , 2008, NIPS.

[7]  Stephen M. Omohundro,et al.  Five Balltree Construction Algorithms , 2009 .

[8]  Sariel Har-Peled Computing the k Nearest-Neighbors for all Vertices via Dijkstra , 2016, ArXiv.

[9]  Philippe Rigollet,et al.  Generalization Error Bounds in Semi-supervised Classification Under the Cluster Assumption , 2006, J. Mach. Learn. Res..

[10]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[11]  Mikhail Belkin,et al.  Semi-Supervised Learning on Riemannian Manifolds , 2004, Machine Learning.

[12]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[13]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[14]  Martin Erwig,et al.  The graph Voronoi diagram with applications , 2000, Networks.

[15]  Alexandre B. Tsybakov,et al.  Introduction to Nonparametric Estimation , 2008, Springer series in statistics.

[16]  Fei Wang,et al.  Graph-based semi-supervised learning , 2009, Artificial Life and Robotics.

[17]  Israel Cohen,et al.  Single-Site Emitter Localization via Multipath Fingerprinting , 2013, IEEE Transactions on Signal Processing.

[18]  Ling Huang,et al.  An Analysis of the Convergence of Graph Laplacians , 2010, ICML.

[19]  Larry A. Wasserman,et al.  Statistical Analysis of Semi-Supervised Regression , 2007, NIPS.

[20]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[21]  P. Gács,et al.  Algorithms , 1992 .

[22]  Jing Liu,et al.  Survey of Wireless Indoor Positioning Techniques and Systems , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[23]  Lei Shi,et al.  Fast Algorithm for Approximate k-Nearest Neighbor Graph Construction , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[24]  Kaizhu Huang,et al.  Fast kNN Graph Construction with Locality Sensitive Hashing , 2013, ECML/PKDD.

[25]  L. Györfi,et al.  A Distribution-Free Theory of Nonparametric Regression (Springer Series in Statistics) , 2002 .

[26]  Avleen Singh Bijral,et al.  Semi-supervised Learning with Density Based Distances , 2011, UAI.

[27]  Mati Wax,et al.  Single-Site Localization via Maximum Discrimination Multipath Fingerprinting , 2014, IEEE Transactions on Signal Processing.

[28]  Ulrike von Luxburg,et al.  Shortest path distance in random k-nearest neighbor graphs , 2012, ICML.

[29]  Samory Kpotufe,et al.  k-NN Regression Adapts to Local Intrinsic Dimension , 2011, NIPS.

[30]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[31]  Alexander Zien,et al.  Semi-Supervised Classification by Low Density Separation , 2005, AISTATS.