Sets of approximating functions with finite Vapnik-Chervonenkis dimension for nearest-neighbors algorithms

A misconception sometimes encountered in the literature states that for nearest-neighbors algorithms there exists no fixed hypothesis class of finite Vapnik-Chervonenkis dimension. This paper presents a simple reformulation (not a modification) of the nearest-neighbors algorithm in which, instead of a natural number k, a percentage α ∈ (0, 1) of nearest neighbors is used. Owing to this reformulation, one can construct sets of approximating functions which we prove to have finite VC dimension. In a special (but practical) case this dimension equals ⌈2/α⌉. It then also becomes possible to form a sequence of sets of functions with increasing VC dimension, and to perform complexity selection via cross-validation or in the manner of the structural risk minimization framework. Results of such experiments are also presented.
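To make the reformulation concrete, below is a minimal Python sketch, not the authors' implementation: it implements the percentage-based neighbor rule and a simple complexity-selection loop over a sequence of α values via cross-validation. The function names alpha_nn_predict and select_alpha_by_cv, the Euclidean metric, the majority-vote decision, and the 5-fold scheme are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def alpha_nn_predict(X_train, y_train, X_query, alpha):
    """Classify each query point by majority vote over the
    ceil(alpha * n) nearest training points, where n is the
    training-set size (the percentage-based variant of k-NN)."""
    n = len(X_train)
    k = max(1, int(np.ceil(alpha * n)))  # neighbor count implied by the fraction alpha
    preds = np.empty(len(X_query), dtype=y_train.dtype)
    for i, x in enumerate(X_query):
        dists = np.linalg.norm(X_train - x, axis=1)  # Euclidean distances (an assumption)
        nearest = np.argsort(dists)[:k]              # indices of the k nearest neighbors
        labels, counts = np.unique(y_train[nearest], return_counts=True)
        preds[i] = labels[np.argmax(counts)]         # majority vote among neighbors
    return preds

def select_alpha_by_cv(X, y, alphas, n_folds=5, seed=None):
    """Pick the alpha with the lowest cross-validated error rate.
    Smaller alpha yields a richer class (VC dimension ceil(2/alpha)
    in the special case described in the abstract), so this loop is
    a plain form of complexity selection."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(X)), n_folds)
    errors = []
    for alpha in alphas:
        fold_errs = []
        for f in range(n_folds):
            test_idx = folds[f]
            train_idx = np.concatenate([folds[g] for g in range(n_folds) if g != f])
            y_hat = alpha_nn_predict(X[train_idx], y[train_idx], X[test_idx], alpha)
            fold_errs.append(np.mean(y_hat != y[test_idx]))
        errors.append(np.mean(fold_errs))
    return alphas[int(np.argmin(errors))], errors
```

For instance, scanning alphas = [0.5, 0.25, 0.1, 0.05] corresponds, in the special case above, to VC dimensions 4, 8, 20 and 40, i.e. a nested sequence of function sets of growing complexity, as used in structural risk minimization.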
