Unsupervised slow subspace-learning from stationary processes

We propose a method of unsupervised learning from stationary, vector-valued processes. A projection onto a low-dimensional subspace is selected on the basis of an objective function that rewards data variance and penalizes the variance of the velocity vector, thereby exploiting the short-time dependencies of the process. We prove bounds on the estimation error of the objective in terms of the β-mixing coefficients of the process. It is also shown that maximizing the objective minimizes an error bound for simple classification algorithms on a generic class of learning tasks. Experiments with image recognition demonstrate the algorithm's ability to learn geometrically invariant feature maps.
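The sketch below illustrates the type of trade-off the abstract describes: reward the variance of the projected data while penalizing the variance of the finite-difference "velocity" of consecutive observations. It is a minimal NumPy illustration, not the paper's algorithm; the function name slow_subspace, the weight a, and the eigen-decomposition shortcut are assumptions made here for concreteness.

```python
import numpy as np

def slow_subspace(X, d, a=1.0):
    """Illustrative sketch (not the paper's exact method): pick a
    d-dimensional subspace trading off large data variance against
    small velocity variance.

    X : array of shape (T, n), consecutive observations of the process
    d : target subspace dimension
    a : hypothetical weight on the velocity-variance penalty
    """
    X = X - X.mean(axis=0)            # center the data
    V = np.diff(X, axis=0)            # finite-difference "velocity" vectors
    C = X.T @ X / len(X)              # data covariance
    Cdot = V.T @ V / len(V)           # velocity covariance
    # Maximizing tr(P C P) - a * tr(P Cdot P) over rank-d orthogonal
    # projections amounts to taking the top-d eigenvectors of C - a * Cdot.
    w, U = np.linalg.eigh(C - a * Cdot)
    return U[:, np.argsort(w)[::-1][:d]]   # columns span the selected subspace
```

For instance, X could hold successive frames (or patch vectors) from a video; new data are then projected onto the returned columns to obtain slowly varying, approximately invariant features.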
