On Random Weights and Unsupervised Feature Learning

Recently, two anomalous results in the literature have shown that certain feature learning architectures can yield useful features for object recognition tasks even with untrained, random weights. In this paper we pose the question: why do random weights sometimes do so well? Our answer is that certain convolutional pooling architectures can be inherently frequency selective and translation invariant, even with random weights. Based on this, we demonstrate the viability of extremely fast architecture search by using random weights to evaluate candidate architectures, thereby sidestepping the time-consuming learning process. We then show that a surprising fraction of the performance of certain state-of-the-art methods can be attributed to the architecture alone.
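To make the idea concrete, here is a minimal sketch (not the authors' implementation) of evaluating a candidate convolutional pooling architecture with random weights: features are extracted by random filters followed by rectification and sum pooling, and only a simple linear classifier is trained on top, so its validation accuracy can serve as a cheap score for the architecture. The filter count, pooling size, and ridge classifier below are illustrative assumptions, not choices taken from the paper.

```python
# Sketch: score a convolutional pooling architecture using random (untrained) filters.
# All hyperparameters here are illustrative; only the linear readout is ever trained.
import numpy as np


def random_conv_pool_features(images, n_filters=16, filter_size=5, pool_size=4, seed=0):
    """Convolve with random filters, rectify, then sum-pool over local regions."""
    rng = np.random.default_rng(seed)
    filters = rng.standard_normal((n_filters, filter_size, filter_size))
    n, h, w = images.shape
    oh, ow = h - filter_size + 1, w - filter_size + 1
    feats = []
    for img in images:
        maps = np.empty((n_filters, oh, ow))
        for k, f in enumerate(filters):
            # valid-mode 2-D correlation, written out explicitly for clarity
            for i in range(oh):
                for j in range(ow):
                    maps[k, i, j] = np.sum(img[i:i + filter_size, j:j + filter_size] * f)
        maps = np.maximum(maps, 0.0)  # rectification nonlinearity
        ph, pw = oh // pool_size, ow // pool_size
        pooled = maps[:, :ph * pool_size, :pw * pool_size] \
            .reshape(n_filters, ph, pool_size, pw, pool_size).sum(axis=(2, 4))
        feats.append(pooled.ravel())
    return np.array(feats)


def score_architecture(train_x, train_y, val_x, val_y, **arch):
    """Cheap architecture score: accuracy of a ridge classifier on random-weight features."""
    f_tr = random_conv_pool_features(train_x, **arch)
    f_va = random_conv_pool_features(val_x, **arch)
    classes = np.unique(train_y)
    targets = (train_y[:, None] == classes[None, :]).astype(float)  # one-hot labels
    lam = 1e-2  # ridge penalty (closed-form one-vs-all least squares)
    w = np.linalg.solve(f_tr.T @ f_tr + lam * np.eye(f_tr.shape[1]), f_tr.T @ targets)
    pred = classes[np.argmax(f_va @ w, axis=1)]
    return np.mean(pred == val_y)
```

Because no filter learning is involved, many candidate architectures (filter sizes, pooling extents, depths) can be ranked quickly by this score, and only the most promising ones need to be trained in full.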
