Distributed sparse HMAX model

HMAX is a neuroscience-inspired deep learning model that consists of alternating S layers and C layers, mimicking the functional organization of the primate ventral visual pathway. Recently, sparse coding has been introduced into the HMAX framework to learn the S-layer bases, yielding semantic features of natural images. However, on large-scale datasets the sparse HMAX suffers from high time and memory consumption during both training and testing. In this paper, we present a distributed algorithm that parallelizes the sparse coding in the S layers. Experiments on a cluster with eight nodes verify that the algorithm learns the same higher-level semantic representations of objects as the original model, but with much higher efficiency. The speedup grows linearly with the number of nodes, demonstrating the good scalability of the proposed method.
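
The abstract does not spell out how the S-layer sparse coding is divided among nodes, so the following is only a minimal sketch of one plausible data-parallel scheme, assuming MPI (via mpi4py) and a plain ISTA encoder. The function ista_encode, the dictionary and patch sizes, and the regularization weight lam are illustrative placeholders, not the paper's actual implementation.

```python
# Hypothetical sketch: MPI-parallel S-layer sparse coding with ISTA.
# The dictionary D is broadcast from the root; image patches are scattered
# across ranks, encoded locally, and the sparse codes are gathered back.
import numpy as np
from mpi4py import MPI


def ista_encode(D, X, lam=0.1, n_iter=100):
    """Encode the columns of X against dictionary D with plain ISTA."""
    L = np.linalg.norm(D, 2) ** 2            # Lipschitz constant of the gradient
    step = 1.0 / L
    A = np.zeros((D.shape[1], X.shape[1]))   # sparse codes, one column per patch
    for _ in range(n_iter):
        grad = D.T @ (D @ A - X)             # gradient of 0.5 * ||X - D A||^2
        A = A - step * grad
        A = np.sign(A) * np.maximum(np.abs(A) - step * lam, 0.0)  # soft threshold
    return A


comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()

if rank == 0:
    D = np.random.randn(64, 128)              # placeholder S-layer bases
    D /= np.linalg.norm(D, axis=0)
    patches = np.random.randn(64, 8000)       # placeholder input patches
    chunks = np.array_split(patches, size, axis=1)
else:
    D, chunks = None, None

D = comm.bcast(D, root=0)                     # every node gets the same dictionary
local_patches = comm.scatter(chunks, root=0)  # each node gets its share of patches
local_codes = ista_encode(D, local_patches)   # local sparse coding
codes = comm.gather(local_codes, root=0)      # root reassembles the S-layer output

if rank == 0:
    S_response = np.concatenate(codes, axis=1)
    print("S-layer sparse codes:", S_response.shape)
```

Under these assumptions the script would be launched across the cluster with, for example, mpiexec -n 8 python distributed_s_layer.py, matching the eight-node setup mentioned above; dictionary learning itself (e.g., by online dictionary learning) is not shown here.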
