PNNU: Parallel Nearest-Neighbor Units for Learned Dictionaries

We present a novel parallel approach, the Parallel Nearest Neighbor Unit (PNNU), for finding the nearest member in a learned dictionary of high-dimensional features. This computation is fundamental to machine learning and data analytics algorithms such as sparse coding for feature extraction. PNNU achieves high performance through three techniques: (1) a novel fast table-lookup scheme that identifies a small number of atoms as candidates among which the nearest neighbor of a query data vector can be found; (2) reduced computation cost from working with candidate atoms of reduced dimensionality; and (3) parallel computation over multiple cores with low inter-core communication overhead. Building on the efficient computation of techniques (1) and (2), technique (3) attains further speedup via parallel processing. We have implemented PNNU on multi-core machines and demonstrate its superior performance on three application tasks in signal processing and computer vision. For an action recognition task, PNNU achieves a 41x overall performance gain on a 16-core compute server over a conventional serial implementation of nearest-neighbor computation. Our PNNU software is available online as open source.
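To make the candidate-filtering idea in technique (1) concrete, below is a minimal single-core sketch of table-lookup-based nearest-neighbor search. It is an illustration under assumptions, not the paper's actual algorithm: the choice of top-variance dimensions (a stand-in for the dimensionality reduction of technique (2)), the uniform quantization into buckets, and the function names `build_tables` and `query` are all ours.

```python
import numpy as np

def build_tables(D, d=4, bins=32):
    # D: (n_atoms, dim) dictionary matrix, one atom per row.
    # Keep only the top-d highest-variance dimensions (an illustrative
    # stand-in for the paper's dimensionality reduction), quantize each
    # into `bins` buckets, and record which atoms fall in each bucket.
    top = np.argsort(D.var(axis=0))[-d:]
    lo, hi = D[:, top].min(axis=0), D[:, top].max(axis=0)
    idx = np.clip(((D[:, top] - lo) / (hi - lo + 1e-12) * bins).astype(int),
                  0, bins - 1)
    tables = [[set() for _ in range(bins)] for _ in range(d)]
    for a in range(D.shape[0]):
        for j in range(d):
            tables[j][idx[a, j]].add(a)
    return top, lo, hi, tables

def query(x, D, top, lo, hi, tables, bins=32):
    # Table lookup: collect atoms whose quantized coordinates match the
    # query's in any of the retained dimensions, then run an exact
    # nearest-neighbor search over that small candidate set only.
    q = np.clip(((x[top] - lo) / (hi - lo + 1e-12) * bins).astype(int),
                0, bins - 1)
    cand = set().union(*(tables[j][q[j]] for j in range(len(top))))
    if not cand:
        cand = set(range(D.shape[0]))  # fall back to a full search
    cand = np.fromiter(cand, dtype=int)
    return cand[np.argmin(np.linalg.norm(D[cand] - x, axis=1))]
```

Because only the candidate atoms reach the exact distance computation, the per-query cost drops from O(n_atoms · dim) toward O(|candidates| · dim); technique (3) would then shard queries across cores, which needs no inter-core communication since the tables are read-only after construction.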
