A Power Efficient Architecture with Optimized Parallel Memory Accessing for Feature Generation

Visual feature detection has been widely used in many computer vision applications. In comparison with popular feature detection algorithms, AFFINE-SIFT (ASIFT) achieves the strongest robustness on image illumination, image rotation and image scale transformation, etc., however, since feature vectors of different feature points need to be computed, feature generation of ASIFT exhibits high computing complexity. Meanwhile, it also shows low efficiency on parallel memory accessing since multiple computing processes of feature points need to access the same memory concurrently, which causes memory accessing conflicts and incurs high power consumption. In this work, we improve the feature generation based on a rotation invariant block binary pattern(RIBBP) to reduce computation and design a parallel processing architecture. Meanwhile, a spatial relevance based memory accessing(SRBMA) optimization is performed to improve data reuse and enhance parallel memory accessing. Evaluations using TSMC 65 nm 1P9M LP process show that this work improves the processing speed and energy efficiency of feature generation by 17% and 37.9%, respectively, when compared with state-of-the art work.

[1]  Hoi-Jun Yoo,et al.  A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Processor With Bio-Inspired Neural Perception Engine , 2009, IEEE Journal of Solid-State Circuits.

[2]  Nurfitri Anbarsanti,et al.  Multi-object tracking and detection system based on feature detection of the intelligent transportation system , 2014, 2014 IEEE 4th International Conference on System Engineering and Technology (ICSET).

[3]  Hoi-Jun Yoo,et al.  Intelligent Network-on-Chip With Online Reinforcement Learning for Portable HD Object Recognition Processor , 2014, IEEE Transactions on Circuits and Systems I: Regular Papers.

[4]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[5]  Leibo Liu,et al.  A 83fps 1080P resolution 354 mW silicon implementation for computing the improved robust feature in affine space , 2015, 2015 IEEE Custom Integrated Circuits Conference (CICC).

[6]  Chenchen Deng,et al.  SURFEX: A 57fps 1080P resolution 220mW silicon implementation for simplified speeded-up robust feature with 65nm process , 2013, Proceedings of the IEEE 2013 Custom Integrated Circuits Conference.

[7]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, IEEE Transactions on Image Processing.

[8]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9]  Hoi-Jun Yoo,et al.  A 320 mW 342 GOPS Real-Time Dynamic Object Recognition Processor for HD 720p Video Streams , 2013, IEEE Journal of Solid-State Circuits.

[10]  Aamir S. Malik,et al.  A framework for real time indoor robot navigation using Monte Carlo Localization and ORB feature detection , 2014, The 18th IEEE International Symposium on Consumer Electronics (ISCE 2014).

[11]  Adrien Bartoli,et al.  KAZE Features , 2012, ECCV.

[12]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Xiaoyang Tan,et al.  Enhanced Local Texture Feature Sets for Face Recognition Under Difficult Lighting Conditions , 2007, AMFG.

[14]  Jean-Michel Morel,et al.  ASIFT: A New Framework for Fully Affine Invariant Image Comparison , 2009, SIAM J. Imaging Sci..