A Sub-100 mW Dual-Core HOG Accelerator VLSI for Parallel Feature Extraction Processing for HDTV Resolution Video

This paper describes a Histogram of Oriented Gradients (HOG) feature extraction accelerator that features a VLSI-oriented HOG algorithm with early classification in Support Vector Machine (SVM) classification, dual core architecture for parallel feature extraction and multiple object detection, and detection-window-size scalable architecture with reconfigurable MAC array for processing objects of several shapes. To achieve low-power consumption for mobile applications, early classification reduces the amount of computations in SVM classification efficiently with no accuracy degradation. The dual core architecture enables parallel feature extraction in one frame for high-speed or low-power computing and detection of multiple objects simultaneously with low power consumption by HOG feature sharing. Objects of several shapes, a vertically long object, a horizontally long object, and a square object, can be detected because of cooperation between the two cores. The proposed methods provide processing capability for HDTV resolution video (1920 × 1080 pixels) at 30 frames per second (fps). The test chip, which has been fabricated using 65 nm CMOS technology, occupies 4.2 × 2.1 mm2 containing 502 Kgates and 1.22 Mbit on-chip SRAMs. The simulated data show 99.5 mW power consumption at 42.9 MHz and 1.1 V. key words: HOG, object detection, low-power, HDTV

[1]  Hiroyuki Ochi,et al.  Hardware Architecture for HOG Feature Extraction , 2009, 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[2]  Jack E. Volder The CORDIC Trigonometric Computing Technique , 1959, IRE Trans. Electron. Comput..

[3]  Hironobu Fujiyoshi,et al.  FPGA Hardware with Target-Reconfigurable Object Detector , 2015, IEICE Trans. Inf. Syst..

[4]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Ramakant Nevatia,et al.  Efficient scan-window based object detection using GPGPU , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[6]  Yuichiro Shibata,et al.  Deep pipelined one-chip FPGA implementation of a real-time image-based human detection algorithm , 2011, 2011 International Conference on Field-Programmable Technology.

[7]  Ulrich Brunsmann,et al.  FPGA-GPU architecture for kernel SVM pedestrian detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[8]  Guang Deng,et al.  Real-Time Vision-Based Stop Sign Detection System on FPGA , 2008, 2008 Digital Image Computing: Techniques and Applications.

[9]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[10]  S. Bauer,et al.  FPGA Implementation of a HOG-based Pedestrian Recognition System , 2010 .

[11]  Shintaro Izumi,et al.  A 34.7-mW quad-core MIQP solver processor for robot control , 2010, IEEE Custom Integrated Circuits Conference 2010.

[12]  Shintaro Izumi,et al.  Architectural Study of HOG Feature Extraction Processor for Real-Time Object Detection , 2012, 2012 IEEE Workshop on Signal Processing Systems.

[13]  Ryusuke Miyamoto,et al.  Hardware architecture for high-accuracy real-time pedestrian detection with CoHOG features , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.