A Multi-Class Objects Detection Coprocessor With Dual Feature Space and Weighted Softmax

A critical mission for mobile robot vision is to detect and classify different objects with low power consumption. In this brief, a multi-class object detection coprocessor is proposed by using the Histogram of Oriented Gradient (HOG) and Local Binary Pattern (LBP) together with a weighted Softmax classifier. Its architecture is compact and hardware-friendly suitable for energy-constrained applications. Firstly, the cell-based feature extraction unit and block-level normalization reuse the SRAMs for storing one-row cell and one-row block. Meanwhile, the working frequency of the feature extraction and block-normalization unit is synchronized to the image sensor for low dynamic power. Then, a block-level one-time sliding-detection-window (OTSDW) mechanism is developed for partial classification with scalable object size. Finally, the Softmax classifier, which is implemented by the look-up table, linear fitting methods, and the fixed-point number, is tested in the Fashion MNIST dataset to evaluate its performance in multi-class classification problems and it reached an accuracy of over 86.2% with 10,180 parameters. The experimental result shows that the hardware-resource usage of the FPGA implementation is capable of 60 fps VGA video with 80.98 mW power consumption. This method uses similar or even fewer hardware resources than that of previous work using only the HOG feature and single-class classifier.

[1]  Lei Wang,et al.  A Systolic SNN Inference Accelerator and its Co-optimized Software Framework , 2019, ACM Great Lakes Symposium on VLSI.

[2]  Thomas Deselaers,et al.  Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jungme Park,et al.  Comparison of HOG, LBP and Haar-Like Features for On-Road Vehicle Detection , 2018, 2018 IEEE International Conference on Electro/Information Technology (EIT).

[4]  Maheshkumar H. Kolekar,et al.  Classification of fashion article images using convolutional neural networks , 2017, 2017 Fourth International Conference on Image Information Processing (ICIIP).

[5]  Marek Gorgon,et al.  Floating point HOG implementation for real-time multiple object detection , 2012, 22nd International Conference on Field Programmable Logic and Applications (FPL).

[6]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Jan Barowski,et al.  ConvNet Transfer Learning for GPR Images Classification , 2020, 2020 German Microwave Conference (GeMiC).

[8]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[9]  Jian Cheng,et al.  Pedestrian Detection Based on HOG-LBP Feature , 2011, 2011 Seventh International Conference on Computational Intelligence and Security.

[10]  M. Flynn,et al.  Fast division algorithm with a small lookup table , 1999, Conference Record of the Thirty-Third Asilomar Conference on Signals, Systems, and Computers (Cat. No.CH37020).

[11]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[12]  Shintaro Izumi,et al.  Architectural Study of HOG Feature Extraction Processor for Real-Time Object Detection , 2012, 2012 IEEE Workshop on Signal Processing Systems.

[13]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Peter A. Beerel,et al.  pSConv: A Pre-defined S parse Kernel Based Convolution for Deep CNNs , 2019, 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[15]  Yiqiang Sheng,et al.  CoNN: Collaborative Neural Network for Personalized Representation Learning with Application to Scalable Task Classification , 2019, 2019 International Conference on Computer, Information and Telecommunication Systems (CITS).

[16]  Fengwei An,et al.  A Hardware Architecture for Cell-Based Feature-Extraction and Classification Using Dual-Feature Space , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[18]  Tieniu Tan,et al.  Boosted local structured HOG-LBP for object localization , 2011, CVPR 2011.

[19]  Usman Qayyum,et al.  A High Speed and Resource Efficient Approximation of Softmax Loss Function , 2019, 2019 16th International Bhurban Conference on Applied Sciences and Technology (IBCAST).