论文信息 - Accelerate the Detection Frame Rate of YOLO Object Detection Algorithm

Accelerate the Detection Frame Rate of YOLO Object Detection Algorithm

YOLO (You-Only-Look-Once) is by far the well-known Deep Neural Networks (DNNs) object detection algorithm with real-time performance on a computer with GPUs. Conceptually, YOLO divides the input image of size \(W \times W\) into non-overlapping square cells with the final feature of size \(S \times S\); i.e. \((416 \times 416) \rightarrow (13 \times 13)\). Each cell is responsible for predicting a single object whose centre falls into it. In this paper, we propose the algorithm that makes use of our observation mapping relationship which states that while the sizes of square cells are changed from layer to layer, their indices are preserved. The algorithm operates by locating a region of change in an input image and identifies the indices of square cells that cover the region. Only the members of the input features within these cells in all layers along the network are required to be operated. When the algorithm is employed along with the spatio-temporal property within video frames, it is capable of attaining the best relative detection of 1.47 (about 7 fps) with 90% correctness. These are benchmarked with the ordinary YOLO object detection on a personal computer: Intel Core i7 CPU at 3.5 GHz with 16 GB of memory and without any sophisticate GPUs, on the Tiny-YOLO network.

Wattanapong Kurdthongmee

[1] Jeremy Bottleson,et al. clCaffe: OpenCL Accelerated Caffe for Convolutional Neural Networks , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[2] Luca Benini,et al. CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data , 2017, ICDSC.

[3] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Hanqing Lu,et al. Recent advances in efficient computation of deep convolutional neural networks , 2018, Frontiers of Information Technology & Electronic Engineering.

[6] Xuanzhe Liu,et al. DeepCache: Principled Cache for Mobile Deep Vision , 2017, MobiCom.

[7] Meng Zhang,et al. Recent Advances in Convolutional Neural Network Acceleration , 2018, Neurocomputing.