Accuracy and performance analysis of Harris Corner computation on tightly-coupled processor arrays

Nowadays, computer vision algorithms have countless application domains. On the one hand, these algorithms are typically computationally demanding, on the other hand, they are often used in embedded systems, which have stringent constraints on, e. g., size or power. In this work, we present the benefits of mapping compute-intensive imaging algorithms on programmable massively parallel processor arrays. More specific, we propose different variants of a combined corner and edge detection algorithm, the Harris Corner Detector (HCD), map these variants onto tightly-coupled processor arrays (TCPAs), and prototype the TCPA architecture, executing the different HCD implementations, in FPGA technology. Because floating-point operations are very costly in FPGAs, we use fixed-point arithmetic in our design, and evaluate our implementation by means of accuracy and performance against two state-of-the-art implementations: (a) the OpenCV library of programming functions for real-time computer vision, using 64-bit floating-point precision, and (b) a 32-bit fixed-point DSP-based embedded system. The accuracy of our work is evaluated by considering the number of corners detected. Here, our approach achieves an average error of less than 1.5% when compared with a reference implementation. Our different variants, trading accuracy for performance, are mapped to the programmable processor elements of a TCPA. Here, the fastest TCPA implementation achieves a 55 times higher frame rate than a state-of-the-art implementation of the HCD on a digital signal processor. Finally, we show how our implementation can be used in the context of a new resource-aware parallel computing paradigm, called invasive computing. Here, an application can adapt itself at run-time in order to satisfy different quality and throughput requirements.

[1]  K. Takaya,et al.  FPGA Based Stereo Vision System to Display Disparity Map in Realtime , 2012, 2012 International Conference on Information Science and Applications.

[2]  Marc Reichenbach,et al.  A Generic VHDL Template for 2D Stencil Code Applications on FPGAs , 2012, 2012 IEEE 15th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops.

[3]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[4]  Rahul Sukthankar,et al.  Controlling your TV with gestures , 2010, MIR '10.

[5]  Jürgen Becker,et al.  Multiprocessor System-on-Chip - Hardware Design and Tool Integration , 2011, Multiprocessor System-on-Chip.

[6]  Adam Schmidt,et al.  High-Speed Image Feature Detection Using FPGA Implementation of Fast Algorithm , 2008, VISAPP.

[7]  Rudy Lauwereins,et al.  Lococo: low complexity corner detector , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Miguel Arias-Estrada,et al.  An FPGA Co-processor for Real-Time Visual Tracking , 2002, FPL.

[9]  Liang-Gee Chen,et al.  iVisual: An intelligent visual sensor SoC with 2790fps CMOS image sensor and 205GOPS/W vision processor , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[10]  Jürgen Teich,et al.  Hierarchical power management for adaptive tightly-coupled processor arrays , 2013, TODE.

[11]  Alexander G. Hauptmann,et al.  MoSIFT: Recognizing Human Actions in Surveillance Videos , 2009 .

[12]  Jürgen Teich,et al.  Scalable Many-Domain Power Gating in Coarse-Grained Reconfigurable Processor Arrays , 2011, IEEE Embedded Systems Letters.

[13]  James J. Little,et al.  Mobile Robot Localization and Mapping with Uncertainty using Scale-Invariant Visual Landmarks , 2002, Int. J. Robotics Res..

[14]  Jürgen Teich,et al.  A prototype of an adaptive computer vision algorithm on MPSoC architecture , 2013, 2013 Conference on Design and Architectures for Signal and Image Processing.

[15]  Steven S. Muchnick,et al.  Advanced Compiler Design and Implementation , 1997 .

[16]  Amir Fijany,et al.  Highly Parallel Implementation of Harris Corner Detector on CSX SIMD Architecture , 2010, Euro-Par Workshops.

[17]  Sidi Ahmed Mahmoudi,et al.  A new self-adapting architecture for feature detection , 2012, 22nd International Conference on Field Programmable Logic and Applications (FPL).

[18]  Jürgen Teich,et al.  System integration of tightly-coupled processor arrays using reconfigurable buffer structures , 2013, CF '13.

[19]  Waldemar Celes Filho,et al.  Accelerated Corner-Detector Algorithms , 2008, BMVC.

[20]  Claude Tadonki,et al.  Accelerator-Based implementation of the Harris Algorithm , 2012, ICISP.