Invited paper: Accelerating neuromorphic vision on FPGAs

Reconfigurable hardware such as FPGAs are being increasingly employed for application acceleration due to their high degree of parallelism, flexibility and power efficiency — factors which are key in the rapidly evolving field of embedded real-time vision. While recent advances in technology have increased the capacity of FPGAs, lack of standard models for developing custom accelerators creates issues with scalability and compatibility. In this paper, we describe a model for designing streaming hardware accelerators with run-time configurability. This model provides a generic interface for each hardware module, a modular and hierarchical structure for parallelism at multiple levels and a run-time reconfiguration framework for increased flexibility. We present case studies to accelerate sample neu-romorphic vision algorithms which are inspired by models of the mammalian visual cortex. Experimental results show speedups of several factors over comparable CPU implementations and higher performance-per-watt over relevant GPU implementations.

[1]  Adrian Park,et al.  Designing Modular Hardware Accelerators in C with ROCCC 2.0 , 2010, 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines.

[2]  Gianluca Palermo,et al.  A reconfigurable multiprocessor architecture for a reliable face recognition implementation , 2010, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010).

[3]  Pietro Perona,et al.  Overcomplete steerable pyramid filters and rotation invariance , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Laurent Itti,et al.  Applying computational tools to predict gaze direction in interactive visual environments , 2008, TAP.

[5]  Yann LeCun,et al.  CNP: An FPGA-based processor for Convolutional Networks , 2009, 2009 International Conference on Field Programmable Logic and Applications.

[6]  N. Vijaykrishnan,et al.  A scalable multi-FPGA framework for real-time digital signal processing , 2009, Optical Engineering + Applications.

[7]  Rolf Ernst,et al.  FlexWAFE - A High-end Real-Time Stream Processing Library for FPGAs , 2007, 2007 44th ACM/IEEE Design Automation Conference.

[8]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[9]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[10]  Theocharis Theocharides,et al.  Towards hardware stereoscopic 3D reconstruction a real-time FPGA computation of the disparity map , 2010, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010).

[11]  Tingting Xu,et al.  A high-speed multi-GPU implementation of bottom-up attention using CUDA , 2009, 2009 IEEE International Conference on Robotics and Automation.