Massive Parallel-Hardware Architecture for Multiscale Stereo, Optical Flow and Image-Structure Computation

Low-level vision tasks pose an outstanding challenge in terms of computational effort: pixel-wise operations require high-performance architectures to achieve real-time processing. Nowadays, diverse technologies permit a high level of parallelism and in this way researchers can address more and more complex on-chip low-level vision-feature extraction. In the state of the art, different architectures have been described that process single vision modes in real time but multiple computer vision modes are seldom conjointly computed on a single device to produce a general-purpose on-chip low-level vision system: this may be the basis for mid-level or high-level vision tasks. We present here a novel architecture for multiple-vision feature extraction that includes multiscale optical flow, disparity, energy, orientation, and phase. A high degree of robustness in real-life situations is obtained thanks to adopting phase-based models (at the cost of relatively high computing resource requirements). The high flexibility of the reconfigurable devices used allows for the exploration of different hardware configurations to deal with final target and user requirements. Making use of this novel architecture and hardware-sharing techniques we describe a co-processing board implementation as a case study. It reaches an outstanding computing power of 92.3 GigaOPS at very low power consumption (approximately 12.9 GigaOPS/W).

[1]  Dah-Jye Lee,et al.  FPGA-Based Embedded Motion Estimation Sensor , 2008, Int. J. Reconfigurable Comput..

[2]  Apostolos Dollas,et al.  Methodology and Experimental Setup for the Determination of System-level Dynamic Reconfiguration Overhead , 2007 .

[3]  Marc M. Van Hulle,et al.  Realtime phase-based optical flow on the GPU , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  Javier Díaz,et al.  Superpipelined high-performance optical-flow computation architecture , 2008, Comput. Vis. Image Underst..

[5]  Eduardo Ros Vidal,et al.  Robust Bioinspired Architecture for Optical-Flow Computation , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[6]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[7]  Masahiko Yoshimoto,et al.  A VGA 30-fps Realtime Optical-Flow Processor Core for Moving Picture Recognition , 2008, IEICE Trans. Electron..

[8]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[9]  Viktor Öwall,et al.  A Hardware Architecture for Real-Time Video Segmentation Utilizing Memory Reduction Techniques , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Mancia Anguita,et al.  Optimization Strategies for High-Performance Computing of Optical-Flow in General-Purpose Processors , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Bum-Jae You,et al.  Improved-Quality Real-Time Stereo Vision Processor , 2009, 2009 22nd International Conference on VLSI Design.

[12]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Karl Pauwels Computational modeling of visual attention: neuronal response modulation in the thalamocortical complex and saliency-based detection of independent motion , 2008 .

[14]  Gustavo Carneiro,et al.  Multi-scale phase-based local features , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[15]  Jun Luo,et al.  Structural Parameter Optimization for 3-DOF Spherical Parallel Mechanism, Binocular Stereo , 2012 .

[16]  Fabio Solari,et al.  Fast technique for phase-based disparity estimation with no explicit calculation of phase , 2001 .

[17]  Oge Marques,et al.  Stereo depth with a Unified Architecture GPU , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Ioannis Andreadis,et al.  FPGA based disparity map computation with vergence control , 2010, Microprocess. Microsystems.

[19]  Shigang Li,et al.  Binocular Spherical Stereo , 2008, IEEE Transactions on Intelligent Transportation Systems.

[20]  Eduardo Ros,et al.  Real-time computing platform for spiking neurons (RT-spike) , 2006, IEEE Trans. Neural Networks.

[21]  Robin R. Murphy,et al.  A VLSI Architecture and Algorithm for Lucas–Kanade-Based Optical Flow Computation , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[22]  T. Vaudrey,et al.  Differences between stereo and motion behaviour on synthetic and real-world stereo sequences , 2008, 2008 23rd International Conference Image and Vision Computing New Zealand.

[23]  Fabio Solari,et al.  Phase-Based Binocular Perception of Motion in Depth: Cortical-Like Operators and Analog VLSI Architectures , 2003, EURASIP J. Adv. Signal Process..

[24]  Eduardo Ros,et al.  High-Performance Optical-Flow Architecture Based on a Multi-Scale, Multi-Orientation Phase-Based Model , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Javier Díaz,et al.  Multi-port abstraction layer for FPGA intensive memory exploitation applications , 2010, J. Syst. Archit..

[26]  Oscar Nestares,et al.  Efficient spatial-domain implementation of a multiscale image representation based on Gabor functions , 1998, J. Electronic Imaging.

[27]  Fabio Solari,et al.  A compact harmonic code for early vision based on anisotropic frequency channels , 2010, Comput. Vis. Image Underst..

[28]  ZuWhan Kim,et al.  Robust Lane Detection and Tracking in Challenging Scenarios , 2008, IEEE Transactions on Intelligent Transportation Systems.

[29]  David J. Fleet,et al.  Stability of phase information , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[30]  Matteo Tomasini,et al.  Pyramidal architecture for stereo vision and motion estimation in real-time fpga-based devices , 2010 .

[31]  Partha Pratim Pande,et al.  Network-on-Chip Hardware Accelerators for Biological Sequence Alignment , 2010, IEEE Transactions on Computers.

[32]  Alberto Prieto,et al.  Real-Time System for High-Image Resolution Disparity Estimation , 2007, IEEE Transactions on Image Processing.

[33]  Gerald Estrin,et al.  Reconfigurable Computer Origins: The UCLA Fixed-Plus-Variable (F+V) Structure Computer , 2002, IEEE Ann. Hist. Comput..

[34]  Wayne Luk,et al.  Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study , 2010, IEEE Transactions on Computers.

[35]  Minglun Gong,et al.  Real-Time Stereo Matching Using Orthogonal Reliability-Based Dynamic Programming , 2007, IEEE Transactions on Image Processing.

[36]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[37]  Francisco Barranco,et al.  A novel architecture for a massively parallel low level vision processing engine on chip , 2010, 2010 IEEE International Symposium on Industrial Electronics.

[38]  Maria E. Angelopoulou,et al.  Robust Real-Time Super-Resolution on FPGA and an Application to Video Enhancement , 2009, TRETS.

[39]  Hassen Salhi,et al.  On Line Wavelets Transform on a Xilinx FPGA Circuit to Medical Images Compression , 2009, 2009 International Conference on Digital Image Processing.

[40]  Liang-Gee Chen,et al.  Architecture design of stereo matching using belief propagation , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[41]  George A. Constantinides,et al.  A Parallel Hardware Architecture for Scale and Rotation Invariant Feature Detection , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[42]  Javier Diaz,et al.  Local image phase, energy and orientation extraction using FPGAs , 2008 .

[43]  Yiannis Aloimonos,et al.  A Roadmap to the Integration of Early Visual Modules , 2007, International Journal of Computer Vision.

[44]  Truong Quang Vinh,et al.  FPGA Implementation of Real-Time Edge-Preserving Filter for Video Noise Reduction , 2008, 2008 International Conference on Computer and Electrical Engineering.

[45]  Chris Murphy,et al.  Low-Cost Stereo Vision on an FPGA , 2007, 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2007).

[46]  Tian-Sheuan Chang,et al.  Algorithm and Architecture of Disparity Estimation With Mini-Census Adaptive Support Weight , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[47]  Azzedine Boukerche,et al.  A Hardware Accelerator for the Fast Retrieval of DIALIGN Biological Sequence Alignments in Linear Space , 2010, IEEE Transactions on Computers.

[48]  Mark L. Chang,et al.  Low-Cost Stereo Vision on an FPGA , 2007 .

[49]  Rodrigo Agís,et al.  Hardware event-driven simulation engine for spiking neural networks , 2007 .

[50]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Wangda Zuo,et al.  Fast and informative flow simulations in a building by using fast fluid dynamics model on graphics processing unit , 2010 .

[52]  Richard Szeliski,et al.  A Database and Evaluation Methodology for Optical Flow , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[53]  Derek Abbott,et al.  Theory and Validation of Magnetic Resonance Fluid Motion Estimation Using Intensity Flow Data , 2009, PloS one.

[54]  Florentin Wörgötter,et al.  Machine Vision and Applications Manuscript Nr. Performance of Phase-based Algorithms for Disparity Estimation , 2022 .