High-level Performance Evaluation of Object Detection based on Massively Parallel Focal-plane Acceleration Requiring Minimum Pixel Area Overhead

Smart CMOS image sensors can leverage the inherent data-level parallelism and regular computational flow of early vision by incorporating elementary processors at pixel level. However, it comes at the cost of extra area having a strong impact on the sensor sensitivity, resolution and image quality. In this scenario, the fundamental challenge is to devise new strategies capable of boosting the performance of the targeted vision pipeline while minimally affecting the sensing function itself. Such strategies must also feature enough flexibility to accommodate particular application requirements. From these high-level specifications, we propose a focal-plane processing architecture tailored to speed up object detection via the Viola-Jones algorithm. This architecture is supported by only two extra transistors per pixel and simple peripheral digital circuitry that jointly make up a massively parallel reconfigurable processing lattice. A performance evaluation of the proposed scheme in terms of accuracy and acceleration for face detection is reported.

[1]  Ricardo Carmona-Galán,et al.  FLIP-Q: A QCIF Resolution Focal-Plane Array for Low-Power Image Processing , 2011, IEEE Journal of Solid-State Circuits.

[2]  Leibo Liu,et al.  A Fast Integral Image Computing Hardware Architecture With High Power and Area Efficiency , 2015, IEEE Transactions on Circuits and Systems II: Express Briefs.

[3]  Jianliang Xu,et al.  Accelerating Viola-Jones Facce Detection Algorithm on GPUs , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.

[4]  Boyd Fowler Solid‐State Image Sensors , 2015 .

[5]  Richard P. Kleihorst,et al.  Demo: Mouse sensor networks, the smart camera , 2011, 2011 Fifth ACM/IEEE International Conference on Distributed Smart Cameras.

[6]  Reinhard Klette,et al.  Concise Computer Vision , 2014, Undergraduate Topics in Computer Science.

[7]  Jun Ohta,et al.  Smart CMOS Image Sensors and Applications , 2007 .

[8]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  Bruno Steux,et al.  Yet Even Faster (YEF) real-time object detection , 2007, Int. J. Intell. Syst. Technol. Appl..

[10]  Ákos Zarándy,et al.  Focal-Plane Sensor-Processor Chips , 2014 .