Minimalistic Image Signal Processing for Deep Learning Applications

In-sensor energy-efficient deep learning accelerators have the potential to enable the use of deep neural networks in embedded vision applications. However, their negative impact on accuracy has been severely underestimated. The inference pipeline used in prior in-sensor deep learning accelerators bypasses the image signal processor (ISP), thereby disrupting the conventional vision pipeline and undermining accuracy of machine learning algorithms trained on conventional, post-ISP datasets. For example, the detection accuracy of an off-the-shelf Faster RCNN algorithm in a vehicle detection scenario reduces by 60%. To make in-sensor accelerators practical, we describe energy-efficient operations that yield most of the benefits of an ISP and reduce covariate shift between the training (ISP processed images) and target (RAW images) distributions. For the vehicle detection problem, our approach improves accuracy by 25–60%. Relative to the conventional ISP pipeline, energy consumption and response time improve by 30% and 34%, respectively.

[1]  Robert P. Dick,et al.  Digital Foveation: An Energy-Aware Machine Vision Framework , 2018, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[2]  Bhaskar Choubey,et al.  A wide dynamic range CMOS image sensor with an adjustable logarithmic response , 2008, Electronic Imaging.

[3]  Lin Zhong,et al.  RedEye: Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).

[4]  Boris Murmann,et al.  Toward Always-On Mobile Object Detection: Energy Versus Performance Tradeoffs for Embedded HOG Feature Extraction , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Tianshi Chen,et al.  ShiDianNao: Shifting vision processing closer to the sensor , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[6]  Sergio Guadarrama,et al.  Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Seokjun Park,et al.  A 272.49 pJ/pixel CMOS image sensor with embedded object detection and bio-inspired 2D optic flow generation for nano-air-vehicle navigation , 2017, 2017 Symposium on VLSI Circuits.

[8]  Saibal Mukhopadhyay,et al.  3-D Stacked Image Sensor With Deep Neural Network Computation , 2018, IEEE Sensors Journal.

[9]  Suren Jayasuriya,et al.  Reconfiguring the Imaging Pipeline for Computer Vision , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[10]  Daehyeok Kim,et al.  A Multi-Resolution Mode CMOS Image Sensor with a Novel Two-Step Single-Slope ADC for Intelligent Surveillance Systems , 2017, Sensors.