Coded Two-Bucket Cameras for Computer Vision

We introduce coded two-bucket (C2B) imaging, a new operating principle for computational sensors with applications in active 3D shape estimation and coded-exposure imaging. A C2B sensor modulates the light arriving at each pixel by controlling which of the pixel’s two “buckets” should integrate it. C2B sensors output two images per video frame—one per bucket—and allow rapid, fully-programmable, per-pixel control of the active bucket. Using these properties as a starting point, we (1) develop an image formation model for these sensors, (2) couple them with programmable light sources to acquire illumination mosaics, i.e., images of a scene under many different illumination conditions whose pixels have been multiplexed and acquired in one shot, and (3) show how to process illumination mosaics to acquire live disparity or normal maps of dynamic scenes at the sensor’s native resolution. We present the first experimental demonstration of these capabilities, using a fully-functional C2B camera prototype. Key to this unique prototype is a novel programmable CMOS sensor that we designed from the ground up, fabricated and turned into a working system.

[1]  Kiriakos N. Kutulakos,et al.  CMOS Image Sensor Architecture for Primal-Dual Coding , 2017 .

[2]  Stefan Leutenegger,et al.  Real-Time 3D Reconstruction and 6-DoF Tracking with an Event Camera , 2016, ECCV.

[3]  Joaquim Salvi,et al.  Pattern codification strategies in structured light systems , 2004, Pattern Recognit..

[4]  Dieter Fox,et al.  DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Graham Fyffe,et al.  Single-shot photometric stereo by spectral multiplexing , 2010, 2011 IEEE International Conference on Computational Photography (ICCP).

[6]  Shree K. Nayar,et al.  Multiplexing for Optimal Lighting , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Steve Marschner,et al.  Matching Real Fabrics with Micro-Appearance Models , 2015, ACM Trans. Graph..

[8]  Kiriakos N. Kutulakos,et al.  Computational Imaging on the Electric Grid , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Andrew D. Payne,et al.  A 0.13 μm CMOS System-on-Chip for a 512 × 424 Time-of-Flight Image Sensor With Multi-Frequency Photo-Demodulation up to 130 MHz and 2 GS/s ADC , 2015, IEEE Journal of Solid-State Circuits.

[10]  Frédo Durand,et al.  Deep joint demosaicking and denoising , 2016, ACM Trans. Graph..

[11]  Felix Goldberg,et al.  Optimal multiplexed sensing: bounds, conditions and a graph theory link. , 2007, Optics express.

[12]  Christopher M. Brown Multiplex imaging and random arrays , 1972 .

[13]  Jie Zhang,et al.  Compact all-CMOS spatiotemporal compressive sensing video camera with pixel-wise coded exposure. , 2016, Optics express.

[14]  Moshe Ben-Ezra,et al.  Photometric Stereo for Dynamic Surface Orientations , 2010, ECCV.

[15]  Aswin C. Sankaranarayanan,et al.  Compressive Video Sensing: Algorithms, architectures, and applications , 2017, IEEE Signal Processing Magazine.

[16]  Matthew O'Toole,et al.  3D Shape and Indirect Appearance by Structured Light Transport , 2014, CVPR.

[17]  Rin-ichiro Taniguchi,et al.  High-speed imaging using CMOS image sensor with quasi pixel-wise exposure , 2016, 2016 IEEE International Conference on Computational Photography (ICCP).

[18]  Matthew O'Toole,et al.  Homogeneous codes for energy-efficient illumination and imaging , 2015, ACM Trans. Graph..

[19]  Shahriar Mirabbasi,et al.  Exposure-Programmable CMOS Pixel With Selective Charge Storage and Code Memory for Computational Imaging , 2018, IEEE Transactions on Circuits and Systems I: Regular Papers.

[20]  R. Lange,et al.  Solid-state time-of-flight range camera , 2001 .

[21]  Ruigang Yang,et al.  BRDF Invariant Stereo Using Light Transport Constancy , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Sam Van der Jeught,et al.  Real-time structured light profilometry: a review , 2016 .

[23]  Ramesh Raskar,et al.  Demultiplexing illumination via low cost sensing and nanosecond coding , 2014, 2014 IEEE International Conference on Computational Photography (ICCP).

[24]  Shree K. Nayar,et al.  Video from a single coded exposure photograph using a learned over-complete dictionary , 2011, 2011 International Conference on Computer Vision.

[25]  Mohit Gupta,et al.  MC3D: Motion Contrast 3D Scanning , 2015, 2015 IEEE International Conference on Computational Photography (ICCP).

[26]  Shree K. Nayar,et al.  Enhancing resolution along multiple imaging dimensions using assorted pixels , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Shoji Kawahito,et al.  The Dynamic Photometric Stereo Method Using a Multi-Tap CMOS Image Sensor † , 2018, Sensors.

[28]  Shahriar Mirabbasi,et al.  Always-on CMOS image sensor pixel design for pixel-wise binary coded exposure , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[29]  Donald B. Hondongwa,et al.  A Review of the Pinned Photodiode for CCD and CMOS Image Sensors , 2014, IEEE Journal of the Electron Devices Society.

[30]  Xavier Descombes,et al.  Photometric stereo with only two images: A theoretical study and numerical resolution , 2017, Image Vis. Comput..

[31]  Hans-Peter Seidel,et al.  Design and volume optimization of space structures , 2017, ACM Trans. Graph..

[32]  Zicheng Liu,et al.  Expressive expression mapping with ratio images , 2001, SIGGRAPH.

[33]  Marc Levoy,et al.  Applications of Multi-Bucket Sensors to Computational Photography , 2012 .

[34]  Shoji Kawahito,et al.  4.3 A programmable sub-nanosecond time-gated 4-tap lock-in pixel CMOS image sensor for real-time fluorescence lifetime imaging microscopy , 2017, 2017 IEEE International Solid-State Circuits Conference (ISSCC).

[35]  Gordon Wetzstein,et al.  Computational imaging with multi-camera time-of-flight systems , 2016, ACM Trans. Graph..

[36]  Gordon Wetzstein,et al.  On Plenoptic Multiplexing and Reconstruction , 2012, International Journal of Computer Vision.

[37]  Fumihiko Sakaue,et al.  Multiplex Image Projection Using Multi-band Projectors , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[38]  Pascal Fua,et al.  Making Background Subtraction Robust to Sudden Illumination Changes , 2008, ECCV.

[39]  Kari Pulli,et al.  FlexISP , 2014, ACM Trans. Graph..

[40]  Björn Stenger,et al.  Non-rigid Photometric Stereo with Colored Lights , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[41]  Ashok Veeraraghavan,et al.  A Framework for Analysis of Computational Imaging Systems: Role of Signal Prior, Sensor Noise and Multiplexing , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  R. Heintzmann,et al.  A dual path programmable array microscope (PAM): simultaneous acquisition of conjugate and non‐conjugate images , 2001, Journal of microscopy.

[43]  Robert J. Woodham,et al.  Photometric method for determining surface orientation from multiple images , 1980 .

[44]  Ramesh Raskar,et al.  Coded exposure photography: motion deblurring using fluttered shutter , 2006, SIGGRAPH 2006.

[45]  Tobi Delbrück,et al.  A 128$\times$ 128 120 dB 15 $\mu$s Latency Asynchronous Temporal Contrast Vision Sensor , 2008, IEEE Journal of Solid-State Circuits.

[46]  Shree K. Nayar,et al.  Micro Phase Shifting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Gordon Wetzstein,et al.  Snapshot difference imaging using correlation time-of-flight sensors , 2017, ACM Trans. Graph..

[48]  Hans-Peter Seidel,et al.  Animating deformable objects using sparse spacetime constraints , 2014, ACM Trans. Graph..

[49]  Wolfgang Heidrich,et al.  Low-budget transient imaging using photonic mixer devices , 2013, ACM Trans. Graph..

[50]  Joaquim Salvi,et al.  A state of the art in structured light patterns for surface profilometry , 2010, Pattern Recognit..

[51]  T. Delbruck,et al.  > Replace This Line with Your Paper Identification Number (double-click Here to Edit) < 1 , 2022 .

[52]  Marc Levoy,et al.  CMOS Image Sensors With Multi-Bucket Pixels for Computational Photography , 2012, IEEE Journal of Solid-State Circuits.

[53]  Shuntaro Yamazaki,et al.  Temporal Dithering of Illumination for Fast Active Vision , 2008, ECCV.

[54]  Shree K. Nayar,et al.  Generalized Assorted Pixel Camera: Postcapture Control of Resolution, Dynamic Range, and Spectrum , 2010, IEEE Transactions on Image Processing.

[55]  Yoav Y. Schechner,et al.  Illumination Multiplexing within Fundamental Limits , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Yasutomi Keita,et al.  4.3 A programmable sub-nanosecond time-gated 4-tap lock-in pixel CMOS image sensor for real-time fluorescence lifetime imaging microscopy , 2017 .

[57]  Ryo Furukawa,et al.  Dense 3D Reconstruction from High Frame-Rate Video Using a Static Grid Pattern , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Joonki Paik,et al.  Sensor-Based Auto-Focusing System Using Multi-Scale Feature Extraction and Phase Correlation Matching , 2015, Sensors.