论文信息 - Learning to Capture Light Fields Through a Coded Aperture Camera

Learning to Capture Light Fields Through a Coded Aperture Camera

We propose a learning-based framework for acquiring a light field through a coded aperture camera. Acquiring a light field is a challenging task due to the amount of data. To make the acquisition process efficient, coded aperture cameras were successfully adopted; using these cameras, a light field is computationally reconstructed from several images that are acquirToshiakied with different aperture patterns. However, it is still difficult to reconstruct a high-quality light field from only a few acquired images. To tackle this limitation, we formulated the entire pipeline of light field acquisition from the perspective of an auto-encoder. This auto-encoder was implemented as a stack of fully convolutional layers and was trained end-to-end by using a collection of training samples. We experimentally show that our method can successfully learn good image-acquisition and reconstruction strategies. With our method, light fields consisting of 5 \(\times \) 5 or 8 \(\times \) 8 images can be successfully reconstructed only from a few acquired images. Moreover, our method achieved superior performance over several state-of-the-art methods. We also applied our method to a real prototype camera to show that it is capable of capturing a real 3-D scene.

[1] Ravi Ramamoorthi,et al. Learning to Synthesize a 4D RGBD Light Field from a Single Image , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[2] F. Okano,et al. Gradient-index lens-array method based on real-time integral photography for three-dimensional images. , 1998, Applied optics.

[3] Hans-Peter Seidel,et al. Design and volume optimization of space structures , 2017, ACM Trans. Graph..

[4] P. Hanrahan,et al. Digital light field photography , 2006 .

[5] Aggelos K. Katsaggelos,et al. Compressive Light Field Sensing , 2012, IEEE Transactions on Image Processing.

[6] Gordon Wetzstein,et al. Tensor displays , 2012, ACM Trans. Graph..

[7] Ting-Chun Wang,et al. Learning-based view synthesis for light field cameras , 2016, ACM Trans. Graph..

[8] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[9] Gordon Wetzstein,et al. Compressive light field photography using overcomplete dictionaries and optimized projections , 2013, ACM Trans. Graph..

[10] P. Hanrahan,et al. Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[11] Aggelos K. Katsaggelos,et al. Deep fully-connected networks for video compressive sensing , 2016, Digit. Signal Process..

[12] Chia-Kai Liang,et al. Programmable aperture photography: multiplexed light field acquisition , 2008, SIGGRAPH 2008.

[13] Atsushi Shimada,et al. Light Field Distortion Feature for Transparent Object Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14] In Kyu Park,et al. Robust Light Field Depth Estimation Using Occlusion-Noise Aware Data Costs , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Sehoon Ha,et al. Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques , 2014, ACM Trans. Graph..

[16] Toshiaki Fujii,et al. Displaying Real-World Light Fields With Stacked Multiplicative Layers: Requirement and Data Conversion for Input Multiview Images , 2016, Journal of Display Technology.

[17] Tom E. Bishop,et al. Light field superresolution , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[18] Ashok Veeraraghavan,et al. Towards Motion Aware Light Field Video for Dynamic Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[19] Edward H. Adelson,et al. Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[20] David L Donoho,et al. Compressed sensing , 2006, IEEE Transactions on Information Theory.

[21] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[22] Toshiaki Fujii,et al. Free-Viewpoint TV , 2011, IEEE Signal Processing Magazine.

[23] Frédo Durand,et al. Light Field Reconstruction Using Sparsity in the Continuous Fourier Domain , 2014, ACM Trans. Graph..

[24] Sven Wanner,et al. Variational Light Field Analysis for Disparity Estimation and Super-Resolution , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Toshiaki Fujii,et al. PCA-coded aperture for light field photography , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[26] Rin-ichiro Taniguchi,et al. Motion-Invariant Coding Using a Programmable Aperture Camera , 2012, ACCV.

[27] E.J. Candes,et al. An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[28] Alexei A. Efros,et al. A 4D Light-Field Dataset and CNN Architectures for Material Recognition , 2016, ECCV.

[29] Steve Marschner,et al. Matching Real Fabrics with Micro-Appearance Models , 2015, ACM Trans. Graph..

[30] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[31] Kenta Oono,et al. Chainer : a Next-Generation Open Source Framework for Deep Learning , 2015 .

[32] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[33] Bobby Bodenheimer,et al. Synthesis and evaluation of linear motion transitions , 2008, TOGS.

[34] Ieee Transactions,et al. TransCAIP: A Live 3D TV System Using a Camera Array and an Integral Photography Display with Interactive Control of Viewing Parameters , 2009 .

[35] Alexei A. Efros,et al. Light field video capture using a learning-based hybrid imaging system , 2017, ACM Trans. Graph..

[36] Derek Nowrouzezahrai,et al. Learning hatching for pen-and-ink illustration of surfaces , 2012, TOGS.

[37] Leonard McMillan,et al. Dynamically reparameterized light fields , 2000, SIGGRAPH.

[38] Sven Wanner,et al. Datasets and Benchmarks for Densely Sampled 4D Light Fields , 2013, VMV.

[39] Marc Levoy,et al. High performance imaging using large camera arrays , 2005, SIGGRAPH 2005.

[40] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[41] Ming C. Lin,et al. Example-guided physically based modal sound synthesis , 2013, ACM Trans. Graph..

[42] Toshiaki Fujii,et al. Multipoint Measuring System for Video and Sound - 100-camera and microphone system , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[43] M. Landy,et al. The Plenoptic Function and the Elements of Early Vision , 1991 .

[44] In-So Kweon,et al. Learning a Deep Convolutional Network for Light-Field Image Super-Resolution , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[45] Williem,et al. Robust Light Field Depth Estimation for Noisy Scene with Occlusion , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Yonina C. Eldar,et al. Compressed Sensing with Coherent and Redundant Dictionaries , 2010, ArXiv.

[47] Byoungho Lee,et al. Additive light field displays , 2016, ACM Trans. Graph..

[48] Alexei A. Efros,et al. Depth Estimation with Occlusion Modeling Using Light-Field Cameras , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[50] Ayan Chakrabarti,et al. Learning Sensor Multiplexing Design through Back-propagation , 2016, NIPS.

[51] Shree K. Nayar,et al. Programmable Aperture Camera Using LCoS , 2012, IPSJ Trans. Comput. Vis. Appl..

[52] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53] Prasan A. Shedligeri,et al. Data driven coded aperture design for depth recovery , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[54] Gordon Wetzstein,et al. The light field stereoscope , 2015, ACM Trans. Graph..

[55] Ramesh Raskar,et al. Dappled photography: mask enhanced cameras for heterodyned light fields and coded aperture refocusing , 2007, SIGGRAPH 2007.

[56] Pavan K. Turaga,et al. Compressive Light Field Reconstructions Using Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).