论文信息 - Mask-based light field capture and display

Mask-based light field capture and display

This dissertation describes light-efficient methods for capturing and displaying 3D images using thin, optically-attenuating masks. Light transport is modeled, under geometrical optics, as a 4D function: the light field; this function records the amount of light traveling through any point along any direction. Conventional photographs only record a 2D projection of the incident light field. Each image point is produced by integrating over the full hemisphere of incidence angles. Similarly, conventional displays only approximate a diffuse surface, where the amount of light leaving any point is constant over the full hemisphere of viewing angles. Thus, conventional cameras and displays only support 2D images, for which the perception of scene depth is lost. 3D images can be captured and displayed by including masks in conventional camera and display architectures. Parallax barriers are one example; a mask containing a uniform array of slits is placed slightly in front of a conventional display. This mask only allows certain disjoint display regions to be visible from each viewpoint. 3D image capture is achieved by placing a similar mask close to a sensor. In both cases, 3D images come at the cost of decreased resolution and brightness. This dissertation presents a first-principles analysis of dual-layer camera and display architectures, wherein the first layer is a conventional sensor or display and the second layer is a mask. Novel masks are developed that facilitate 3D image capture and display, outperforming conventional parallax barriers in terms of total light transmission and light field resolution. For 3D capture, a family of static, periodic, non-adaptive masks are derived from a frequency-domain analysis. For 3D display, a linear algebraic analysis reveals a set of time-multiplexed, aperiodic, adaptive masks. Four motivating applications are presented: digital photography, single-shot visual hull reconstruction, depth-sensing LCDs, and 3D display using dual-stacked LCDs.

G. Taubin | Douglas Lanman

[1] A. Laurentini,et al. The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[2] R. Shack,et al. History and principles of Shack-Hartmann wavefront sensing. , 2001, Journal of refractive surgery.

[3] S. Oh,et al. Mask-Based Vision Systems by Use of the Wigner Distribution Function and Ambiguity Function , 2009 .

[4] H. Kato,et al. A Continuous-Grain Silicon-System LCD With Optical Input Function , 2007, IEEE Journal of Solid-State Circuits.

[5] Luc Van Gool,et al. Blue-c: a spatially immersive display and 3D video portal for telepresence , 2003, IPT/EGVE.

[6] Douglas Lanman,et al. Build your own 3D scanner: 3D photography for beginners , 2009, SIGGRAPH '09.

[7] Kun Zhou,et al. Precomputed shadow fields for dynamic scenes , 2005, ACM Trans. Graph..

[8] Hanumant Singh,et al. Flat Refractive Geometry , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Hans-Peter Seidel,et al. Time-resolved 3d capture of non-stationary gas flows , 2008, SIGGRAPH Asia '08.

[10] Gordon Wetzstein,et al. A theory of plenoptic multiplexing , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11] Jefferson Y. Han. Low-cost multi-touch sensing through frustrated total internal reflection , 2005, UIST.

[12] R. Bracewell. Strip Integration in Radio Astronomy , 1956 .

[13] G. Lippmann. Epreuves reversibles donnant la sensation du relief , 1908 .

[14] Frédo Durand,et al. Image and depth from a conventional camera with a coded aperture , 2007, ACM Trans. Graph..

[15] Douglas Lanman,et al. Content-adaptive parallax barriers for automultiscopic 3D display , 2010, SIGGRAPH Talks.

[16] E. Adelson,et al. The Plenoptic Function and the Elements of Early Vision , 1991 .

[17] W. Freeman,et al. Understanding Camera Trade-Offs through a Bayesian Analysis of Light Field Projections , 2008, ECCV.

[18] William Buxton,et al. ThinSight: integrated optical multi-touch sensing through thin form-factor displays , 2007, EDT '07.

[19] Kenneth C. Smith,et al. A multi-touch three dimensional touch-sensitive tablet , 1985, CHI '85.

[20] Kiriakos N. Kutulakos,et al. Confocal Stereo , 2006, International Journal of Computer Vision.

[21] Michael W. Halle. Holographic stereograms as discrete imaging systems , 1994, Electronic Imaging.

[22] Shree K. Nayar,et al. Lighting sensitive display , 2004, ACM Trans. Graph..

[23] David Salesin,et al. Surface light fields for 3D photography , 2000, SIGGRAPH.

[24] Amnon Yariv,et al. Optical Electronics in Modern Communications Fifth Edition , 2012 .

[25] Andrew E. Johnson,et al. Advances in the Dynallax Solid-State Dynamic Parallax Barrier Autostereoscopic Visualization Display System , 2008, IEEE Transactions on Visualization and Computer Graphics.

[26] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[27] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[28] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[29] D Marr,et al. Directional selectivity and its use in early visual processing , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[30] R. Plemmons,et al. Optimality, computation, and interpretation of nonnegative matrix factorizations , 2004 .

[31] Shree K. Nayar,et al. Light field transfer: global illumination between real and synthetic objects , 2008, ACM Trans. Graph..

[32] Douglas Lanman,et al. Shape from depth discontinuities under orthographic projection , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[33] Steven K. Feiner,et al. Cross-dimensional gestural interaction techniques for hybrid immersive environments , 2005, IEEE Proceedings. VR 2005. Virtual Reality, 2005..

[34] Ramesh Raskar,et al. Glare aware photography: 4D ray sampling for reducing glare effects of camera lenses , 2008, ACM Trans. Graph..

[35] Edward H. Adelson,et al. Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[36] Michael W. Berry,et al. Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[37] Yuan Yan Tang,et al. Total variation norm-based nonnegative matrix factorization for identifying discriminant representation of image patterns , 2008, Neurocomputing.

[38] Douglas Lanman,et al. Modeling and Synthesis of Aperture Effects in Cameras , 2008, CAe.

[39] Douglas Lanman,et al. BiDi screen: depth and lighting aware interaction and display , 2009, SIGGRAPH Posters.

[40] Shree K. Nayar,et al. Programmable Imaging: Towards a Flexible Camera , 2006, International Journal of Computer Vision.

[41] Andrew Gardner,et al. Capturing and Rendering with Incident Light Fields , 2003, Rendering Techniques.

[42] Hideshi Yamada,et al. Rendering for an interactive 360° light field display , 2007, ACM Trans. Graph..

[43] Ramesh Raskar,et al. Non-refractive modulators for encoding and capturing scene appearance and depth , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[44] Hideaki Sasazawa,et al. Autostereoscopic 3‐D display using LCD‐generated parallax barrier , 1993 .

[45] Light Fields on the Cheap , 2000 .

[46] P. Hanrahan,et al. Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[47] François X. Sillion,et al. Fast calculation of soft shadow textures using convolution , 1998, SIGGRAPH.

[48] Shree K. Nayar,et al. Lensless Imaging with a Controllable Aperture , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[49] Shree K. Nayar,et al. Multiplexing for Optimal Lighting , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50] Pat Hanrahan,et al. Digital correction of lens aberrations in light field photography , 2006, International Optical Design Conference.

[51] Takeo Kanade,et al. The Theory and Practice of Coplanar Shadowgram Imaging for Acquiring Visual Hulls of Intricate Objects , 2009, International Journal of Computer Vision.

[52] L. Lipton. Foundations of the Stereoscopic Cinema , 1982 .

[53] Tom E. Bishop,et al. Light field superresolution , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[54] Andrew D. Wilson. TouchLight: an imaging touch screen and display for gesture-based interaction , 2004, ICMI '04.

[55] D. Gabor. A New Microscopic Principle , 1948, Nature.

[56] Hailin Jin,et al. Light field video stabilization , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[57] Marc Levoy,et al. Veiling glare in high dynamic range imaging , 2007, ACM Trans. Graph..

[58] Tsuhan Chen,et al. Light field capturing with lensless cameras , 2005, IEEE International Conference on Image Processing 2005.

[59] I. Ashdown,et al. Near-Field Photometry: A New Approach , 1993 .

[60] Dan Rosenfeld,et al. Going beyond the display: a surface technology with an electronically switchable diffuser , 2008, UIST '08.

[61] Ramesh Raskar,et al. Reinterpretable Imager: Towards Variable Post‐Capture Space, Angle and Time Resolution in Photography , 2010, Comput. Graph. Forum.

[62] Amit K. Agrawal,et al. Shield fields: modeling and capturing 3D occluders , 2008, SIGGRAPH Asia '08.

[63] Abhishek Kumar Jha,et al. Affine theorem for two-dimensional Fourier transform , 1993 .

[64] Gene H. Golub,et al. Matrix Computations, Third Edition , 1996 .

[65] Clifton Forlines,et al. DTLens: multi-user tabletop spatial data exploration , 2005, UIST.

[66] Xin Liu,et al. Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[67] Herbert E. Ives,et al. A Camera for Making Parallax Panoramagrams , 1928 .

[68] Graham John Woodgate,et al. LP‐1: Late‐News Poster: High Efficiency Reconfigurable 2D/3D Autostereoscopic Display , 2003 .

[69] Douglas Lanman,et al. Shape from Depth Discontinuities , 2009, ETVC.

[70] Douglas Lanman,et al. Multi-flash 3D photography: capturing shape and appearance , 2006, SIGGRAPH '06.

[71] Chia-Kai Liang,et al. Programmable aperture photography: multiplexed light field acquisition , 2008, SIGGRAPH 2008.

[72] Douglas Lanman,et al. BiDi screen: a thin, depth-sensing LCD for 3D interaction using light fields , 2009, SIGGRAPH 2009.

[73] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[74] Ajay Limaye,et al. Drishti: a volume exploration and presentation tool , 2012, Optics & Photonics - Optical Engineering + Applications.

[75] Douglas Lanman,et al. Descattering Transmission via Angular Filtering , 2010, ECCV.

[76] Jun Rekimoto,et al. SmartSkin: an infrastructure for freehand manipulation on interactive surfaces , 2002, CHI.

[77] Gordon Wetzstein,et al. Sensor saturation in Fourier multiplexed imaging , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[78] Alan Hedge,et al. Multi-Touch: A New Tactile 2-D Gesture Interface for Human-Computer Interaction , 2001 .

[79] Douglas Lanman,et al. Surround structured lighting: 3-D scanning with orthographic illumination , 2009, Comput. Vis. Image Underst..

[80] Pablo Tamayo,et al. Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[81] Shree K. Nayar,et al. Shape from Focus , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[82] Pietro Perona,et al. 3D Reconstruction by Shadow Carving: Theory and Practical Evaluation , 2007, International Journal of Computer Vision.

[83] Hans-Peter Seidel,et al. Free-viewpoint video of human actors , 2003, ACM Trans. Graph..

[84] J. Greivenkamp. Color dependent optical prefilter for the suppression of aliasing artifacts. , 1990, Applied optics.

[85] L. McMillan. Image-Based Rendering using Image Warping , 1999 .

[86] M. Glas,et al. Principles of Computerized Tomographic Imaging , 2000 .

[87] Robert C. Bolles,et al. Epipolar-plane image analysis: An approach to determining structure from motion , 1987, International Journal of Computer Vision.

[88] Model-based Face Capture from Orthogonal Images , 2001 .

[89] Andrew W. Fitzgibbon,et al. Direct least squares fitting of ellipses , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[90] Shree K. Nayar,et al. A theory of multiplexed illumination , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[91] M. Levoy,et al. Recording and controlling the 4D light field in a microscope using microlens arrays , 2009, Journal of microscopy.

[92] Leonard McMillan,et al. A Real-Time Distributed Light Field Camera , 2002, Rendering Techniques.

[93] David Salesin,et al. Spatio-angular resolution tradeoffs in integral photography , 2006, EGSR '06.

[94] Douglas Lanman,et al. Spherical Catadioptric Arrays: Construction, Multi-View Geometry, and Calibration , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[95] Ken Perlin,et al. An autostereoscopic display , 2000, SIGGRAPH.

[96] M. Levoy,et al. Wigner distributions and how they relate to the light field , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[97] T. Gustavson. Camera: A History of Photography from Daguerreotype to Digital , 2009 .

[98] Harry Shum,et al. Plenoptic sampling , 2000, SIGGRAPH.

[99] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[100] Marc Levoy,et al. Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[101] Gregory Wornell,et al. Quasi light fields: extending the light field to coherent radiation. , 2009, Journal of the Optical Society of America. A, Optics, image science, and vision.

[102] William T. Freeman,et al. What makes a good model of natural images? , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[103] P. Paatero,et al. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[104] M. Hutley,et al. The moire magnifier , 1994 .

[105] Douglas Lanman. Distributed sensor networks with collective computation , 2001 .

[106] Shree K. Nayar,et al. Rational Filters for Passive Depth from Defocus , 1998, International Journal of Computer Vision.

[107] Frédo Durand,et al. 4D frequency analysis of computational cameras for depth of field extension , 2009, SIGGRAPH '09.

[108] Ravi Ramamoorthi,et al. A first-order analysis of lighting, shading, and shadows , 2007, TOGS.

[109] Pieter Peers,et al. Dynamic shape capture using multi-view photometric stereo , 2009, ACM Trans. Graph..

[110] Brian Cabral,et al. Imaging vector fields using line integral convolution , 1993, SIGGRAPH.

[111] Ren Ng. Fourier slice photography , 2005, ACM Trans. Graph..

[112] Douglas Lanman,et al. Reconstructing a 3D Line from a Single Catadioptric Image , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[113] Douglas Lanman,et al. Surround Structured Lighting for Full Object Scanning , 2007, Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007).

[114] Shahzad Malik,et al. Visual touchpad: a two-handed gestural input device , 2004, ICMI '04.

[115] Adrian Hilton,et al. Model-based human shape reconstruction from multiple views , 2008, Comput. Vis. Image Underst..

[116] Chris Slinger,et al. Computer-generated holography as a generic display technology , 2005, Computer.

[117] Douglas Lanman,et al. Beyond Silhouettes: Surface Reconstruction Using Multi-Flash Photography , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[118] Jun Rekimoto,et al. HoloWall: designing a finger, hand, body, and object sensitive wall , 1997, UIST '97.

[119] Mark A. Horowitz,et al. Light field video camera , 2000, IS&T/SPIE Electronic Imaging.

[120] Haldun M. Ozaktas,et al. Linear algebraic theory of partial coherence: discrete fields and measures of partial coherence , 2002, International Commission for Optics.

[121] Bruce G. Baumgart,et al. Geometric modeling for computer vision. , 1974 .

[122] F. Okano,et al. Analysis of resolution limitation of integral photography , 1998 .

[123] Ramesh Raskar,et al. Image destabilization: Programmable defocus using lens and sensor motion , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[124] Ramesh Raskar,et al. Dappled photography: mask enhanced cameras for heterodyned light fields and coded aperture refocusing , 2007, ACM Trans. Graph..

[125] M. Halle,et al. 3-D Displays and Signal Processing , 2007, IEEE Signal Processing Magazine.

[126] Andrew Lumsdaine,et al. The focused plenoptic camera , 2009, 2009 IEEE International Conference on Computational Photography (ICCP).

[127] Paul Van Dooren,et al. Weighted Nonnegative Matrix Factorization and Face Feature Extraction , 2007 .

[128] Martin Kemp,et al. Leonardo on Painting: An Anthology of Writings by Leonardo da Vinci; With a Selection of Documents Relating to his Career as an Artist , 1989 .

[129] Marc Levoy,et al. Symmetric photography: exploiting data-sparseness in reflectance fields , 2006, EGSR '06.

[130] Chih-Jen Lin,et al. Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[131] Johannes Taelman,et al. Shadow multiplexing for real-time silhouette extraction , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[132] Joohwan Kim,et al. Point light source integral imaging with improved resolution and viewing angle by the use of electrically movable pinhole array. , 2007, Optics express.

[133] Sehoon Yea,et al. Resampling, Antialiasing, and Compression in Multiview 3-D Displays , 2007, IEEE Signal Processing Magazine.

[134] E E Fenimore,et al. New family of binary arrays for coded aperture imaging. , 1989, Applied optics.

[135] V. Kshirsagar,et al. Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[136] Lance Williams,et al. View Interpolation for Image Synthesis , 1993, SIGGRAPH.

[137] Douglas Lanman,et al. Build your own 3D display , 2010, SIGGRAPH '10.

[138] D. Spencer,et al. The photic field , 1981 .

[139] Hrvoje Benko,et al. Using Depth-Sensing Camera to Enable Freehand Interactions On and Above the Interactive Surface , 2008 .