Plenoptic Layer-Based Modeling for Image Based Rendering

Image based rendering is an attractive alternative to model based rendering for generating novel views because of its lower complexity and potential for photo-realistic results. To reduce the number of images necessary for alias-free rendering, some geometric information for the 3D scene is normally necessary. In this paper, we present a fast automatic layer-based method for synthesizing an arbitrary new view of a scene from a set of existing views. Our algorithm takes advantage of the knowledge of the typical structure of multiview data to perform occlusion-aware layer extraction. In addition, the number of depth layers used to approximate the geometry of the scene is chosen based on plenoptic sampling theory with the layers placed non-uniformly to account for the scene distribution. The rendering is achieved using a probabilistic interpolation approach and by extracting the depth layer information on a small number of key images. Numerical results demonstrate that the algorithm is fast and yet is only 0.25 dB away from the ideal performance achieved with the ground-truth knowledge of the 3D geometry of the scene of interest. This indicates that there are measurable benefits from following the predictions of plenoptic theory and that they remain true when translated into a practical system for real world data.

[1]  Pier Luigi Dragotti,et al.  Exact Feature Extraction Using Finite Rate of Innovation Principles With an Application to Image Super-Resolution , 2009, IEEE Transactions on Image Processing.

[2]  Toshiaki Fujii,et al.  View Generation with 3D Warping Using Depth Information for FTV , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[3]  Harry Shum,et al.  Rendering driven depth reconstruction , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[4]  智一 吉田,et al.  Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[5]  Hideo Saito,et al.  Free viewpoint video synthesis and presentation of sporting events for mixed reality entertainment , 2004, ACE '04.

[6]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[7]  Zhengang Zhai,et al.  Stereo Matching for Larger Disparity Range Using Gradient Information and Adjacent Segments Cooperative Optimization , 2008, 2008 International Conference on Computer and Electrical Engineering.

[8]  Hrvoje Benko,et al.  Combining multiple depth cameras and projectors for interactions on, above and between surfaces , 2010, UIST.

[9]  Yo-Sung Ho,et al.  Generation of ROI Enhanced Depth Maps Using Stereoscopic Cameras and a Depth Camera , 2008, IEEE Transactions on Broadcasting.

[10]  Peter Meer,et al.  Synergism in low level vision , 2002, Object recognition supported by user interaction for service robots.

[11]  Christophe Tillier,et al.  Distance Dependent Depth Filtering in 3D Warping for 3DTV , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.

[12]  Jianbin Jiao,et al.  A new segment-based algorithm for stereo matching , 2009, 2009 International Conference on Mechatronics and Automation.

[13]  Leonard McMillan,et al.  Plenoptic Modeling: An Image-Based Rendering System , 2023 .

[14]  Heiko Hirschmüller,et al.  Improvements in real-time correlation-based stereo vision , 2001, CVPR 2001.

[15]  Markus H. Gross,et al.  Articulated Billboards for Video‐based Rendering , 2010, Comput. Graph. Forum.

[16]  Tsuhan Chen,et al.  A survey on image-based rendering - representation, sampling and compression , 2004, Signal Process. Image Commun..

[17]  Mike Brookes,et al.  Accurate non-iterative depth layer extraction algorithm for image based rendering , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  BrookesMike,et al.  Plenoptic Layer-Based Modeling for Image Based Rendering , 2013 .

[19]  Richard Szeliski,et al.  Extracting layers and analyzing their specular properties using epipolar-plane-image analysis , 2005, Comput. Vis. Image Underst..

[20]  Harry Shum,et al.  Layered lumigraph with LOD control , 2002, Comput. Animat. Virtual Worlds.

[21]  Yo-Sung Ho,et al.  High-quality multi-view depth generation using multiple color and depth cameras , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[22]  Xing Mei,et al.  On building an accurate stereo matching system on graphics hardware , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[23]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.

[25]  Hideo Saito,et al.  Computer Vision for 3DTV and Augmented Reality , 2011, 2011 International Symposium on Ubiquitous Virtual Reality.

[26]  Henry Fuchs,et al.  Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[27]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[28]  Liang Zhang,et al.  Stereoscopic image generation based on depth images for 3D TV , 2005, IEEE Transactions on Broadcasting.

[29]  Minh N. Do,et al.  Ieee Transactions on Image Processing on the Bandwidth of the Plenoptic Function , 2022 .

[30]  J. Berent,et al.  Plenoptic Manifolds , 2007, IEEE Signal Processing Magazine.

[31]  Minh N. Do,et al.  Symmetric multi-view stereo reconstruction from planar camera arrays , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Richard Szeliski,et al.  The lumigraph , 1996, SIGGRAPH.

[33]  Vladan Velisavljevic,et al.  Multiview Image Coding Using Depth Layers and an Optimized Bit Allocation , 2012, IEEE Transactions on Image Processing.

[34]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[35]  Vivek K Goyal,et al.  Exploiting sparsity in time-of-flight range acquisition using a single time-resolved sensor. , 2011, Optics express.

[36]  Jonathan M. Garibaldi,et al.  Real-Time Correlation-Based Stereo Vision with Reduced Border Errors , 2002, International Journal of Computer Vision.

[37]  Ha T. Nguyen,et al.  Immersive Visual Communication , 2011, IEEE Signal Processing Magazine.

[38]  P. Debevec,et al.  Image-based modeling, rendering, and lighting , 2002, IEEE Computer Graphics and Applications.

[39]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[40]  Peter H. N. de With,et al.  Free-viewpoint depth image based rendering , 2010, J. Vis. Commun. Image Represent..

[41]  Hujun Bao,et al.  Consistent Depth Maps Recovery from a Video Sequence , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Yasuhiko Sasaki,et al.  Hole filling: a novel delay reduction technique using selector logic , 1998, Proceedings of the IEEE 1998 Custom Integrated Circuits Conference (Cat. No.98CH36143).

[43]  Masayuki Tanimoto,et al.  FTV (free viewpoint television) creating ray-based image engineering , 2005, IEEE International Conference on Image Processing 2005.

[44]  Mike Brookes,et al.  Image based rendering with depth cameras: How many are needed? , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[45]  Dongxiao Li,et al.  A Depth Extraction Method Based on Motion and Geometry for 2D to 3D Conversion , 2009, 2009 Third International Symposium on Intelligent Information Technology Application.

[46]  Om Prakash Gangwal,et al.  Depth map post-processing for 3D-TV , 2009, 2009 Digest of Technical Papers International Conference on Consumer Electronics.

[47]  A. Lumsdaine Full Resolution Lightfield Rendering , 2008 .

[48]  Andrew Zisserman,et al.  Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[49]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[50]  Liang-Gee Chen,et al.  Efficient Depth Image Based Rendering with Edge Dependent Depth Filter and Interpolation , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[51]  Heung-Yeung Shum,et al.  Image-Based Rendering and Synthesis , 2007, IEEE Signal Processing Magazine.

[52]  Yo-Sung Ho,et al.  Joint coding of multi-view video and corresponding depth map , 2008, 2008 15th IEEE International Conference on Image Processing.

[53]  Pier Luigi Dragotti,et al.  A fast layer-based multiview image coding algorithm , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[54]  H. Opower Multiple view geometry in computer vision , 2002 .

[55]  Mike Brookes,et al.  Image-Based Rendering and the Sampling of the Plenoptic Function , 2013, Emerging Technologies for 3D Video.

[56]  Moon Gi Kang,et al.  Super-resolution image reconstruction: a technical overview , 2003, IEEE Signal Process. Mag..

[57]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[58]  Yo-Sung Ho,et al.  Hole filling method using depth based in-painting for view synthesis in free viewpoint television and 3-D video , 2009, 2009 Picture Coding Symposium.

[59]  Minh N. Do,et al.  Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[60]  Minh N. Do,et al.  Wavelet-Based Joint Estimation and Encoding of Depth-Image-Based Representations for Free-Viewpoint Rendering , 2008, IEEE Transactions on Image Processing.

[61]  Harry Shum,et al.  Pop-up light field: An interactive image-based modeling and rendering system , 2004, TOGS.

[62]  Yasushi Yagi,et al.  A System for Capturing Textured 3D Shapes Based on One-Shot Grid Pattern with Multi-band Camera and Infrared Projector , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[63]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.

[64]  A. Laurentini,et al.  The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[65]  Markus H. Gross,et al.  Novel‐View Synthesis of Outdoor Sport Events Using an Adaptive View‐Dependent Geometry , 2012, Comput. Graph. Forum.

[66]  Mike Brookes,et al.  Adaptive layer extraction for image based rendering , 2009, 2009 IEEE International Workshop on Multimedia Signal Processing.

[67]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[68]  Michael Bosse,et al.  Unstructured lumigraph rendering , 2001, SIGGRAPH.

[69]  Heiko Hirschmüller,et al.  Evaluation of Stereo Matching Costs on Images with Radiometric Differences , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[70]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[71]  H. Saito,et al.  Arbitrary viewpoint video synthesis from multiple uncalibrated cameras , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[72]  Robert C. Bolles,et al.  Epipolar-plane image analysis: An approach to determining structure from motion , 1987, International Journal of Computer Vision.

[73]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[74]  P. Nasiopoulos,et al.  A new prediction structure for multiview video coding , 2009, 2009 16th International Conference on Digital Signal Processing.

[75]  Hideo Saito,et al.  Depth camera based system for auto-stereoscopic displays , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[76]  Ha T. Nguyen,et al.  Error Analysis for Image-Based Rendering with Depth Information , 2009, 2006 International Conference on Image Processing.

[77]  Jan-Michael Frahm,et al.  Real-Time Visibility-Based Fusion of Depth Maps , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[78]  Vladimir Kolmogorov,et al.  Generalized Multi-camera Scene Reconstruction Using Graph Cuts , 2003, EMMCVPR.

[79]  H. Hirschmüller Accurate and Efficient Stereo Processing by Semi-Global Matching and Mutual Information , 2005, CVPR.

[80]  Andrea Fusiello Image-based Rendering * , 2003 .

[81]  Yasushi Yagi,et al.  Dynamic scene shape reconstruction using a single structured light pattern , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[82]  Masayuki Tanimoto,et al.  FTV: Free-viewpoint Television , 2006, Signal Process. Image Commun..

[83]  Hai Tao,et al.  Global matching criterion and color segmentation based stereo , 2000, Proceedings Fifth IEEE Workshop on Applications of Computer Vision.

[84]  Yo-Sung Ho,et al.  Depth Coding Using a Boundary Reconstruction Filter for 3-D Video Systems , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[85]  Adrian Hilton,et al.  A Comparative Study of Free-Viewpoint Video Techniques For sports events , 2006 .

[86]  Richard I. Hartley,et al.  Theory and Practice of Projective Rectification , 1999, International Journal of Computer Vision.

[87]  Qixiang Ye,et al.  A progressive region-merging algorithm for stereo matching , 2009, 2009 IEEE Youth Conference on Information, Computing and Telecommunication.

[88]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[89]  Minh N. Do,et al.  On the Information Rates of the Plenoptic Function , 2007, IEEE Transactions on Information Theory.

[90]  Harry Shum,et al.  An Object-Based Approach to Image/Video-Based Synthesis and Processing for 3-D and Multiview Televisions , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[91]  Hugh Durrant-Whyte,et al.  Simultaneous localization and mapping (SLAM): part II , 2006 .

[92]  Sehoon Yea,et al.  Depth Estimation for View Synthesis in Multiview Video Coding , 2007, 2007 3DTV Conference.

[93]  Hujun Bao,et al.  Recovering consistent video depth maps via bundle optimization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[94]  Harry Shum,et al.  Plenoptic sampling , 2000, SIGGRAPH.

[95]  Sunan Wang,et al.  Scale invariant control points based stereo matching for dynamic programming , 2009, 2009 9th International Conference on Electronic Measurement & Instruments.

[96]  Vivek K. Goyal,et al.  CoDAC: A compressive depth acquisition camera framework , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[97]  Hugh F. Durrant-Whyte,et al.  Simultaneous localization and mapping: part I , 2006, IEEE Robotics & Automation Magazine.

[98]  Frederic Dufaux,et al.  Emerging Technologies for 3D Video: Creation, Coding, Transmission and Rendering , 2013, Emerging Technologies for 3D Video.

[99]  Keita Takahashi,et al.  Theoretical Analysis of View Interpolation With Inaccurate Depth Information , 2012, IEEE Transactions on Image Processing.

[100]  Vladan Velisavljevic,et al.  Interactive multiview image coding , 2011, 2011 18th IEEE International Conference on Image Processing.

[101]  Michael Elad,et al.  Fast and robust multiframe super resolution , 2004, IEEE Transactions on Image Processing.

[102]  Martin Vetterli,et al.  Sampling and interpolation of the plenoptic function , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[103]  Mike Brookes,et al.  A closed-form expression for the bandwidth of the plenoptic function under finite field of view constraints , 2010, 2010 IEEE International Conference on Image Processing.

[104]  Andrew K. C. Wong,et al.  Dense depth map acquisition by hierarchic structured light , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[105]  Sing Bing Kang,et al.  Stereo for Image-Based Rendering using Image Over-Segmentation , 2007, International Journal of Computer Vision.