Image-Based Rendering and Synthesis

One of the most important applications in multiview imaging (MVI) is the development of advanced immersive viewing or visualization systems using, for instance, 3DTV. With the introduction of multiview TVs, it is expected that a new age of 3DTV systems will arrive in the near future. Image-based rendering (IBR) refers to a collection of techniques and representations that allow 3-D scenes and objects to be visualized in a realistic way without full 3-D model reconstruction. IBR uses images as the primary substrate. The potential for photorealistic visualization has tremendous appeal, and it has been receiving increasing attention over the years. Applications such as video games, virtual travel, and E-commerce stand to benefit from this technology. This article serves as a tutorial introduction and brief review of this important technology. First the classification, principles, and key research issues of IBR are discussed. Then, an object-based IBR system to illustrate the techniques involved and its potential application in view synthesis and processing are explained. Stereo matching, which is an important technique for depth estimation and view synthesis, is briefly explained and some of the top-ranked methods are highlighted. Finally, the challenging problem of interactive IBR is explained. Possible solutions and some state-of-the-art systems are also reviewed.

[1]  Richard Szeliski,et al.  The lumigraph , 1996, SIGGRAPH.

[2]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[3]  Yizhou Yu,et al.  Efficient View-Dependent Image-Based Rendering with Projective Texture-Mapping , 1998, Rendering Techniques.

[4]  In-So Kweon,et al.  Adaptive Support-Weight Approach for Correspondence Search , 2006, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Leonard McMillan,et al.  A Real-Time Distributed Light Field Camera , 2002, Rendering Techniques.

[6]  J. Sethian,et al.  Fronts propagating with curvature-dependent speed: algorithms based on Hamilton-Jacobi formulations , 1988 .

[7]  Andreas Klaus,et al.  Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[8]  S. Osher,et al.  Algorithms Based on Hamilton-Jacobi Formulations , 1988 .

[9]  Jian Sun,et al.  Symmetric stereo matching for occlusion handling , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Anselmo Lastra,et al.  LDI tree: a hierarchical representation for image-based rendering , 1999, SIGGRAPH.

[11]  Michael S. Landy,et al.  Computational models of visual processing , 1991 .

[12]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[13]  Harry Shum,et al.  On the data compression and transmission aspects of panoramic video , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[14]  Leonard McMillan,et al.  Plenoptic Modeling: An Image-Based Rendering System , 2023 .

[15]  Tsuhan Chen,et al.  A survey on image-based rendering - representation, sampling and compression , 2004, Signal Process. Image Commun..

[16]  Tony F. Chan,et al.  Active contours without edges , 2001, IEEE Trans. Image Process..

[17]  Harry Shum,et al.  The plenoptic video , 2005, IEEE Trans. Circuits Syst. Video Technol..

[18]  Harry Shum,et al.  Image-based rendering , 2006, Found. Trends Comput. Graph. Vis..

[19]  Cheng Lei,et al.  Region-Tree Based Stereo Using Dynamic Programming Optimization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20]  Toshiaki Fujii,et al.  Ray space coding for 3D visual communication , 1996 .

[21]  Andrew Blake,et al.  Surface shape from the deformation of apparent contours , 1992, International Journal of Computer Vision.

[22]  Kun Zhou,et al.  Precomputed shadow fields for dynamic scenes , 2005, SIGGRAPH 2005.

[23]  TheobaltChristian,et al.  Free-viewpoint video of human actors , 2003 .

[24]  Pere Brunet,et al.  3D reconstruction with projective octrees and epipolar geometry , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[25]  Tsuhan Chen,et al.  Active Rearranged Capturing of Image-Based Rendering Scenes—Theory and Practice , 2007, IEEE Transactions on Multimedia.

[26]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[27]  Jan Kautz,et al.  Precomputed radiance transfer for real-time rendering in dynamic, low-frequency lighting environments , 2002 .

[28]  Ashraf A. Kassim,et al.  Registration and partitioning-based compression of 3-D dynamic data , 2003, IEEE Trans. Circuits Syst. Video Technol..

[29]  P. Debevec,et al.  Image-based modeling, rendering, and lighting , 2002, IEEE Computer Graphics and Applications.

[30]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[31]  Jake K. Aggarwal,et al.  TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 2008 .

[32]  P. Hanrahan,et al.  Triple product wavelet integrals for all-frequency relighting , 2004, SIGGRAPH 2004.

[33]  Harry Shum,et al.  Am object-based approach to plenoptic videos , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[34]  Ramesh Raskar,et al.  Image-based visual hulls , 2000, SIGGRAPH.

[35]  Roberto Cipolla,et al.  Reconstruction of sculpture from its profiles with unknown camera positions , 2004, IEEE Transactions on Image Processing.

[36]  Marcus A. Magnor,et al.  Hardware-Accelerated Dynamic Light Field Rendering , 2002, VMV.

[37]  Takeo Kanade,et al.  Visual hull alignment and refinement across time: a 3D reconstruction algorithm combining shape-from-silhouette with stereo , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[38]  M. Landy,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[39]  Takeshi Naemura,et al.  Real-Time Video-Based Modeling and Rendering of 3D Scenes , 2002, IEEE Computer Graphics and Applications.

[40]  Harry Shum,et al.  On object-based compression for a class of dynamic image-based representations , 2005, IEEE International Conference on Image Processing 2005.

[41]  Harry Shum,et al.  Rendering with concentric mosaics , 1999, SIGGRAPH.

[42]  Harry Shum,et al.  Pop-up light field: An interactive image-based modeling and rendering system , 2004, TOGS.

[43]  D. Nistér,et al.  Stereo Matching with Color-Weighted Correlation, Hierarchical Belief Propagation, and Occlusion Handling , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Patrick Pérez,et al.  Object removal by exemplar-based inpainting , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[45]  Takeo Kanade,et al.  Virtualized Reality: Constructing Virtual Worlds from Real Scenes , 1997, IEEE Multim..

[46]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[47]  Ieee Signal Processing Magazine the Implicit Channel a Descriptive Framework for Emotional States , 2022 .

[48]  Laurent Moll,et al.  Efficient image-based methods for rendering soft shadows , 2000, SIGGRAPH.

[49]  Vladimir Kolmogorov,et al.  Computing visual correspondence with occlusions using graph cuts , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[50]  Chen Liang,et al.  Robust Recovery of Shapes with Unknown Topology from the Dual Space , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  S. Osher,et al.  Geometric Level Set Methods in Imaging, Vision, and Graphics , 2011, Springer New York.

[52]  Jian Sun,et al.  Lazy snapping , 2004, SIGGRAPH 2004.

[53]  Baining Guo,et al.  Real-time rendering of plant leaves , 2005, SIGGRAPH 2005.

[54]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.

[55]  Shing-Chow Chan,et al.  Data compression and transmission aspects of panoramic videos , 2005 .

[56]  Hans-Peter Seidel,et al.  Free-viewpoint video of human actors , 2003, ACM Trans. Graph..

[57]  Takeo Kanade,et al.  Markerless human motion transfer , 2004, Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004..

[58]  Harry Shum,et al.  Data compression and transmission aspects of panoramic videos , 2005, IEEE Trans. Circuits Syst. Video Technol..

[59]  Shing-Chow Chan,et al.  An object-based compression system for a class of dynamic image-based representations , 2005, Visual Communications and Image Processing.

[60]  Andrew Gardner,et al.  Performance relighting and reflectance transformation with time-multiplexed illumination , 2005, ACM Trans. Graph..

[61]  Ruigang Yang,et al.  Stereo Matching with Color-Weighted Correlation, Hierarchical Belief Propagation and Occlusion Handling , 2006, CVPR.

[62]  Harry Shum,et al.  Plenoptic sampling , 2000, SIGGRAPH.

[63]  A. Laurentini,et al.  The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[64]  Jed Lengyel,et al.  Compression of time-dependent geometry , 1999, SI3D.

[65]  Harry Shum,et al.  An Object-based Approach to Plenoptic Video Processing , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[66]  Olivier D. Faugeras,et al.  Using Extremal Boundaries for 3-D Object Modeling , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[67]  David Salesin,et al.  A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[68]  Takeo Kanade,et al.  Image-based spatio-temporal modeling and view interpolation of dynamic events , 2005, TOGS.

[69]  Heiko Hirschmüller,et al.  Stereo Vision in Structured Environments by Consistent Semi-Global Matching , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[70]  S. B. Kang,et al.  Survey of image-based representations and compression techniques , 2003, IEEE Trans. Circuits Syst. Video Technol..

[71]  Mark A. Horowitz,et al.  Light field video camera , 2000, IS&T/SPIE Electronic Imaging.