Image-based rendering for scenes with reflections

We present a system for image-based modeling and rendering of real-world scenes containing reflective and glossy surfaces. Previous approaches to image-based rendering assume that the scene can be approximated by 3D proxies that enable view interpolation using traditional back-to-front or z-buffer compositing. In this work, we show how these can be generalized to multiple layers that are combined in an additive fashion to model the reflection and transmission of light that occurs at specular surfaces such as glass and glossy materials. To simplify the analysis and rendering stages, we model the world using piecewise-planar layers combined using both additive and opaque mixing of light. We also introduce novel techniques for estimating multiple depths in the scene and separating the reflection and transmission components into different layers. We then use our system to model and render a variety of real-world scenes with reflections.

[1]  Arie Yeredor,et al.  Blind Separation of Superimposed Shifted Images Using Parameterized Joint Diagonalization , 2008, IEEE Transactions on Image Processing.

[2]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[3]  Michal Irani,et al.  Computing occluding and transparent motions , 1994, International Journal of Computer Vision.

[4]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.

[5]  Shree K. Nayar,et al.  Stereo and Specular Reflection , 1998, International Journal of Computer Vision.

[6]  Kiriakos N. Kutulakos,et al.  Multi-View Scene Capture by Surfel Sampling: From Video Streams to Non-Rigid 3D Motion, Shape and Reflectance , 2002, International Journal of Computer Vision.

[7]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.

[8]  Lance Williams,et al.  View Interpolation for Image Synthesis , 1993, SIGGRAPH.

[9]  Michael Bosse,et al.  Unstructured lumigraph rendering , 2001, SIGGRAPH.

[10]  Michael J. Black,et al.  Skin and bones: multi-layer, locally affine, optical flow and regularization with transparency , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Kiriakos N. Kutulakos,et al.  Multi-view scene capture by surfel sampling: from video streams to non-rigid 3D motion, shape and reflectance , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[12]  Roland Siegwart,et al.  Probabilistic plane fitting in 3D and an application to robotic mapping , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[13]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[14]  Mathieu Desbrun,et al.  Variational shape approximation , 2004, SIGGRAPH 2004.

[15]  Yoav Y. Schechner,et al.  Overcoming visual reverberations , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Shmuel Peleg,et al.  A Three-Frame Algorithm for Estimating Two-Component Image Motion , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Richard Szeliski,et al.  The lumigraph , 1996, SIGGRAPH.

[18]  John Hart,et al.  ACM Transactions on Graphics , 2004, SIGGRAPH 2004.

[19]  Alex Pentland,et al.  Cooperative Robust Estimation Using Layers of Support , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Richard Szeliski,et al.  Manhattan-world stereo , 2009, CVPR.

[21]  Heiko Hirschmüller,et al.  Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Yoav Y. Schechner,et al.  Polarization-based decorrelation of transparent layers: The inclination angle of an invisible surface , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23]  Richard Szeliski,et al.  Extracting layers and analyzing their specular properties using epipolar-plane-image analysis , 2005, Comput. Vis. Image Underst..

[24]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[25]  Kenji Mase,et al.  Unified computational theory for motion transparency and motion boundaries based on eigenenergy analysis , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[26]  Maneesh Agrawala,et al.  Using Photographs to Enhance Videos of a Static Scene , 2007, Rendering Techniques.

[27]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[28]  Horst Bischof,et al.  Fusion of Feature- and Area-Based Information for Urban Buildings Modeling from Aerial Imagery , 2008, ECCV.

[29]  Richard Szeliski,et al.  Piecewise planar stereo for image-based rendering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[31]  Richard Szeliski,et al.  Layer extraction from multiple images containing reflections and transparency , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[32]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Voicu Popescu,et al.  Reflected‐Scene Impostors for Realistic Reflections at Interactive Rates , 2006, Comput. Graph. Forum.

[34]  Richard Szeliski,et al.  Stereo matching with linear superposition of layers , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Jan-Michael Frahm,et al.  Piecewise planar and non-planar stereo for urban scene reconstruction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[36]  Assaf Zomet,et al.  Learning to Perceive Transparency from the Statistics of Natural Scenes , 2002, NIPS.