Spatiotemporal Inpainting for Recovering Texture Maps of Occluded Building Facades

We present a technique for constructing a ldquocleanrdquo texture map of a partially occluded building facade from a series of images taken from a moving camera. Building regions blocked by trees, signs, people, and other foreground objects in a minority of views can be recovered via temporal median filtering on a registered image mosaic of the planar facade. However, when such areas are occluded in the majority of camera views, appearance information from other visible portions of the facade provides a critical cue to correctly complete the mosaic. In this paper, we apply a robust measure of spread to infer whether a particular mosaic pixel is occluded in a majority of views, and introduce a novel spatiotemporal timeline-based inpainting algorithm that uses appearance and motion cues in order to fill the texture map in majority-occluded regions. We describe methods for automatically training appearance-based classifiers from a coarse motion-based segmentation to efficiently recognize foreground and background patches in static imagery. Results of recovered building facades are shown for various sequences.

[1]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[2]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[3]  Marc Levoy,et al.  Lightfield completion , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[4]  Harry Shum,et al.  Image completion with structure propagation , 2005, ACM Trans. Graph..

[5]  Emanuele Trucco,et al.  Layered Representation of a Video Shot with Mosaicing , 2002, Pattern Analysis & Applications.

[6]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[7]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[9]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Andrew Zisserman,et al.  A Statistical Approach to Texture Classification from Single Images , 2004, International Journal of Computer Vision.

[11]  Richard Szeliski,et al.  Video mosaics for virtual environments , 1996, IEEE Computer Graphics and Applications.

[12]  Mubarak Shah,et al.  Motion layer extraction in the presence of occlusion using graph cuts , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Nipun Kwatra,et al.  Texture optimization for example-based synthesis , 2005, ACM Trans. Graph..

[14]  Truong Q. Nguyen,et al.  Spatio-temporal texture synthesis and image inpainting for video applications , 2005, IEEE International Conference on Image Processing 2005.

[15]  Tony F. Chan,et al.  Image processing and analysis - variational, PDE, wavelet, and stochastic methods , 2005 .

[16]  Jitendra Malik,et al.  Recognizing surfaces using three-dimensional textons , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[17]  David G. Stork,et al.  Pattern Classification , 1973 .

[18]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Anil C. Kokaram,et al.  A Bayesian framework for recursive object removal in movie post-production , 2003, ICIP.

[20]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[21]  James Davis,et al.  Mosaics of scenes with moving objects , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[22]  Nikos Komodakis,et al.  Image Completion Using Global Optimization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[24]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[25]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[26]  Alexei A. Efros,et al.  Image quilting for texture synthesis and transfer , 2001, SIGGRAPH.

[27]  Gregory D. Hager,et al.  A two level approach for scene recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  Baining Guo,et al.  Real-time texture synthesis by patch-based sampling , 2001, TOGS.

[29]  Kristin J. Dana,et al.  Real-time scene stabilization and mosaic construction , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[30]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[31]  Harry Shum,et al.  Background Cut , 2006, ECCV.

[32]  Michael Ashikhmin,et al.  Synthesizing natural textures , 2001, I3D '01.

[33]  Andrew Zisserman,et al.  Computer vision applied to super resolution , 2003, IEEE Signal Process. Mag..

[34]  Christopher Rasmussen,et al.  Improving Spatiotemporal Inpainting with Layer Appearance Models , 2006, ISVC.

[35]  Daniel Cohen-Or,et al.  Fragment-based image completion , 2003, ACM Trans. Graph..

[36]  Guillermo Sapiro,et al.  Video inpainting of occluding and occluded objects , 2005, IEEE International Conference on Image Processing 2005.

[37]  Marc Pollefeys,et al.  Radiometric alignment of image sequences , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[38]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[39]  Wolfgang Effelsberg,et al.  Robust background estimation for complex video sequences , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[40]  Tai-Pang Wu,et al.  Video repairing: inference of foreground and background under severe occlusion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[41]  Christopher Rasmussen,et al.  Spatiotemporal inpainting for recovering texture maps of partially occluded building facades , 2005, IEEE International Conference on Image Processing 2005.

[42]  Emanuele Trucco,et al.  Making good features track better , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[44]  Eli Shechtman,et al.  Space-time video completion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[45]  Andrew W. Fitzgibbon,et al.  Bayesian Estimation of Layers from Multiple Images , 2002, ECCV.

[46]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[47]  Emanuele Trucco,et al.  Improving Feature Tracking with Robust Statistics , 1999, Pattern Analysis & Applications.

[48]  Michael Bosse,et al.  Calibrated, Registered Images of an Extended Urban Area , 2003, International Journal of Computer Vision.

[49]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[50]  Christopher Rasmussen,et al.  PCA-Based Recognition for Efficient Inpainting , 2006, ACCV.

[51]  F. A. Heuvel Automation in Architectural Photogrammetry: Line-Photogrammetry for the Reconstruction from Single and Multiple Images , 2003 .

[52]  Andrew Blake,et al.  PatchWorks: Example-Based Region Tiling for Image Editing , 2004 .