论文信息 - Towards 3-D scene reconstruction from broadcast video

Towards 3-D scene reconstruction from broadcast video

Three-dimensional (3-D) scene reconstruction from broadcast video is a challenging problem with many potential applications, such as 3-D TV, free-view TV, augmented reality or three-dimensionalization of two-dimensional (2-D) media archives. In this paper, a flexible and effective system capable of efficiently reconstructing 3-D scenes from broadcast video is proposed, with the assumption that there is relative motion between camera and scene/objects. The system requires no a priori information and input, other than the video sequence itself, and capable of estimating the internal and external camera parameters and performing a 3-D motion-based segmentation, as well as computing a dense depth field. The system also serves as a showcase to present some novel approaches for moving object segmentation, sparse and dense reconstruction problems. According to the simulations for both synthetic and real data, the system achieves a promising performance for typical TV content, indicating that it is a significant step towards the 3-D reconstruction of scenes from broadcast video.

[1] Yair Weiss,et al. Interpreting Images by Propagating Bayesian Beliefs , 1996, NIPS.

[2] O. Faugeras,et al. Variational principles, surface evolution, PDE's, level set methods and the stereo problem , 1998, 5th IEEE EMBS International Summer School on Biomedical Imaging, 2002..

[3] Andrew W. Fitzgibbon,et al. The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences , 1999, International Journal of Computer Vision.

[4] Andrew Zisserman,et al. Multiple View Geometry , 1999 .

[5] P. Anandan,et al. A Unified Approach to Moving Object Detection in 2D and 3D Scenes , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6] Yair Weiss,et al. Segmentation using eigenvectors: a unifying view , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7] S. Bougnoux,et al. From projective to Euclidean space under any practical situation, a criticism of self-calibration , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[8] Olivier D. Faugeras,et al. Variational principles, surface evolution, PDEs, level set methods, and the stereo problem , 1998, IEEE Trans. Image Process..

[9] R. Hartley. Triangulation, Computer Vision and Image Understanding , 1997 .

[10] S. B. Kang,et al. An Active Multibaseline Stereo System with Real-Time Image Acquisition , 1994 .

[11] A. Murat Tekalp,et al. Digital Video Processing , 1995 .

[12] O. D. Faugeras,et al. Camera Self-Calibration: Theory and Experiments , 1992, ECCV.

[13] Takeo Kanade,et al. A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[14] O. Faugeras,et al. Camera Self-Calibration from Video Sequences: the Kruppa Equations Revisited , 1996 .

[15] Richard Szeliski,et al. Extracting View-Dependent Depth Maps from a Collection of Images , 2004, International Journal of Computer Vision.

[16] C Tomasi,et al. Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[17] Richard Szeliski,et al. Vision Algorithms: Theory and Practice , 2002, Lecture Notes in Computer Science.

[18] Richard I. Hartley,et al. Estimation of Relative Camera Positions for Uncalibrated Cameras , 1992, ECCV.

[19] Rachid Deriche,et al. A Robust Technique for Matching two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry , 1995, Artif. Intell..

[20] Vladimir Kolmogorov,et al. Computing visual correspondence with occlusions using graph cuts , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[21] Takeo Kanade,et al. A Multibody Factorization Method for Independently Moving Objects , 1998, International Journal of Computer Vision.

[22] Reinhard Koch,et al. A simple and efficient rectification method for general motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23] Zhengyou Zhang,et al. A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24] P. Torr. Geometric motion segmentation and model selection , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[25] Richard Szeliski,et al. A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[26] Nanning Zheng,et al. Stereo Matching Using Belief Propagation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27] Jangheon Kim,et al. Gaussian scale-space dense disparity estimation with anisotropic disparity-field diffusion , 2005, Fifth International Conference on 3-D Digital Imaging and Modeling (3DIM'05).

[28] Pietro Perona,et al. Reducing "Structure From Motion": A General Framework for Dynamic Vision Part 1: Modeling , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[29] Rama Chellappa,et al. Bayesian algorithms for simultaneous structure from motion estimation of multiple independently moving objects , 2005, IEEE Transactions on Image Processing.

[30] Reinhard Koch,et al. Automated reconstruction of 3D scenes from sequences of images , 2000 .

[31] Andrew W. Fitzgibbon,et al. Maintaining multiple motion model hypotheses over many views to recover matching and structure , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[32] D. Nistér. Automatic passive recovery of 3D from images and video , 2004 .

[33] Pedro F. Felzenszwalb,et al. Efficient belief propagation for early vision , 2004, CVPR 2004.

[34] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[35] Stan Z. Li,et al. Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[36] Maarten Vergauwen,et al. A Hierarchical Symmetric Stereo Algorithm Using Dynamic Programming , 2002, International Journal of Computer Vision.

[37] Y. Weiss. Belief Propagation and Revision in Networks with Loops , 1997 .

[38] Quang-Tuan Luong,et al. Self-Calibration of a Moving Camera from Point Correspondences and Fundamental Matrices , 1997, International Journal of Computer Vision.

[39] Richard Szeliski,et al. Stereo Matching with Transparency and Matting , 1999, International Journal of Computer Vision.

[40] Reinhard Koch,et al. Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[41] Andrew W. Fitzgibbon,et al. Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects , 2000, ECCV.

[42] Peter Sturm,et al. On focal length calibration from two views , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[43] Andrew Zisserman,et al. Multiple view geometry in computer visiond , 2001 .

[44] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[45] Judea Pearl,et al. Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[46] Takeo Kanade,et al. Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[47] Richard Szeliski,et al. Bayesian modeling of uncertainty in low-level vision , 2011, International Journal of Computer Vision.

[48] A. Verri,et al. A compact algorithm for rectification of stereo pairs , 2000 .

[49] Luc Van Gool,et al. A stratified approach to metric self-calibration , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[50] Thorsten Thormählen,et al. Keyframe Selection for Camera Motion and Structure Estimation from Multiple Views , 2004, ECCV.

[51] Luc Van Gool,et al. PDE-based multi-view depth estimation , 2002, Proceedings. First International Symposium on 3D Data Processing Visualization and Transmission.