On Creating Depth Maps from Monoscopic Video using Structure from Motion

The depth-image-based rendering technique is a promising technology for three-dimensional television (3D-TV) systems. For such a system, one of the key components is to generate a high-quality per-pixel depth map, particularly for already existing 2D video sequences. This paper proposes a framework for creating the depth map from uncalibrated video sequences of static scenes using the Structure ?From Motion (SFM) technique. This paper describes the architecture and the main components of the proposed framework. The initial experimental results show that SFM can be an effective way for creating the depth map, or it can be used to refine the depth map created by other methods, for example, the Depth From Cues (DFC) technique.

[1]  Mei Han,et al.  A perspective factorization method for Euclidean reconstruction with uncalibrated cameras , 2002, Comput. Animat. Virtual Worlds.

[2]  Bill Triggs,et al.  Factorization methods for projective structure and motion , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Yong Ho Hwang,et al.  Frame grouping measure for factorization-based projective reconstruction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[4]  Wenxin Wang,et al.  Evaluation of image corner detectors for hardware implementation , 2004, Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513).

[5]  C. Fehn A 3D-TV system based on video plus depth information , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[6]  Qian Chen,et al.  Efficient iterative solution to M-view projective reconstruction problem , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[7]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[8]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[9]  Andrew W. Fitzgibbon,et al.  The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences , 1999, International Journal of Computer Vision.