Homography-Based 3D Scene Analysis of Video Sequences *

Abstract We propose a framework to recover projectivedepth based on image homography and discuss itsapplication to scene analysis of video sequences.We describe a robust homography algorithmwhich incorporates contrast/brightness adjustmentand robust estimation into image registration. Wepresent a camera motion solver to obtain the ego-motion and the real/virtual plane position fromhomography. We then apply the Levenburg-Marquardt method to generate a dense projectivedepth map. We also discuss temporal integrationover video sequences. Finally we present theresults of applying the homography-based videoanalysis to motion detection. 1 Introduction Temporal information redundancy of videosequences allows us to use efficient, incrementalmethods which perform temporal integration ofinformation for gradual refinement.Approaches handling 3D scene analysis of videosequences with camera motion can be classifiedinto two categories: algorithms which use 2Dtransformation or model fitting, and algorithmswhich use 3D geometry analysis. Video sequencesof our interest are taken from a moving airborneplatform where the ego-motion is complex and thescene is relatively distant but not necessarily flat;

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  P. Anandan,et al.  A unified approach to moving object detection in 2D and 3D scenes , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[3]  P. Anandan,et al.  Parallax Geometry of Pairs of Points for 3D Scene Analysis , 1996, ECCV.

[4]  Michal Irani,et al.  Recovery of ego-motion using image stabilization , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Reinhard Koch,et al.  Self-calibration and metric reconstruction in spite of varying and unknown internal camera parameters , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[6]  B. Triggs,et al.  Projective Geometry for Image Analysis , 1996 .

[7]  Richard Szeliski,et al.  Video mosaics for virtual environments , 1996, IEEE Computer Graphics and Applications.

[8]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and texture-mapped models , 1997, International Conference on Computer Graphics and Interactive Techniques.

[9]  Michael Werman,et al.  Trilinearity of three perspective views and its associated tensor , 1995, Proceedings of IEEE International Conference on Computer Vision.

[10]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[11]  Michal Irani,et al.  Video indexing based on mosaic representations , 1998, Proc. IEEE.

[12]  Richard Szeliski,et al.  A layered approach to stereo reconstruction , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[13]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[14]  Steven A. Shafer,et al.  Dense Structure from a Dense Optical Flow Sequence , 1998, Comput. Vis. Image Underst..

[15]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.