Robust Height Estimation of Moving Objects From Uncalibrated Videos

This paper presents an approach to video metrology. From videos acquired by an uncalibrated stationary camera, we first recover the vanishing line of the ground plane and the vertical vanishing point of the scene by tracking moving objects that lie primarily on the ground plane. Using geometric properties of the moving objects, we construct a probabilistic model that simultaneously groups trajectories and estimates vanishing points. We then apply a single view mensuration algorithm to each frame to obtain height measurements. Finally, we fuse the multiframe measurements using the least median of squares (LMedS) as a robust cost function, minimized with the Robbins-Monro stochastic approximation (RMSA) technique. The method requires less human supervision and offers more flexibility and improved robustness. From the uncertainty analysis, we conclude that the method with auto-calibration is robust in practice. Results are shown on realistic tracking data from a variety of scenes.
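The two measurement steps named in the abstract, per-frame single view mensuration followed by robust multiframe fusion, can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes the vanishing line `vline` and the vertical vanishing point `vpoint` are already known, that a reference object of known height `ref_height` is visible so that absolute scale can be fixed, and that each object is given by homogeneous image coordinates of its base and top. The per-frame measure is the standard single view metrology ratio ||b x t|| / (|l . b| ||v x t||), which is proportional to height. For fusion, the sketch replaces RMSA with a plain search over the measurements themselves as candidates, keeping only the LMedS cost. All function names are hypothetical.

```python
import math
import statistics

def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Dot product of two vectors of equal length."""
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    """Euclidean norm of a vector."""
    return math.sqrt(dot(a, a))

def height_metric(base, top, vline, vpoint):
    """Quantity proportional to object height; inputs are homogeneous 3-vectors.

    The absolute value makes the result independent of the sign and scale
    chosen for the homogeneous image points.
    """
    return norm(cross(base, top)) / (
        abs(dot(vline, base)) * norm(cross(vpoint, top)))

def height_from_reference(base, top, ref_base, ref_top, ref_height,
                          vline, vpoint):
    """Height of one object in one frame, scaled by a reference of known height."""
    ratio = (height_metric(base, top, vline, vpoint)
             / height_metric(ref_base, ref_top, vline, vpoint))
    return ref_height * ratio

def lmeds_cost(h, measurements):
    """Median of squared residuals between candidate height h and the data."""
    return statistics.median((m - h) ** 2 for m in measurements)

def fuse_heights_lmeds(measurements):
    """Return the measurement with the lowest LMedS cost.

    Assumption: candidate search stands in for the RMSA minimization.
    """
    if not measurements:
        raise ValueError("need at least one measurement")
    return min(measurements, key=lambda h: lmeds_cost(h, measurements))
```

In use, `height_from_reference` would be called once per frame for a tracked object, and the resulting list passed to `fuse_heights_lmeds`. Because the cost is a median rather than a mean, a minority of grossly wrong frames (for example, a partly occluded object) has little influence on the fused estimate.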
