Ground Plane Estimation Using a Hidden Markov Model

We focus on the problem of estimating the ground plane orientation and location in monocular video sequences from a moving observer. Our only assumptions are that the 3D ego motion t and the ground plane normal n are orthogonal, and that n and t are smooth over time. We formulate the problem as a state-continuous Hidden Markov Model (HMM) where the hidden state contains t and n and may be estimated by sampling and decomposing homographies. We show that using blocked Gibbs sampling, we can infer the hidden state with high robustness towards outliers, drifting trajectories, rolling shutter and an imprecise intrinsic calibration. Since our approach does not need any initial orientation prior, it works for arbitrary camera orientations in which the ground is visible.

[1]  David A. Forsyth,et al.  Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry , 2010, ECCV.

[2]  Andrea Fusiello,et al.  Robust Multiple Structures Estimation with J-Linkage , 2008, ECCV.

[3]  Karl Iagnemma,et al.  Ground plane identification using LIDAR in forested environments , 2010, 2010 IEEE International Conference on Robotics and Automation.

[4]  Robert E. Mahony,et al.  Robust estimation of planar surfaces using spatio-temporal RANSAC for applications in autonomous vehicle navigation , 2012, Robotics Auton. Syst..

[5]  S. Shankar Sastry,et al.  An Invitation to 3-D Vision , 2004 .

[6]  Daniel Maier,et al.  Appearance-based traversability classification in monocular images using iterative ground plane estimation , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Zihan Zhou,et al.  Robust plane-based structure from motion , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Gérard G. Medioni,et al.  3D Reconstruction of Background and Objects Moving on Ground Plane Viewed from a Moving Camera , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Davide Scaramuzza,et al.  1-Point-RANSAC Structure from Motion for Vehicle-Mounted Cameras by Exploiting Non-holonomic Constraints , 2011, International Journal of Computer Vision.

[10]  Kostas Daniilidis,et al.  Monocular visual odometry in urban environments using an omnidirectional camera , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[12]  Vladimir Kolmogorov,et al.  Optimizing Binary MRFs via Extended Roof Duality , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, CVPR.

[14]  Yann LeCun,et al.  Road Scene Segmentation from a Single Image , 2012, ECCV.

[15]  Bodo Rosenhahn,et al.  Multi-scale Clustering of Frame-to-Frame Correspondences for Motion Segmentation , 2012, ECCV.

[16]  Julius Ziegler,et al.  StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[17]  Jana Kosecka,et al.  Piecewise planar city 3D modeling from street view panoramic sequences , 2009, CVPR.

[18]  Uwe Franke,et al.  Efficient representation of traffic scenes by means of dynamic stixels , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[19]  Michael Brady,et al.  Ground plane estimation, error analysis and applications , 2002, Robotics Auton. Syst..

[20]  Mubarak Shah,et al.  A Multiview Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint , 2006, ECCV.

[21]  J. M. M. Montiel,et al.  Indoor robot motion based on monocular images , 2001, Robotica.