Stereo Ground Truth with Error Bars

Creating stereo ground truth based on real images is a measurement task. Measurements are never perfectly accurate: the depth at each pixel follows an error distribution. A common way to express the quality of measurements is to report error bars. In this paper we describe a methodology for adding error bars to images of previously scanned static scenes. The main challenge for stereo ground truth error estimates based on such data is the nonlinear matching of 2D images to 3D points. Our method uses 2D feature quality, 3D point accuracy, and calibration accuracy, as well as the covariance matrices of bundle adjustments. We sample the reference data error, which is the 3D depth distribution of each point projected into 3D image space. The disparity distribution at each pixel location is then estimated by projecting samples of the reference data error onto the 2D image plane. An analytical Gaussian error propagation is used to validate the results. As a proof of concept, we created ground truth for an image sequence of 100 frames. Results show that disparity accuracies well below one pixel can be achieved, albeit with much larger errors at depth discontinuities, mainly caused by uncertain estimates of the camera location.
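The idea of checking a sampled disparity error against an analytical Gaussian propagation can be illustrated with a minimal sketch. It is not the paper's pipeline: it assumes an ideal rectified stereo pair with disparity d = f * B / Z, independent Gaussian errors on the scanned point depth and on the camera position along the optical axis, and it ignores 2D feature quality, calibration accuracy, and bundle adjustment covariances. All function names and numeric values below are hypothetical and chosen only for illustration.

```python
import math
import random
import statistics


def disparity(depth, focal_px, baseline_m):
    """Disparity in pixels of a point at `depth` metres (rectified pair assumed)."""
    return focal_px * baseline_m / depth


def sample_disparities(depth, sigma_point, sigma_cam, focal_px, baseline_m,
                       n_samples=20000, seed=0):
    """Monte Carlo: perturb point depth and camera position, collect disparities."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n_samples):
        z_point = rng.gauss(depth, sigma_point)  # assumed reference data (scan) error
        z_cam = rng.gauss(0.0, sigma_cam)        # assumed camera location error
        samples.append(disparity(z_point - z_cam, focal_px, baseline_m))
    return samples


def propagated_sigma(depth, sigma_point, sigma_cam, focal_px, baseline_m):
    """First-order Gaussian propagation: sigma_d = |dd/dZ| * sigma_Z."""
    sigma_depth = math.hypot(sigma_point, sigma_cam)  # independent errors add in variance
    return focal_px * baseline_m / depth ** 2 * sigma_depth


# Hypothetical values, for illustration only.
F_PX, BASELINE_M, DEPTH_M = 1000.0, 0.3, 10.0
SIGMA_POINT_M, SIGMA_CAM_M = 0.01, 0.05

samples = sample_disparities(DEPTH_M, SIGMA_POINT_M, SIGMA_CAM_M, F_PX, BASELINE_M)
mc_mean = statistics.fmean(samples)
mc_sigma = statistics.stdev(samples)
gauss_sigma = propagated_sigma(DEPTH_M, SIGMA_POINT_M, SIGMA_CAM_M, F_PX, BASELINE_M)

print(f"disparity:            {mc_mean:.3f} px")
print(f"sampled error bar:    {mc_sigma:.4f} px")
print(f"propagated error bar: {gauss_sigma:.4f} px")
```

In this toy setting the camera position error is five times the point error and therefore dominates the disparity error bar. Sampling is the more general of the two estimates, since it still applies where the projection is strongly nonlinear and a first-order propagation becomes a poor approximation.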
