论文信息 - Computing the stereo matching cost with a convolutional neural network

Computing the stereo matching cost with a convolutional neural network

We present a method for extracting depth information from a rectified image pair. We train a convolutional neural network to predict how well two image patches match and use it to compute the stereo matching cost. The cost is refined by cross-based cost aggregation and semiglobal matching, followed by a left-right consistency check to eliminate errors in the occluded regions. Our stereo method achieves an error rate of 2.61% on the KITTI stereo dataset and is currently (August 2014) the top performing method on this dataset.

Yann LeCun | Jure Zbontar | Yann LeCun | Jure Zbontar

[1] Gauthier Lafruit,et al. Cross-Based Local Stereo Matching Using Orthogonal Integral Images , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[2] Olga Veksler,et al. Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3] Tamir Hazan,et al. Continuous Markov Random Fields for Robust Stereo Estimation , 2012, ECCV.

[4] VekslerOlga,et al. Fast Approximate Energy Minimization via Graph Cuts , 2001 .

[5] Raquel Urtasun,et al. Robust Monocular Epipolar Flow Estimation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6] Rahul Nair,et al. Ensemble Learning for Confidence Measures in Stereo Vision , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7] Nikos Komodakis,et al. Learning to Detect Ground Control Points for Improving the Accuracy of Stereo Matching , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Xing Mei,et al. On building an accurate stereo matching system on graphics hardware , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[9] Hai Tao,et al. A method for learning matching errors for stereo computation , 2004, BMVC.

[10] Heiko Hirschmüller,et al. Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[11] Hai Tao,et al. Stereo Matching via Learning Multiple Experts Behaviors , 2006, BMVC.

[12] D. Scharstein,et al. A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[13] Konrad Schindler,et al. View-Consistent 3D Scene Flow Estimation over Multiple Frames , 2014, ECCV.

[14] Clément Farabet,et al. Torch7: A Matlab-like Environment for Machine Learning , 2011, NIPS 2011.

[15] Li Zhang,et al. Estimating Optimal Parameters for MRF Stereo from a Single Image Pair , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16] Christopher Joseph Pal,et al. Learning Conditional Random Fields for Stereo , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[19] Konrad Schindler,et al. Piecewise Rigid Scene Flow , 2013, 2013 IEEE International Conference on Computer Vision.

[20] Raquel Urtasun,et al. Efficient Joint Segmentation, Occlusion Labeling, Stereo and Flow Estimation , 2014, ECCV.

[21] HirschmullerHeiko. Stereo Processing by Semiglobal Matching and Mutual Information , 2008 .

[22] Daniel P. Huttenlocher,et al. Learning for stereo vision using the structured support vector machine , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[23] Andreas Geiger,et al. Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[24] Heiko Hirschmüller,et al. Evaluation of Stereo Matching Costs on Images with Radiometric Differences , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Atsuto Maki,et al. Towards a simulation driven stereo vision system , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).