Paper Information - Robust Extrinsic Calibration of Multiple RGB-D Cameras with Body Tracking and Feature Matching

Robust Extrinsic Calibration of Multiple RGB-D Cameras with Body Tracking and Feature Matching

RGB-D cameras have been commercialized, and many applications using them have been proposed. In this paper, we propose a robust registration method for multiple RGB-D cameras. We use the human body tracking system provided by the Azure Kinect SDK to estimate a coarse global registration between cameras. Because this coarse global registration contains some error, we refine it using feature matching. However, the matched feature pairs include mismatches, which degrade the refinement. Therefore, we propose a registration refinement procedure that removes these mismatches while using the global registration. In an experiment, the ratio of inliers among the matched features is greater than 95% for all tested feature matchers. Thus, we experimentally confirm that the proposed method can eliminate mismatches even in difficult situations and that it yields a more precise global registration of RGB-D cameras.
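The coarse-to-fine idea in the abstract can be illustrated with a minimal sketch on synthetic data. This is not the authors' implementation: it assumes that tracked body joints and matched features are already available as corresponding 3D points in each camera's frame, it uses an SVD-based least-squares rigid fit as a stand-in for the coarse registration step, and it assumes mismatches are rejected by thresholding their 3D residual under the coarse transform. The function names `estimate_rigid_transform` and `filter_matches`, the noise levels, and the `max_dist` threshold are all hypothetical choices made for illustration.

```python
import numpy as np


def estimate_rigid_transform(src, dst):
    """Least-squares rotation R and translation t with dst ~= src @ R.T + t."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_mean = src.mean(axis=0)
    dst_mean = dst.mean(axis=0)
    # Cross-covariance of the centred point sets.
    H = (src - src_mean).T @ (dst - dst_mean)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    d = -1.0 if np.linalg.det(Vt.T @ U.T) < 0 else 1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_mean - R @ src_mean
    return R, t


def filter_matches(R, t, pts_a, pts_b, max_dist):
    """Boolean mask of matches whose 3D residual under (R, t) is below max_dist."""
    pts_a = np.asarray(pts_a, dtype=float)
    pts_b = np.asarray(pts_b, dtype=float)
    residual = np.linalg.norm(pts_a @ R.T + t - pts_b, axis=1)
    return residual < max_dist


# Synthetic ground truth: camera B is rotated 30 degrees about z and shifted.
rng = np.random.default_rng(0)
angle = np.deg2rad(30.0)
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle),  np.cos(angle), 0.0],
                   [0.0,            0.0,           1.0]])
t_true = np.array([0.5, -0.2, 1.0])

# Step 1 (coarse): noisy body joints seen by both cameras.
joints_a = rng.uniform(-1.0, 1.0, size=(32, 3))
joints_b = joints_a @ R_true.T + t_true + rng.normal(0.0, 0.02, size=(32, 3))
R_coarse, t_coarse = estimate_rigid_transform(joints_a, joints_b)

# Step 2: matched 3D features, the first 20 of which are simulated mismatches.
feat_a = rng.uniform(-2.0, 2.0, size=(100, 3))
feat_b = feat_a @ R_true.T + t_true
feat_b[:20] += rng.uniform(0.5, 1.0, size=(20, 3))

# Step 3 (refine): keep matches consistent with the coarse estimate, then refit.
inliers = filter_matches(R_coarse, t_coarse, feat_a, feat_b, max_dist=0.2)
R_fine, t_fine = estimate_rigid_transform(feat_a[inliers], feat_b[inliers])
print("kept matches:", int(inliers.sum()), "of", len(inliers))
```

In this sketch the threshold must be looser than the error of the coarse estimate and tighter than the displacement of a typical mismatch; with real data that margin would have to be chosen empirically.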

Minsik Park | Jisang Yoo | Soonchul Kwon | Sang-Ha Lee | Jinwoong Kim

[1] Shahram Izadi,et al. Motion2fusion , 2017, ACM Trans. Graph..

[2] Vladlen Koltun,et al. Fast Global Registration , 2016, ECCV.

[3] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[4] Yaser Sheikh,et al. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Pushmeet Kohli,et al. Fusion4D , 2016, ACM Trans. Graph..

[6] Qionghai Dai,et al. 3D Pose Detection of Closely Interactive Humans Using Multi-View Cameras , 2019, Sensors.

[7] Tom Drummond,et al. Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[8] Vincent Lepetit,et al. BRIEF: Computing a Local Binary Descriptor Very Fast , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Torsten Sattler,et al. D2-Net: A Trainable CNN for Joint Description and Detection of Local Features , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Andrew W. Fitzgibbon,et al. KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[12] Gary R. Bradski,et al. ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[13] S. Umeyama,et al. Least-Squares Estimation of Transformation Parameters Between Two Point Patterns , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[14] David G. Lowe,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[15] Roland Siegwart,et al. BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[16] Iasonas Kokkinos,et al. DensePose: Dense Human Pose Estimation in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17] Tomasz Malisiewicz,et al. SuperPoint: Self-Supervised Interest Point Detection and Description , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[18] Jiaolong Yang,et al. Go-ICP: A Globally Optimal Solution to 3D ICP Point-Set Registration , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Luc Van Gool,et al. Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[20] Zoltan-Csaba Marton,et al. Tutorial: Point Cloud Library: Three-Dimensional Object Recognition and 6 DOF Pose Estimation , 2012, IEEE Robotics & Automation Magazine.

[21] Ali Kashif Bashir,et al. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2013, ICIRA 2013.

[22] Vincent Lepetit,et al. TILDE: A Temporally Invariant Learned DEtector , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Michael J. Black,et al. On the unification of line processes, outlier rejection, and robust statistics with applications in early vision , 1996, International Journal of Computer Vision.

[24] Youngbae Hwang,et al. Iterative K-Closest Point Algorithms for Colored Point Cloud Registration , 2020, Sensors.

[25] Vladlen Koltun,et al. Colored Point Cloud Registration Revisited , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[26] Juan D. Tardós,et al. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[27] Tom Drummond,et al. Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28] Luca Carlone,et al. Modeling Perceptual Aliasing in SLAM via Discrete–Continuous Graphical Models , 2018, IEEE Robotics and Automation Letters.

[29] Kurt Konolige,et al. CenSurE: Center Surround Extremas for Realtime Feature Detection and Matching , 2008, ECCV.

[30] Vincent Lepetit,et al. LIFT: Learned Invariant Feature Transform , 2016, ECCV.

[31] Ju Shen,et al. A Fast and Robust Extrinsic Calibration for RGB-D Camera Networks , 2018, Sensors.

[32] Youngbae Hwang,et al. Multi-Cue-Based Circle Detection and Its Application to Robust Extrinsic Calibration of RGB-D Cameras , 2019, Sensors.

[33] Francesco G. B. De Natale,et al. Fast automatic camera network calibration through human mesh recovery , 2020, Journal of Real-Time Image Processing.

[34] Hammam A. Alshazly,et al. Image Features Detection, Description and Matching , 2016 .

[35] Richard Elvira,et al. ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM , 2021, IEEE Transactions on Robotics.

[36] Tim D. Barfoot,et al. At all Costs: A Comparison of Robust Cost Functions for Camera Correspondence Outliers , 2015, 2015 12th Conference on Computer and Robot Vision.

[37] J. M. M. Montiel,et al. ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.