A Closed Form Solution for the Self-Calibration of Heterogeneous Sensors

We present a novel closed-form solution for the joint self-calibration of video and range sensors. The approach single assumption is the availability of synchronous time of flight (i.e., range distances) measurements and visual position of the target on images acquired by a set of cameras. In such case, we make explicit a rank constraint that is valid for both image and range data. This rank property is used to find an initial and affine solution via bilinear factorization, which is then corrected by enforcing the metric constraints characteristic for both sensor modalities (i.e., camera and anchors constraints). The output of the algorithm is the identification of the target/range sensor position and the calibration of the cameras. The application extent of our approach is broad and versatile. In fact, with the same framework, we can deal with, but not restricted to, two very different applications. The first is aimed at calibrating cameras and microphones deployed in an unknown environment. The second uses a RGB-D device to reconstruct the 3D position of a set of keypoints using the camera and depth map images. Synthetic and real tests show the algorithm performance under different levels of noise and configurations of target locations, number of sensors and cameras.

[1]  Riad I. Hammoud,et al.  Robust Multi-Pedestrian Tracking in Thermal-Visible Surveillance Videos , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[2]  Frédéric Lerasle,et al.  Vision and RFID data fusion for tracking people in crowds by a mobile robot , 2010, Comput. Vis. Image Underst..

[3]  Claudia Raibulet,et al.  Risk Management through Real-Time Wearable Services , 2009, 2009 Fourth International Conference on Software Engineering Advances.

[4]  Rita Cucchiara,et al.  Mutual calibration of camera motes and RFIDs for people localization and identification , 2010, ICDSC '10.

[5]  Marc Pollefeys,et al.  Direct computation of sound and microphone locations from time-difference-of-arrival data , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Adrien Bartoli,et al.  Towards gauge invariant bundle adjustment: a solution based on gauge dependent damping , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Christopher Taylor,et al.  Simultaneous localization, calibration, and tracking in an ad hoc sensor network , 2006, IPSN.

[8]  Alessio Del Bue,et al.  A Bilinear Approach to the Position Self-Calibration of Multiple Sensors , 2012, IEEE Transactions on Signal Processing.

[9]  Pierre Vandergheynst,et al.  Learning Bimodal Structure in Audio–Visual Data , 2009, IEEE Transactions on Neural Networks.

[10]  K. Kanatani,et al.  Factorization without Factorization: Complete Recipe , 2004 .

[11]  Kenichi Kanatani,et al.  Gauges and gauge transformations for uncertainty description of geometric structure with indeterminacy , 2001, IEEE Trans. Inf. Theory.

[12]  Andreas Terzis,et al.  Multi-Modal Calibration of Surveillance Sensor Networks , 2006, MILCOM 2006 - 2006 IEEE Military Communications conference.

[13]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, ECCV.

[14]  Andreas Geiger,et al.  Automatic camera and range sensor calibration using a single shot , 2012, 2012 IEEE International Conference on Robotics and Automation.

[15]  Kenichi Kanatani,et al.  Uncalibrated Factorization Using a Variable Symmetric Affine Camera , 2007, IEICE Trans. Inf. Syst..

[16]  Matthew Brand,et al.  A direct method for 3D factorization of nonrigid motion observed in 2D , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[18]  João M. F. Xavier,et al.  Robust Localization of Nodes and Time-Recursive Tracking in Sensor Networks Using Noisy Range Measurements , 2011, IEEE Transactions on Signal Processing.

[19]  Peter F. Sturm,et al.  A Factorization Based Algorithm for Multi-Image Projective Structure and Motion , 1996, ECCV.

[20]  Radu Horaud,et al.  Finding audio-visual events in informal social gatherings , 2011, ICMI '11.

[21]  Matt Welsh,et al.  CodeBlue: An Ad Hoc Sensor Network Infrastructure for Emergency Medical Care , 2004 .

[22]  M. Pollefeys,et al.  A Unified Approach to Calibrate a Network of Camcorders and ToF cameras , 2008 .

[23]  Camillo J. Taylor,et al.  Distributed target tracking using self localizing smart camera networks , 2010, ICDSC '10.

[24]  Gyula Simon,et al.  Sensor network-based countersniper system , 2004, SenSys '04.

[25]  Oswald Lanz,et al.  A joint particle filter to track the position and head orientation of people using audio visual cues , 2010, 2010 18th European Signal Processing Conference.

[26]  Chung-Kuo Chang,et al.  Video surveillance for hazardous conditions using sensor networks , 2004, IEEE International Conference on Networking, Sensing and Control, 2004.