Registration of Multimodal Stereo Images Using Disparity Voting from Correspondence Windows

This paper presents a method for registering multimodal imagery in short range surveillance situations when the differences in object depths preclude any global registration techniques. An analysis of multimodal registration approaches gives insight into the limitations of global assumptions and motivates the developed algorithm. Using calibrated stereo imagery, we use maximization of mutual information in sliding correspondence windows that inform a disparity voting scheme to demonstrate successful registration of color and thermal images. Extensive testing of scenes with multiple people at different depths and levels of occlusion shows high rates of successful registration and gives a reliable framework for further processing and analysis of the multimodal imagery.

[1]  James W. Davis,et al.  Fusion-Based Background-Subtraction using Contour Saliency , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[2]  P.K. Varshney,et al.  Imaging for concealed weapon detection: a tutorial overview of development in imaging sensors and processing , 2005, IEEE Signal Processing Magazine.

[3]  A. Broggi,et al.  Low-level Pedestrian Detection by means of Visible and Far Infra-red Tetra-vision , 2006, 2006 IEEE Intelligent Vehicles Symposium.

[4]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[5]  Mohan M. Trivedi,et al.  Multimodal Stereo Image Registration for Pedestrian Detection , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[6]  E. Coiras,et al.  Segment-based registration technique for visual-infrared images , 2000 .

[7]  Yuichi Ohta,et al.  Simple and robust tracking of hands and objects for video-based multimedia production , 2003, Proceedings of IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, MFI2003..

[8]  Alan F. Smeaton,et al.  Background Modelling in Infrared and Visible Spectrum Video for People Tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[9]  Michael Unser,et al.  Optimization of mutual information for multiresolution image registration , 2000, IEEE Trans. Image Process..

[10]  P. Anandan,et al.  Robust multi-sensor image alignment , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[11]  B. Bhanu,et al.  Detecting moving humans using color and infrared video , 2003, Proceedings of IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, MFI2003..

[12]  Pramod K. Varshney,et al.  On registration of regions of interest (ROI) in video sequences , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..