Multimodal Stereo Image Registration for Pedestrian Detection

This paper presents an approach for the registration of multimodal imagery for pedestrian detection when the significant depth differences of objects in the scene preclude a global alignment assumption. Using maximization-of-mutual-information matching techniques and sliding correspondence windows over calibrated image pairs, we demonstrate successful registration of color and thermal data. We develop a robust method using disparity voting for determining the registration of each object in the scene and provide a statistically based measure for evaluating the match confidence. Testing shows successful registration in complex scenes with multiple people at different depths and levels of occlusion

[1]  Xia Liu,et al.  Pedestrian detection and tracking with night vision , 2005, IEEE Transactions on Intelligent Transportation Systems.

[2]  P. Anandan,et al.  Robust multi-sensor image alignment , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[3]  Getian Ye Image registration and super-resolution mosaicing , 2005 .

[4]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[5]  KimKyungnam,et al.  Real-time foreground-background segmentation using codebook model , 2005 .

[6]  P.K. Varshney,et al.  Imaging for concealed weapon detection: a tutorial overview of development in imaging sensors and processing , 2005, IEEE Signal Processing Magazine.

[7]  Osama Masoud,et al.  Detection of loitering individuals in public transportation areas , 2005, IEEE Transactions on Intelligent Transportation Systems.

[8]  Michael Unser,et al.  Optimization of mutual information for multiresolution image registration , 2000, IEEE Trans. Image Process..

[9]  Noboru Yasuda,et al.  Pedestrian Detection and Tracking in Far Infrared Images , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[10]  Keiichi Yamada,et al.  Comparison between infrared-image-based and visible-image-based approaches for pedestrian detection , 2003, IEEE IV2003 Intelligent Vehicles Symposium. Proceedings (Cat. No.03TH8683).

[11]  Larry S. Davis,et al.  Real-time foreground-background segmentation using codebook model , 2005, Real Time Imaging.

[12]  E. Coiras,et al.  Segment-based registration technique for visual-infrared images , 2000 .

[13]  James W. Davis,et al.  Fusion-Based Background-Subtraction using Contour Saliency , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[14]  Yuichi Ohta,et al.  Simple and robust tracking of hands and objects for video-based multimedia production , 2003, Proceedings of IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, MFI2003..

[15]  Liang Zhao,et al.  Stereo- and neural network-based pedestrian detection , 2000, IEEE Trans. Intell. Transp. Syst..

[16]  Pramod K. Varshney,et al.  On registration of regions of interest (ROI) in video sequences , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..

[17]  Massimo Bertozzi,et al.  Stereo Vision-based approaches for Pedestrian Detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[18]  Kurt Konolige,et al.  Small Vision Systems: Hardware and Implementation , 1998 .