Real-time EO/IR sensor fusion on a portable computer and head-mounted display

Multi-sensor platforms are widely used in surveillance video systems for both military and civilian applications. The complimentary nature of different types of sensors (e.g. EO and IR sensors) makes it possible to observe the scene under almost any condition (day/night/fog/smoke). In this paper, we propose an innovative EO/IR sensor registration and fusion algorithm which runs real-time on a portable computing unit with head-mounted display. The EO/IR sensor suite is mounted on a helmet for a dismounted soldier and the fused scene is shown in the goggle display upon the processing on a portable computing unit. The linear homography transformation between images from the two sensors is precomputed for the mid-to-far scene, which reduces the computational cost for the online calibration of the sensors. The system is implemented in a highly optimized C++ code, with MMX/SSE, and performing a real-time registration. The experimental results on real captured video show the system works very well both in speed and in performance.

[1]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[2]  P. Anandan,et al.  Direct Recovery of Planar-Parallax from Multiple Frames , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Michael T. Manry,et al.  Real-time object-based image registration using improved MRAN , 2007, SPIE Defense + Commercial Sensing.

[4]  Carsten Rother,et al.  Projective factorization of planes and cameras in multiple views , 2002, Object recognition supported by user interaction for service robots.

[5]  Amnon Shashua,et al.  Novel View Synthesis by Cascading Trilinear Tensors , 1998, IEEE Trans. Vis. Comput. Graph..

[6]  David A. Forsyth,et al.  Bayesian structure from motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  Gérard G. Medioni,et al.  Detection and Tracking of Moving Objects from Overlapping EO and IR Sensors , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[8]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[9]  Bill Triggs,et al.  Autocalibration from Planar Scenes , 1998, ECCV.

[10]  Alex Pentland,et al.  3 D Structure from 2 D Motion , 2001 .

[11]  P. Anandan,et al.  Robust multi-sensor image alignment , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[12]  Rama Chellappa,et al.  Robust estimation of depth and motion using stochastic approximation , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[13]  Thomas S. Huang,et al.  Motion and structure from feature correspondences: a review , 1994, Proc. IEEE.