Sight enhancement through video fusion in a surveillance system

In this paper we consider the problem of fusing two video streams acquired by an RGB camera and a sensor operating in the long wave infrared (LWIR). The application of interest is area surveillance and the fusion process aims at enhancing the human perception of the monitored scene. We propose a fusion procedure where the background and the moving objects are separated and fused by means of different strategies. With respect to standard video fusion techniques this approach has the advantage of reducing the computational load and mitigating the rapid brightness variations in the fused video. It is also less sensitive to the presence of noise. We discuss the experimental results obtained on a typical area surveillance scenario and demonstrate the effectiveness of the proposed method. For this purpose, the analysis is carried out both subjectively, in terms of visual quality of the fused video stream and objectively, in terms of standard image quality indexes. The computational load is also evaluated.

[1]  Christine Pohl,et al.  Multisensor image fusion in remote sensing: concepts, methods and applications , 1998 .

[2]  Belur V. Dasarathy,et al.  Decision fusion , 1994 .

[3]  Alexander Toet Multiscale color image enhancement , 1992, Pattern Recognit. Lett..

[4]  Vladimir Petrovic,et al.  Objective image fusion performance measure , 2000 .

[5]  David A. Landgrebe,et al.  Decision fusion approach for multitemporal classification , 1999, IEEE Trans. Geosci. Remote. Sens..

[6]  Alexander Toet,et al.  Merging thermal and visual images by a contrast pyramid , 1989 .

[7]  Oliver Rockinger,et al.  Image sequence fusion using a shift-invariant wavelet transform , 1997, Proceedings of International Conference on Image Processing.

[8]  Scott Cohen,et al.  Background estimation as a labeling problem , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[10]  Vladimir S. Petrovic,et al.  Objective pixel-level image fusion performance measure , 2000, SPIE Defense + Commercial Sensing.

[11]  Alexander Toet,et al.  Image fusion by a ration of low-pass pyramid , 1989, Pattern Recognit. Lett..

[12]  Azriel Rosenfeld,et al.  Detection and location of people in video images using adaptive fusion of color and edge information , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[13]  A. Bovik,et al.  A universal image quality index , 2002, IEEE Signal Processing Letters.

[14]  Marco Diani,et al.  Enhancement of Sight Effectiveness by Dual Infrared System: Evaluation of Image Fusion Strategies , 2005 .

[15]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[16]  Heng-Ming Tai,et al.  Efficient video object segmentation using adaptive background registration and edge-based change detection techniques , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[17]  Henk J. A. M. Heijmans,et al.  A new quality metric for image fusion , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[18]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..