Graph clustering for weapon discharge event detection and tracking in infrared imagery using deep features

This paper addresses the problem of detecting and tracking weapon discharge event in an Infrared Imagery collection. While most of the prior work in related domains exploits the vast amount of complementary in- formation available from both visible-band (EO) and Infrared (IR) image (or video sequences), we handle the problem of recognizing human pose and activity detection exclusively in thermal (IR) images or videos. The task is primarily two-fold: 1) locating the individual in the scene from IR imagery, and 2) identifying the correct pose of the human individual (i.e. presence or absence of weapon discharge activity or intent). An efficient graph-based shortlisting strategy for identifying candidate regions of interest in the IR image utilizes both image saliency and mutual similarities from the initial list of the top scored proposals of a given query frame, which ensures an improved performance for both detection and recognition simultaneously and reduced false alarms. The proposed search strategy offers an efficient feature extraction scheme that can capture the maximum amount of object structural information by defining a region- based deep shape descriptor representing each object of interest present in the scene. Therefore, our solution is capable of handling the fundamental incompleteness of the IR imageries for which the conventional deep features optimized on the natural color images in Imagenet are not quite suitable. Our preliminary experiments on the OSU weapon dataset demonstrates significant success in automated recognition of weapon discharge events from IR imagery.

[1]  Mikolaj Leszczuk,et al.  Automated Detection of Firearms and Knives in a CCTV Image , 2016, Sensors.

[2]  P.K. Varshney,et al.  Imaging for concealed weapon detection: a tutorial overview of development in imaging sensors and processing , 2005, IEEE Signal Processing Magazine.

[3]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[5]  Tao Xiang,et al.  Sketch-a-Net that Beats Humans , 2015, BMVC.

[6]  Dimitrios Tzovaras,et al.  Suspicious activity recognition in infrared imagery using Hidden Conditional Random Fields for outdoor perimeter surveillance , 2015, International Conference on Quality Control by Artificial Vision.

[7]  I. Pavlidis,et al.  Thermal imaging for anxiety detection , 2000, Proceedings IEEE Workshop on Computer Vision Beyond the Visible Spectrum: Methods and Applications (Cat. No.PR00640).

[8]  Svetlana Lazebnik,et al.  Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.

[9]  Sudeep Sarkar,et al.  Discrimination of motion based on traces in the space of probability functions over feature relations , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Daniel Olmeda,et al.  Detection and tracking of pedestrians in infrared images , 2009, 2009 3rd International Conference on Signals, Circuits and Systems (SCS).

[11]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  H. D. Arlowe Thermal detection contrast of human targets , 1992, Proceedings 1992 International Carnahan Conference on Security Technology: Crime Countermeasures.

[13]  Massimo Bertozzi,et al.  Vehicle detection and localization in infra-red images , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[14]  Gang Xu,et al.  Understanding human motion patterns , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[15]  Edoardo Ardizzone,et al.  Combining top-down and bottom-up visual saliency for firearms localization , 2014, 2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP).

[16]  James W. Davis Sequential Reliable-Inference for Rapid Detection of Human Actions , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[17]  Toby P. Breckon,et al.  Human pose classification within the context of near-IR imagery tracking , 2013, Optics/Photonics in Security and Defence.

[18]  Rama Chellappa,et al.  Higher-order spectral analysis of human motion , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[19]  Bir Bhanu,et al.  Physical models for moving shadow and object detection in video , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Simone Calderara,et al.  Visual Tracking: An Experimental Survey , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Joseph P. Racamato,et al.  Solid-State Color Night Vision: Fusion of Low-Light Visible and Thermal Infrared Imagery , 1998 .

[22]  Nashwan Jasim Hussein,et al.  An alternative method to discover concealed weapon detection using critical fusion image of color image and infrared image , 2016, 2016 First IEEE International Conference on Computer Communication and the Internet (ICCCI).

[23]  Eddie L. Jacobs,et al.  Human suspicious activity recognition in thermal infrared video , 2014, Optics & Photonics - Optical Engineering + Applications.

[24]  Cheng Xiang,et al.  Scalable scene understanding using saliency-guided object localization , 2013, 2013 10th IEEE International Conference on Control and Automation (ICCA).

[25]  Angel D. Sappa,et al.  Unsupervised motion classification by means of efficient feature selection and tracking , 2004 .

[26]  Bir Bhanu,et al.  Fusion of color and infrared video for moving human detection , 2007, Pattern Recognit..

[27]  Allen M. Waxman,et al.  Real-time fusion of low-light CCD and uncooled IR imagery for color night vision , 1998, Defense, Security, and Sensing.

[28]  Shigeyuki Tomita,et al.  Face identification using thermal image processing , 1997, Proceedings 6th IEEE International Workshop on Robot and Human Communication. RO-MAN'97 SENDAI.

[29]  Eddie L. Jacobs,et al.  Activity recognition in thermal infrared video , 2015, SoutheastCon 2015.

[30]  Rick S. Blum,et al.  Concealed weapon detection using color image fusion , 2003, Sixth International Conference of Information Fusion, 2003. Proceedings of the.

[31]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[32]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[33]  Marc Alexa,et al.  How do humans sketch objects? , 2012, ACM Trans. Graph..

[34]  Hironobu Fujiyoshi,et al.  Real-time human motion analysis by image skeletonization , 1998, Proceedings Fourth IEEE Workshop on Applications of Computer Vision. WACV'98 (Cat. No.98EX201).

[35]  R. Nelson,et al.  Low level recognition of human motion (or how to get your man without finding his body parts) , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.