Gun Source and Muzzle Head Detection

There is a surging need across the world for protection against gun violence. There are three main areas that we have identified as challenging in research that tries to curb gun violence: temporal location of gunshots, gun type prediction and gun source (shooter) detection. Our task is gun source detection and muzzle head detection, where the muzzle head is the round opening of the firing end of the gun. We would like to locate the muzzle head of the gun in the video visually, and identify who has fired the shot. In our formulation, we turn the problem of muzzle head detection into two sub-problems of human object detection and gun smoke detection. Our assumption is that the muzzle head typically lies between the gun smoke caused by the shot and the shooter. We have interesting results both in bounding the shooter as well as detecting the gun smoke. In our experiments, we are successful in detecting the muzzle head by detecting the gun smoke and the shooter.

[1]  Florian Metze,et al.  Event-based Video Retrieval Using Audio , 2012, INTERSPEECH.

[2]  Jia Chen,et al.  Videos from the 2013 Boston Marathon: An Event Reconstruction Dataset for Synchronization and Localization , 2016 .

[3]  Irfan A. Essa,et al.  Exploiting human actions and object context for recognition tasks , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[5]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Alexander G. Hauptmann,et al.  Video Analytics for Conflict Monitoring and Human Rights Documentation , 2015 .

[7]  Gyanendra K. Verma,et al.  A Computer Vision based Framework for Visual Gun Detection Using Harris Interest Point Detector , 2015 .

[8]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[9]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Alexander G. Hauptmann,et al.  Temporal localization of audio events for conflict monitoring in social media , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[12]  Chloé Clavel,et al.  Events Detection for an Audio-Based Surveillance System , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[13]  Augusto Sarti,et al.  Scream and gunshot detection and localization for audio-surveillance systems , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[14]  José,et al.  Gunshot detection in noisy environments , 2010 .

[15]  Bhiksha Raj,et al.  Audio event detection from acoustic unit occurrence patterns , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Pablo Alvarado-Moya,et al.  Evaluation of gunshot detection algorithms , 2008, 2008 Argentine School of Micro-Nanoelectronics, Technology and Applications.

[17]  Ronald R. Coifman,et al.  Wavelet analysis and signal processing , 1990 .

[18]  Augusto Sarti,et al.  Scream and gunshot detection in noisy environments , 2007, 2007 15th European Signal Processing Conference.

[19]  Kyogu Lee,et al.  Rare Sound Event Detection Using 1D Convolutional Recurrent Neural Networks , 2017, DCASE.

[20]  Nathaniel D. Daw,et al.  Human Representation of Visuo-Motor Uncertainty as Mixtures of Orthogonal Basis Distributions , 2015, Nature Neuroscience.

[21]  Thomas Brox,et al.  FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Rick S. Blum,et al.  Fusion of visual and IR images for concealed weapon detection , 2002, Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997).

[23]  R.C. Maher,et al.  Modeling and Signal Processing of Acoustic Gunshot Recordings , 2006, 2006 IEEE 12th Digital Signal Processing Workshop & 4th IEEE Signal Processing Education Workshop.

[24]  Alexander G. Hauptmann,et al.  Video Synchronization and Sound Search for Human Rights Documentation and Conflict Monitoring , 2016 .

[25]  HyunWook Park,et al.  Segmentation of forward-looking infrared image using fuzzy thresholding and edge detection , 2001 .

[26]  Xirong Li,et al.  Detecting semantic concepts in consumer videos using audio , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[27]  Deyu Meng,et al.  Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search , 2014, ACM Multimedia.

[28]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[29]  Talal Ahmed,et al.  Improving efficiency and reliability of gunshot detection systems , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[30]  Sergios Theodoridis,et al.  Gunshot detection in audio streams from movies by means of dynamic programming and Bayesian networks , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.