Target Detection and Classification Improvements using Contrast Enhanced 16-bit Infrared Videos

In our earlier target detection and classification papers, we used 8-bit infrared videos in the Defense Systems Information Analysis Center(DSIAC) video dataset. In this paper, we focus on how we can improve the target detection and classification results using 16-bit videos. One problem with the 16-bit videos is that some image frames have very low contrast. Two methods were explored to improve upon previous detection and classification results. The first method used to improve contrast was effectively the same as the baseline 8-bit video data but using the 16-bit raw data rather than the 8-bit data taken from the avi files. The second method used was a second order histogram matching algorithm that preserves the 16-bit nature of the videos while providing normalization and contrast enhancement. Results showed the second order histogram matching algorithm improved the target detection using You Only Look Once (YOLO) and classificationusing Residual Network (ResNet) performance. The average precision (AP) metric in YOLO was improved by 8%. This is quite significant. The overall accuracy (OA) of ResNet has been improved by 12%. This is also very significant.

[1]  Henry Arguello,et al.  Object Detection on Compressive Measurements using Correlation Filters and Sparse Representation , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[2]  Vibhav Vineet,et al.  Privacy-Preserving Action Recognition Using Coded Aperture Videos , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  Michael Felsberg,et al.  Channel Coded Distribution Field Tracking for Thermal Infrared Imagery , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[4]  Chenming Li,et al.  Detection and Tracking of Moving Targets for Thermal Infrared Video Sequences , 2018, Sensors.

[5]  Chiman Kwan,et al.  Tracking and Classification of Multiple Human Objects Directly in Compressive Measurement Domain for Low Quality Optical Videos , 2019, 2019 IEEE 10th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON).

[6]  Ralph Etienne-Cummings,et al.  Target Tracking and Classification Using Compressive Measurements of MWIR and LWIR Coded Aperture Cameras , 2019, Journal of Signal and Information Processing.

[7]  Chiman Kwan,et al.  Deep Learning Based Target Tracking and Classification Directly in Compressive Measurement for Low Quality Videos , 2019, Signal & Image Processing : An International Journal.

[8]  Enrique Tajahuerce,et al.  Online reconstruction-free single-pixel image classification , 2019, Image Vis. Comput..

[9]  Michael Elad,et al.  Compressed Learning: A Deep Neural Network Approach , 2016, ArXiv.

[10]  Chiman Kwan,et al.  Compressive Vehicle Tracking Using Deep Learning , 2018, 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON).

[11]  A. Enis Çetin,et al.  Co-difference based object tracking algorithm for infrared videos , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[12]  Kevin F. Kelly,et al.  Compressed domain image classification using a multi-rate neural network , 2019, ArXiv.

[13]  Chiman Kwan,et al.  Compressive object tracking and classification using deep learning for infrared videos , 2019, Defense + Commercial Sensing.

[14]  Chiman Kwan,et al.  Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements , 2019, Journal of Signal and Information Processing.

[15]  Chao Gao,et al.  Background subtraction based level sets for human segmentation in thermal infrared surveillance systems , 2013 .

[16]  Chiman Kwan,et al.  A Comparative Study of Conventional and Deep Learning Target Tracking Algorithms for Low Quality Videos , 2018, ISNN.

[17]  Ralph Etienne-Cummings,et al.  Real-Time and Deep Learning Based Vehicle Detection and Classification Using Pixel-Wise Code Exposure Measurements , 2020, Electronics.

[18]  Yuzhong Shen,et al.  Simple and effective cloud- and shadow-detection algorithms for Landsat and Worldview images , 2020, Signal Image Video Process..

[19]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Bülent Sankur,et al.  Compressively Sensed Image Recognition , 2018, 2018 7th European Workshop on Visual Information Processing (EUVIP).

[21]  Chiman Kwan,et al.  Enhancing Small Moving Target Detection Performance in Low-Quality and Long-Range Infrared Videos Using Optical Flow Techniques , 2020, Remote. Sens..

[22]  Pavan K. Turaga,et al.  Direct inference on compressive measurements using convolutional neural networks , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[23]  Chiman Kwan,et al.  Multiple Human Objects Tracking and Classification Directly in Compressive Measurement Domain for Long Range Infrared Videos , 2019, 2019 IEEE 10th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON).

[24]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Ali Farhadi,et al.  YOLOv3: An Incremental Improvement , 2018, ArXiv.

[26]  Chiman Kwan,et al.  Small Infrared Target Detection Based on Fast Adaptive Masking and Scaling With Iterative Segmentation , 2021, IEEE Geoscience and Remote Sensing Letters.

[27]  W. Marsden I and J , 2012 .

[28]  Chiman Kwan,et al.  A high-performance approach to detecting small targets in long-range low-quality infrared videos , 2020, Signal Image Video Process..

[29]  Ralph Etienne-Cummings,et al.  Detection and Confirmation of Multiple Human Targets Using Pixel-Wise Code Aperture Measurements , 2020, J. Imaging.