STRATEGIES FOR QUALITY-AWARE VIDEO CONTENT ANALYTICS

Recent research in video analytics promises the capability to automatically detect and extract information from video. Potential tasks include object and pedestrian detection, object and face recognition, motion detection, object tracking, as well as background subtraction and activity recognition. However, in many instances, the quality of the video from which information is to be extracted is not very high. This may be because of system constraints (like a bandwidth constraint or VHS recorder), environmental conditions (fog or low light), or a poor camera (wobbly/moving camera, limited FOV, or just a low-quality lens).

[1]  Wei Tsang Ooi,et al.  Critical video quality for distributed automated video surveillance , 2005, MULTIMEDIA '05.

[2]  Thomas J. Hacker,et al.  Determining the Necessary Frame Rate of Video Data for Object Tracking under Accuracy Constraints , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[3]  Edward J. Delp,et al.  Quality-adaptive deep learning for pedestrian detection , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[4]  Marco Tagliasacchi,et al.  Performance evaluation of object recognition tasks in visual sensor networks , 2014, 2014 26th International Teletraffic Congress (ITC).

[5]  He Liu,et al.  Software to Stress Test Image Quality Estimators , 2016, 2016 Eighth International Conference on Quality of Multimedia Experience (QoMEX).

[6]  Nikhil Gagvani,et al.  Analytics-Modulated coding of surveillance video , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[7]  Pavel Korshunov Rate-accuracy tradeoff in automated, distributed video surveillance systems , 2006, MM '06.

[8]  George K. Thiruvathukal,et al.  Analyzing Real-Time Multimedia Content from Network Cameras Using CPUs and GPUs in the Cloud , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[9]  Giovanni Maria Farinella,et al.  Distortion Adaptive Descriptors: Extending Gradient-Based Descriptors to Wide Angle Images , 2015, ICIAP.

[10]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[11]  Dieter Fox,et al.  Kernel Descriptors for Visual Recognition , 2010, NIPS.

[12]  Edward J. Delp,et al.  Accuracy prediction for pedestrian detection , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[13]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[14]  Eckehard G. Steinbach,et al.  SIFT feature-preserving bit allocation for H.264/AVC video compression , 2012, 2012 19th IEEE International Conference on Image Processing.

[15]  Thomas S. Huang,et al.  Studying Very Low Resolution Recognition Using Deep Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Ivan Laptev,et al.  On Space-Time Interest Points , 2005, International Journal of Computer Vision.

[17]  Joydeep Ghosh,et al.  Face Detection on Distorted Images Augmented by Perceptual Quality-Aware Features , 2014, IEEE Transactions on Information Forensics and Security.

[18]  Klemen Grm,et al.  UG^2: A Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[19]  Bernhard Schölkopf,et al.  Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.

[20]  Antonio Torralba,et al.  Assessing the Quality of Actions , 2014, ECCV.

[21]  Rui Dai,et al.  A new quality model for object detection using compressed videos , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[22]  Stefano Tubaro,et al.  Coding Visual Features Extracted From Video Sequences , 2014, IEEE Transactions on Image Processing.

[23]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[24]  Jenq-Neng Hwang,et al.  A Review on Video-Based Human Activity Recognition , 2013, Comput..

[25]  Marco Saerens,et al.  Adjusting the Outputs of a Classifier to New a Priori Probabilities: A Simple Procedure , 2002, Neural Computation.

[26]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[27]  Wei Tsang Ooi,et al.  Video quality for face detection, recognition, and tracking , 2011, TOMCCAP.

[28]  Derek Hoiem,et al.  Diagnosing Error in Object Detectors , 2012, ECCV.

[29]  Eckehard G. Steinbach,et al.  Keypoint Encoding for Improved Feature Extraction From Compressed Video at Low Bitrates , 2015, IEEE Transactions on Multimedia.

[30]  Amy R. Reibman,et al.  Prediction system for activity recognition with compressed video , 2018, Visual Information Processing and Communication.

[31]  Xiaoou Tang,et al.  Single Image Haze Removal Using Dark Channel Prior , 2011 .

[32]  David R. Bull,et al.  Robust texture features for blurred images using Undecimated Dual-Tree Complex Wavelets , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[33]  Shih-Fu Chang,et al.  Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Kate Saenko,et al.  Fine-to-coarse knowledge transfer for low-res image classification , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[35]  Sheila S. Hemami,et al.  No-reference image and video quality estimation: Applications and human-motivated design , 2010, Signal Process. Image Commun..

[36]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[37]  Jiaolong Xu,et al.  Incremental Domain Adaptation of Deformable Part-based Models , 2014, BMVC.

[38]  Aggelos K. Katsaggelos,et al.  Low-Complexity Tracking-Aware H.264 Video Compression for Transportation Surveillance , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[39]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[40]  Richard J. Wood,et al.  Real-time video image quality estimation supports enhanced tracker performance , 2013, Defense, Security, and Sensing.

[41]  Rama Chellappa,et al.  A Blur-Robust Descriptor with Applications to Face Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Wei Tsang Ooi,et al.  Reducing Frame Rate for Object Tracking , 2010, MMM.

[43]  Cordelia Schmid,et al.  Dense Trajectories and Motion Boundary Descriptors for Action Recognition , 2013, International Journal of Computer Vision.

[44]  João Ascenso,et al.  Descriptor-based adaptive tracking-by-detection for visual sensor networks , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[45]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[46]  David R. Bull,et al.  Automatic contrast enhancement of low-light images based on local statistics of wavelet coefficients , 2010, 2010 IEEE International Conference on Image Processing.

[47]  Pierre Moulin,et al.  A two-part predictive coder for multitask signal compression , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).