NR-Bitstream video quality metrics for SSIM using encoding decisions in AVC and HEVC coded videos

Proposed NR-Bitstream SSIM model of AVC and HEVC do not need pixel domain features.Block-Partition features are important in SSIM prediction especially for HEVC.Proposed common model performs well for AVC (0.78 correlation) and HEVC (0.88).Proposed common model is up to 13.13% better than the latest method in correlation.Proposed method requires much less computational complexity than the latest method. We propose a no-reference compressed video quality model to predict the full-reference SSIM metrics for AVC (Advanced Video Coding, H.264) and HEVC (High Efficiency Video Coding) videos. The model we use is support vector regression (SVR) model. We use only encoding decisions made during motion estimation to perform the prediction, and do not need the information from pixel domain. We show that the Block-Partition-related features have great importance in SSIM prediction, especially for HEVC videos, due to its partition decisions being more complex than those of AVC. The proposed SVR model trained by data of two different encoding configurations can predict SSIM well for AVC videos (0.78 correlation) and for HEVC videos (0.88 correlation). The proposed models are also compared with a state-of-the-art no-reference-bitstream-pixel SSIM prediction model. We show that the proposed methods provide higher prediction correlation (as high as 13.13% improvement in correlation) with much lower complexity.

[1]  Nicu Sebe,et al.  Facial expression recognition from video sequences , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[2]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[3]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[4]  Tamer Shanableh,et al.  Prediction of Structural Similarity Index of Compressed Video at a Macroblock Level , 2011, IEEE Signal Processing Letters.

[5]  Gwen Littlewort,et al.  Toward Practical Smile Detection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Bernd Girod,et al.  What's wrong with mean-squared error? , 1993 .

[7]  Paul S. Fisher,et al.  Image quality measures and their performance , 1995, IEEE Trans. Commun..

[8]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[9]  K. R. Rao,et al.  High Efficiency Video Coding(HEVC) , 2014 .

[10]  Rik Van de Walle,et al.  No-Reference Bitstream-Based Visual Quality Impairment Detection for High Definition H.264/AVC Encoded Video Sequences , 2012, IEEE Transactions on Broadcasting.

[11]  Munchurl Kim,et al.  No-Reference PSNR Estimation for HEVC Encoded Video , 2013, IEEE Transactions on Broadcasting.

[12]  Weisi Lin,et al.  Comparison of Video Quality Metrics on Multimedia Videos , 2006, 2006 International Conference on Image Processing.

[13]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Gwen Littlewort,et al.  Real Time Face Detection and Facial Expression Recognition: Development and Applications to Human Computer Interaction. , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[15]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[16]  Pamela C. Cosman,et al.  A Versatile Model for Packet Loss Visibility and its Application to Packet Prioritization , 2010, IEEE Transactions on Image Processing.

[17]  Qiang Ji,et al.  Active and dynamic information fusion for facial expression understanding from image sequences , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Margaret H. Pinson Low Bandwidth Reduced Reference Video Quality Monitoring System , 2005 .

[19]  Martin Reisslein,et al.  Objective Video Quality Assessment Methods: A Classification, Review, and Performance Comparison , 2011, IEEE Transactions on Broadcasting.

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Nicu Sebe,et al.  Facial expression recognition from video sequences: temporal and static modeling , 2003, Comput. Vis. Image Underst..

[22]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.