Attentional Feature Fusion for End-to-End Blind Image Quality Assessment

In this paper, an end-to-end blind image quality assessment (BIQA) model based on feature fusion with an attention mechanism is proposed. We extracted the multilayer features of the image and fused them based on the attention mechanism; the fused features are then mapped into score, and the image quality assessment without reference is realized. First, because the human visual perception system hierarchically approaches the input information from local to global, we used three different neural networks to extract physically meaningful image features, and we use modified VGG19 and modified VGG16 to extract the substrate texture information and the local information of the edges, respectively. Meanwhile, we use the resNet50 to extract high-level global semantic information. Second, to take full advantage of multilevel features and avoid monotonic addition in hierarchical feature fusion, we adopt an attention-based feature fusion mechanism that combines the global and local contexts of the features and assigns different weights to the features to be fused, so that the model can perceive richer types of distortion. Experimental findings on six standard databases show that our approach yields improved performance.

[1]  Guangming Shi,et al.  Image Quality Caption with Attentive and Recurrent Semantic Attractor Network , 2021, ACM Multimedia.

[2]  Xinfeng Zhang,et al.  Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure , 2021, ACM Multimedia.

[3]  Manri Cheon,et al.  Perceptual Image Quality Assessment with Transformers , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[4]  Weisi Lin,et al.  Blind image quality prediction with hierarchical feature aggregation , 2021, Inf. Sci..

[5]  Junyong You,et al.  Transformer For Image Quality Assessment , 2020, 2021 IEEE International Conference on Image Processing (ICIP).

[6]  Ke Gu,et al.  Blind Image Quality Assessment by Natural Scene Statistics and Perceptual Characteristics , 2020, ACM Trans. Multim. Comput. Commun. Appl..

[7]  Guangming Shi,et al.  End-to-End Blind Image Quality Prediction With Cascaded Deep Neural Network , 2020, IEEE Transactions on Image Processing.

[8]  Yu Zhu,et al.  Blindly Assess Image Quality in the Wild Guided by a Self-Adaptive Hyper Network , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Baining Guo,et al.  Learning Texture Transformer Network for Image Super-Resolution , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Shiqi Wang,et al.  Image Quality Assessment: Unifying Structure and Texture Similarity , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Weisi Lin,et al.  Multiscale Natural Scene Statistical Analysis for No-Reference Quality Evaluation of DIBR-Synthesized Views , 2020, IEEE Transactions on Broadcasting.

[12]  Gerhard Nahler,et al.  Pearson Correlation Coefficient , 2020, Definitions.

[13]  Praful Gupta,et al.  From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Zhou Wang,et al.  Blind Image Quality Assessment Using a Deep Bilinear Convolutional Neural Network , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Federica Battisti,et al.  Reduced Reference Quality Assessment of Light Field Images , 2019, IEEE Transactions on Broadcasting.

[16]  Qi Tian,et al.  Blind image quality prediction by exploiting multi-level deep representations , 2018, Pattern Recognit..

[17]  Mylène C. Q. Farias,et al.  Blind image quality assessment based on multiscale salient local binary patterns , 2018, MMSys.

[18]  Hong Cai,et al.  PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  Yutao Liu,et al.  Blind Image Quality Estimation via Distortion Aggravation , 2018, IEEE Transactions on Broadcasting.

[20]  Guangming Shi,et al.  Hierarchical Feature Degradation Based Blind Image Quality Assessment , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[21]  Weisi Lin,et al.  No-Reference and Robust Image Sharpness Evaluation Based on Multiscale Spatial and Spectral Features , 2017, IEEE Transactions on Multimedia.

[22]  Sanghoon Lee,et al.  Fully Deep Blind Image Quality Predictor , 2017, IEEE Journal of Selected Topics in Signal Processing.

[23]  Xiang Bai,et al.  Richer Convolutional Features for Edge Detection , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Sebastian Bosse,et al.  Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment , 2016, IEEE Transactions on Image Processing.

[25]  Sebastian Bosse,et al.  A deep neural network for image quality assessment , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Zhou Wang,et al.  No-Reference Quality Assessment of Contrast-Distorted Images Based on Natural Scene Statistics , 2015, IEEE Signal Processing Letters.

[28]  Lei Zhang,et al.  A Feature-Enriched Completely Blind Image Quality Evaluator , 2015, IEEE Transactions on Image Processing.

[29]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[30]  Wenjun Zhang,et al.  Hybrid No-Reference Quality Metric for Singly and Multiply Distorted Images , 2014, IEEE Transactions on Broadcasting.

[31]  Hongyu Li,et al.  VSI: A Visual Saliency-Induced Index for Perceptual Image Quality Assessment , 2014, IEEE Transactions on Image Processing.

[32]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[33]  Christophe Charrier,et al.  Blind Image Quality Assessment: A Natural Scene Statistics Approach in the DCT Domain , 2012, IEEE Transactions on Image Processing.

[34]  Philip H. Ramsey Critical Values for Spearman’s Rank Order Correlation , 1989 .

[35]  Wenjun Zhang,et al.  Using Free Energy Principle For Blind Image Quality Assessment , 2015, IEEE Transactions on Multimedia.