Image aesthetic assessment using a saliency symbiosis network

Abstract. As a result of people taking more and more pictures in their lives, image assessment technology, which can automatically help people choose high quality pictures quickly, has become particularly important. Most algorithms use peak signal-to-noise ratio (PSNR) to assess image quality. However, images with high scores on PSNR are not as beautiful as individuals think. Image aesthetic assessment technology can come closer to human aesthetic standards. We report on a method named saliency symbiosis network for image aesthetic assessment. This is significant because we improved the conventional convolutional neural networks (CNN) method, which gets very close to the human visual mechanism after adding saliency features in CNN. Owing to considering limitations of CNN input size, we also proposed a pooling strategy to improve the ability of the model to accept arbitrary input sizes. Afterward, we propose an effective mean Huber loss function, which becomes less sensitive to outliers and can quickly train the model to being optimal. The experiment results proved that the proposed method, by using very small training data, performed the highest accuracy in image aesthetic assessment and classification.

[1]  Sebastian Bosse,et al.  Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment , 2016, IEEE Transactions on Image Processing.

[2]  Qiaosong Wang,et al.  Towards the Success Rate of One: Real-Time Unconstrained Salient Object Detection , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[3]  Xiang Zhang,et al.  Region Diversity Maximization for Salient Object Detection , 2012, IEEE Signal Processing Letters.

[4]  Damon M. Chandler,et al.  Opinion-Unaware Blind Quality Assessment of Multiply and Singly Distorted Images via Distortion Parameter Estimation , 2018, IEEE Transactions on Image Processing.

[5]  Florent Perronnin,et al.  Learning beautiful (and ugly) attributes , 2013, BMVC.

[6]  Gabriela Csurka,et al.  Assessing the aesthetic quality of photographs using generic image descriptors , 2011, 2011 International Conference on Computer Vision.

[7]  Chang-Su Kim,et al.  PAC-Net: Pairwise Aesthetic Comparison Network for Image Aesthetic Assessment , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[8]  Houqiang Li,et al.  Photo Quality Assessment with DCNN that Understands Image Well , 2015, MMM.

[9]  Xiaogang Wang,et al.  Content-based photo quality assessment , 2011, 2011 International Conference on Computer Vision.

[10]  Thomas Deselaers,et al.  What is an object? , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Chanho Jung,et al.  A Unified Spectral-Domain Approach for Saliency Detection and Its Application to Automatic Object Segmentation , 2012, IEEE Transactions on Image Processing.

[12]  Sajid Saleem,et al.  A Robust SIFT Descriptor for Multispectral Images , 2014, IEEE Signal Processing Letters.

[13]  Huchuan Lu,et al.  Salient Object Detection with Recurrent Fully Convolutional Networks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Vicente Ordonez,et al.  High level describable attributes for predicting aesthetics and interestingness , 2011, CVPR 2011.

[15]  Yan Yan,et al.  Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression , 2017, IEEE Transactions on Multimedia.

[16]  Shuang Ma,et al.  A-Lamp: Adaptive Layout-Aware Multi-patch Deep Convolutional Neural Network for Photo Aesthetic Assessment , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Haibin Ling,et al.  A Deep Network Solution for Attention and Aesthetics Aware Photo Cropping , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Rita Cucchiara,et al.  Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features , 2016, ICMR.

[19]  Radomír Mech,et al.  Photo Aesthetics Ranking Network with Attributes and Content Adaptation , 2016, ECCV.

[20]  Naila Murray,et al.  AVA: A large-scale database for aesthetic visual analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Ran He,et al.  Deep Aesthetic Quality Assessment With Semantic Information , 2016, IEEE Transactions on Image Processing.

[22]  Nicu Sebe,et al.  Computer Vision – ECCV 2016 , 2016, Lecture Notes in Computer Science.

[23]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Xiaoou Tang,et al.  Image Aesthetic Assessment: An experimental survey , 2016, IEEE Signal Processing Magazine.

[25]  James Ze Wang,et al.  Studying Aesthetics in Photographic Images Using a Computational Approach , 2006, ECCV.

[26]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[27]  Sruthy Suran,et al.  Automatic aesthetic quality assessment of photographic images using deep convolutional neural network , 2016, 2016 International Conference on Information Science (ICIS).

[28]  Hailin Jin,et al.  Composition-Preserving Deep Photo Aesthetics Assessment , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Yilong Yin,et al.  Distribution-Oriented Aesthetics Assessment With Semantic-Aware Hybrid Network , 2019, IEEE Transactions on Multimedia.

[30]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Yan Ke,et al.  The Design of High-Level Features for Photo Quality Assessment , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[32]  Seunghoon Hong,et al.  Personalized Image Aesthetic Quality Assessment by Joint Regression and Ranking , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[33]  Zhuowen Tu,et al.  Deeply Supervised Salient Object Detection with Short Connections , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  James Zijun Wang,et al.  RAPID: Rating Pictorial Aesthetics using Deep Learning , 2014, ACM Multimedia.

[35]  Henry Kang,et al.  Photo Aesthetics Analysis via DCNN Feature Encoding , 2017, IEEE Transactions on Multimedia.

[36]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Jiebo Luo,et al.  Aesthetics and Emotions in Images , 2011, IEEE Signal Processing Magazine.

[38]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[39]  Zhou Wang,et al.  Video saliency incorporating spatiotemporal cues and uncertainty weighting , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[40]  Wenguan Wang,et al.  Deep Cropping via Attention Box Prediction and Aesthetics Assessment , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[41]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[42]  Ishu Arora,et al.  Bi-featured image quality assessment with the hierarchical image quality enhancement algorithm , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[43]  Jar-Ferr Yang,et al.  Linear Discriminant Regression Classification for Face Recognition , 2013, IEEE Signal Processing Letters.

[44]  Sebastian Bosse,et al.  A deep neural network for image quality assessment , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[45]  Philip J. Corriveau,et al.  Objective Consumer Device Photo Quality Evaluation , 2015, IEEE Signal Processing Letters.