Multi-label Prediction for Visual Sentiment Analysis using Eight Different Emotions based on Psychology

In visual sentiment analysis, sentiment estimation from images is a challenging research problem. Previous studies focused on a few specific sentiments and their intensities and have not captured abundant psychological human feelings. In addition, multi-label sentiment estimation from images has not been sufficiently investigated. The purpose of this research is to build a visual sentiment dataset, accurately estimate the sentiments as a multi-label multi-class problem from images that simultaneously evoke multiple emotions. We built a visual sentiment dataset based on Plutchik's wheel of emotions. We describe this ‘Senti8PW’ dataset, then perform multi-label sentiment analysis using the dataset, where we propose a combined deep neural network model that enables inputs from both hand-crafted features and CNN features. We also introduce a threshold-based multi-label prediction algorithm, in which we assume that each emotion has a probability distribution. In other words, after training our deep neural network, we predict evoked emotions for an image if the intensity of the emotion is larger than the threshold of the corresponding emotion. Extensive experiments were conducted on our dataset. Our model achieves superior results compared to the state-of-the-art algorithms in terms of subsets.

[1]  Hassan Foroosh,et al.  LUCFER: A Large-Scale Context-Sensitive Image Dataset for Deep Learning of Visual Emotions , 2019, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[2]  Quoc V. Le,et al.  EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[3]  Ming-Hsuan Yang,et al.  Weakly Supervised Coupled Networks for Visual Sentiment Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[5]  P. Lang International affective picture system (IAPS) : affective ratings of pictures and instruction manual , 2005 .

[6]  Miki Haseyama,et al.  A Cross-Modal Approach for Extracting Semantic Relationships Between Concepts Using Tagged Images , 2014, IEEE Transactions on Multimedia.

[7]  Ethan Rublee,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[8]  Masaki Aono,et al.  Median based Multi-label Prediction by Inflating Emotions with Dyads for Visual Sentiment Analysis , 2019, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).

[9]  R. Plutchik Emotions : a general psychoevolutionary theory , 1984 .

[10]  Allan Hanbury,et al.  Affective image classification using features inspired by psychology and art theory , 2010, ACM Multimedia.

[11]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[12]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Tsuhan Chen,et al.  Where do emotions come from? Predicting the Emotion Stimuli Map , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[14]  Mohan S. Kankanhalli,et al.  Emotional Attention: A Study of Image Sentiment and Visual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Jiebo Luo,et al.  Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark , 2016, AAAI.

[16]  Mohan S. Kankanhalli,et al.  Emotion-Aware Human Attention Prediction , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Reiner Lenz,et al.  Color Based Bags-of-Emotions , 2009, CAIP.

[18]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[19]  In-Kwon Lee,et al.  Building Emotional Machines: Recognizing Image Emotions Through Deep Neural Networks , 2017, IEEE Transactions on Multimedia.

[20]  Mohammad Soleymani,et al.  A survey of multimodal sentiment analysis , 2017, Image Vis. Comput..

[21]  Wei Zhang,et al.  Exploring Discriminative Representations for Image Emotion Recognition With CNNs , 2020, IEEE Transactions on Multimedia.

[22]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[23]  Li-Jia Li,et al.  Visual Sentiment Prediction with Deep Convolutional Neural Networks , 2014, ArXiv.

[24]  John Langford,et al.  Multi-Label Prediction via Compressed Sensing , 2009, NIPS.