One picture is worth more than thousand words. A picture contains not only visual features as well as object and scene information, but also emotional meanings that are usually represented in adjective form, for example, happy, sad, fear, etc. (Wang, 2008). In content-based visual information retrieval (CBVIR), emotional-based one is considered as in the highest level, as it involve the subjective perception and understanding of visual information. On the other side, emotion recognition is considered as one of the important part to develop in human-human and human-computer interaction. While the recognition of emotions revealed by facial expression has been mature enough to develop available emotion recognition software, automatic classification of emotions evoked by visual scenes in pictures is still a quite preliminary and challenging area of research (Li, 2010). Few works have been conducted in this field. One typical approach is based on holistic image feature (Yanulevskaya, 2008). It decomposed complex scenes according to an annotated object vocabulary, and assigned a similarity score to all words for each region in an image. Different combinations of visual words in a similarity histogram provided a sufficient characterization of a complex scene picture. Wiccest features and Gabor features were then used for regional feature extraction, and SVM framework was adopted for supervised learning of emotion classes. Two particular new works are presented here to show the recent advances in this field. One is focused on the middle level semantic extraction, in an intention to fill the gap between low-level visual features and high-level emotional concepts to connect human and machine in a uniform framework. Another is to combine low-level visual features and high-level text features to improve the efficiency of classification procedure.
[1]
Sam J. Maglio,et al.
Emotional category data on images from the international affective picture system
,
2005,
Behavior research methods.
[2]
Nicu Sebe,et al.
Emotional valence categorization using holistic image features
,
2008,
2008 15th IEEE International Conference on Image Processing.
[3]
Sebastian Nowozin,et al.
On feature combination for multiclass object classification
,
2009,
2009 IEEE 12th International Conference on Computer Vision.
[4]
Alexei A. Efros,et al.
Discovering objects and their location in images
,
2005,
Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.
[5]
Thomas Hofmann,et al.
Unsupervised Learning by Probabilistic Latent Semantic Analysis
,
2004,
Machine Learning.
[6]
Cordelia Schmid,et al.
Multimodal semi-supervised learning for image classification
,
2010,
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[7]
Michael I. Jordan,et al.
Multiple kernel learning, conic duality, and the SMO algorithm
,
2004,
ICML.
[8]
G LoweDavid,et al.
Distinctive Image Features from Scale-Invariant Keypoints
,
2004
.
[9]
Koen E. A. van de Sande,et al.
Evaluating Color Descriptors for Object and Scene Recognition
,
2010,
IEEE Transactions on Pattern Analysis and Machine Intelligence.
[10]
Huachun Tan,et al.
Discovering latent semantic factors for emotional picture categorization
,
2010,
2010 IEEE International Conference on Image Processing.
[11]
Derek Hoiem,et al.
Building text features for object image classification
,
2009,
CVPR.
[12]
Andrew Zisserman,et al.
Scene Classification Via pLSA
,
2006,
ECCV.
[13]
Kwang-Eun Ko,et al.
Analysis of Physiological Signals for Emotion Recognition Based on Support Vector Machine
,
2012,
RiTA.