Studying user agreement on aesthetic appeal ratings and its relation with technical knowledge

In this paper, a crowdsourcing experiment was conducted involving different panels of participants. The aim of this study is to evaluate how the preference of one image over another one is related with the knowledge of the participant in photography. In previous work the two discriminant evaluation concepts “presence of a main subject” and “exposure” were found to distinguish group participants with different degrees of knowledge in photography. Each of these groups provided different means of aesthetic appeal ratings when asked to rate on an absolute category scale. The present paper extends previous work by studying preference ratings on a set of image pairs as a function of technical knowledge and more specifically adding a focus on the variance of rating and agreement between participants. The conducted study was composed of two different steps where the participants had to first report their preference of one image over another (paired comparison), and an evaluation of the technical background of the participant using a specific set of images. Based on preference-rating patterns groups of participants were identified. These groups were formed by clustering the participants who saw and shared the same preference rating on images in one group, and the participants with low agreement with other participants in another group. A per-group analysis showed that a high agreement between participants could be observed when participants have technical knowledge. This indicates that higher consistency between participants can be reached when expert users are being recruited, and therefore participants should be carefully selected in image aesthetic appeal evaluation to ensure stable results.

[1]  Marcus Barkowsky,et al.  Subjective assessment methodology for preference of experience in 3DTV , 2013, IVMSP 2013.

[2]  Jukka Häkkinen,et al.  Profiles of the evaluators: impact of psychographic variables on the consumer-oriented quality assessment of mobile television , 2008, Electronic Imaging.

[3]  H. Leder,et al.  A model of aesthetic appreciation and aesthetic judgments. , 2004, British journal of psychology.

[4]  W. Chu Studying Aesthetics in Photographic Images Using a Computational Approach , 2013 .

[5]  Edward A. Vessel,et al.  Personalized visual aesthetics , 2014, Electronic Imaging.

[6]  Yan Ke,et al.  The Design of High-Level Features for Photo Quality Assessment , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Edward A. Vessel,et al.  Perceptual Pleasure and the Brain , 2006 .

[8]  D. Berlyne Novelty, complexity, and hedonic value , 1970 .

[9]  John C. Handley,et al.  Comparative Analysis of Bradley-Terry and Thurstone-Mosteller Paired Comparison Models for Image Quality Assessment , 2001, PICS.

[10]  Judith Redi,et al.  Best Practices and Recommendations for Crowdsourced QoE - Lessons learned from the Qualinet Task Force Crowdsourcing , 2014 .

[11]  Edward A. Vessel,et al.  Beauty and the beholder: highly individual taste for abstract, but not real-world images. , 2010, Journal of vision.

[12]  Alexander Raake,et al.  Evaluation of aesthetic appeal with regard of user's knowledge , 2016, HVEI.