Enhancing Reliability through Screening and Segmentation: An Online Video Subjective Quality of Experience Case Study

Abstract. In this paper we examine the reliability of subjective rating judgments along a single dimension, focusing on estimates of the technical quality degradation produced by integrity impairments and failures (non-accessibility and non-retainability) associated with viewing video. Subjective rating tasks often exhibit considerable variability, both within and between individuals. In the research reported here we consider different approaches to screening out unreliable participants. We review available alternatives, including a method developed by the ITU, a method based on screening outliers, a method based on the strength of correlations with an assumed "natural" ordering of impairments, and a clustering technique that makes no assumptions about the data. We report on an experiment that assesses the subjective quality of experience associated with impairments and failures of online video. We then assess the reliability of the results using a correlation method and a clustering method, both of which give similar results. Since the clustering method utilized here makes fewer assumptions about the data, it may be a useful supplement to existing techniques for assessing the reliability of participants who make subjective evaluations of the technical quality of videos.
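The correlation-based screening idea mentioned above can be sketched in code. The snippet below is an illustrative assumption, not the paper's exact procedure: each rater's scores are compared (via Spearman rank correlation) against an assumed "natural" ordering of impairment severity, and raters whose correlation falls below a threshold are flagged as candidates for exclusion. The function names, data layout, and the 0.7 threshold are all hypothetical.

```python
def spearman_rho(x, y):
    """Spearman rank correlation, with tied values given their average rank."""
    def ranks(v):
        order = sorted(range(len(v)), key=lambda i: v[i])
        r = [0.0] * len(v)
        i = 0
        while i < len(v):
            j = i
            # Group tied values and assign them the average of their ranks.
            while j + 1 < len(v) and v[order[j + 1]] == v[order[i]]:
                j += 1
            avg_rank = (i + j) / 2 + 1  # 1-based average rank
            for k in range(i, j + 1):
                r[order[k]] = avg_rank
            i = j + 1
        return r

    rx, ry = ranks(x), ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


def screen_raters(ratings, expected_quality, threshold=0.7):
    """Flag raters whose scores correlate weakly with the assumed ordering.

    ratings: dict mapping rater ID -> list of quality scores, one per
             test condition; expected_quality: assumed quality ordering
             for the same conditions (e.g. least to most impaired).
    Returns the rater IDs that fall below the (illustrative) threshold.
    """
    return [rid for rid, scores in ratings.items()
            if spearman_rho(scores, expected_quality) < threshold]
```

In use, a rater whose scores roughly track the assumed severity ordering passes, while one whose scores look random is flagged; the clustering alternative discussed in the paper avoids committing to such an ordering in the first place.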