Fast Adaptive Algorithm for Robust Evaluation of Quality of Experience

Outlier detection is an integral part of robust evaluation for crowdsourceable Quality of Experience (QoE) and has attracted much attention in recent years. In QoE for multimedia, outliers happen because of different test conditions, human errors, abnormal variations in context, etc. In this paper, we propose a simple yet effective algorithm for outlier detection and robust QoE evaluation named iterative Least Trimmed Squares (iLTS). The algorithm assigns binary weights to samples, i.e., 0 or 1 indicating if a sample is an outlier, then the outlier-trimmed subset least squares solutions give robust ranking scores. An iterative optimization is carried alternatively between updating weights and ranking scores which converges to a local optimizer in finite steps. In our test setting, iLTS is up to 190 times faster than LASSO-based methods with a comparable performance. Moreover, a varied version of this method shows adaptation in outlier detection, which provides an automatic detection to determine whether a data sample is an outlier without a priori knowledge about the amount of the outliers. The effectiveness and efficiency of iLTS are demonstrated on both simulated examples and real-world applications. A Matlab package is provided to researchers exploiting crowdsourcing paired comparison data for robust ranking.

[1]  K. Arrow Social Choice and Individual Values , 1951 .

[2]  Stanley Osher,et al.  Enhanced statistical rankings via targeted data collection , 2013, ICML.

[3]  J. Marsden,et al.  A mathematical introduction to fluid mechanics , 1979 .

[4]  Yiyuan She,et al.  Outlier Detection Using Nonconvex Penalized Regression , 2010, ArXiv.

[5]  Arun Rajkumar,et al.  A Statistical Convergence Perspective of Algorithms for Rank Aggregation from Pairwise Data , 2014, ICML.

[6]  G. E. Noether,et al.  Remarks about a paired comparison model , 1960 .

[7]  Aichi Chien,et al.  An L1-based variational model for Retinex theory and its application to medical images , 2011, CVPR 2011.

[8]  Ming Yan,et al.  Exact Low-Rank Matrix Completion from Sparsely Corrupted Entries Via Adaptive Outlier Pursuit , 2013, J. Sci. Comput..

[9]  Christos Faloutsos,et al.  LOCI: fast outlier detection using the local correlation integral , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[10]  Ragnhild Eg,et al.  Randomised pair comparison: an economic and robust method for audiovisual quality assessment , 2010, NOSSDAV.

[11]  Chin-Laung Lei,et al.  Quadrant of euphoria: a crowdsourcing platform for QoE assessment , 2010, IEEE Network.

[12]  Robert D. Nowak,et al.  Active Ranking using Pairwise Comparisons , 2011, NIPS.

[13]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[14]  David A. Forsyth,et al.  Utility data annotation with Amazon Mechanical Turk , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Jing Yuan,et al.  Convex Hodge Decomposition and Regularization of Image Flows , 2009, Journal of Mathematical Imaging and Vision.

[16]  Qingming Huang,et al.  Robust evaluation for quality of experience in crowdsourcing , 2013, ACM Multimedia.

[17]  Qingming Huang,et al.  Online HodgeRank on Random Graphs for Crowdsourceable QoE Evaluation , 2014, IEEE Transactions on Multimedia.

[18]  Ming Yan,et al.  Restoration of Images Corrupted by Impulse Noise and Mixed Gaussian Impulse Noise using Blind Inpainting , 2013, SIAM J. Imaging Sci..

[19]  Ming Yan,et al.  Robust 1-bit Compressive Sensing Using Adaptive Outlier Pursuit , 2012, IEEE Transactions on Signal Processing.

[20]  Qingming Huang,et al.  Online crowdsourcing subjective image quality assessment , 2012, ACM Multimedia.

[21]  H. A. David,et al.  The method of paired comparisons , 1966 .

[22]  T. L. Saaty A Scaling Method for Priorities in Hierarchical Structures , 1977 .

[23]  Yi-Hsuan Yang,et al.  1000 songs for emotional analysis of music , 2013, CrowdMM '13.

[24]  Patrick Le Callet,et al.  Subjective quality assessment IRCCyN/IVC database , 2004 .

[25]  Quoc V. Le,et al.  Abstract , 2003, Appetite.

[26]  Tobias Hoßfeld,et al.  From Packets to People: Quality of Experience as a New Measurement Challenge , 2013, Data Traffic Monitoring and Analysis.

[27]  Asuman E. Ozdaglar,et al.  Flows and Decompositions of Games: Harmonic and Potential Games , 2010, Math. Oper. Res..

[28]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[29]  Chin-Laung Lei,et al.  A crowdsourceable QoE evaluation framework for multimedia content , 2009, ACM Multimedia.

[30]  N. Wormald Models of random regular graphs , 2010 .

[31]  A. Madansky Identification of Outliers , 1988 .

[32]  Jinfeng Yi,et al.  Inferring Users' Preferences from Crowdsourced Pairwise Comparisons: A Matrix Completion Approach , 2013, HCOMP.

[33]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[34]  Mário A. T. Figueiredo,et al.  Robust binary fused compressive sensing using adaptive outlier pursuit , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[35]  Paul N. Bennett,et al.  Pairwise ranking aggregation in a crowdsourced setting , 2013, WSDM.

[36]  S. Osher,et al.  Statistical ranking using the $l^{1}$-norm on graphs , 2013 .

[37]  Anil N. Hirani,et al.  Least Squares Ranking on Graphs , 2010, 1011.1716.

[38]  Qingming Huang,et al.  Random partial paired comparison for subjective video quality assessment via hodgerank , 2011, ACM Multimedia.

[39]  Stefanie Nowak,et al.  How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation , 2010, MIR '10.

[40]  B. Ripley,et al.  Robust Statistics , 2018, Wiley Series in Probability and Statistics.

[41]  N. Wormald,et al.  Models of the , 2010 .

[42]  Omar Alonso,et al.  Crowdsourcing for relevance evaluation , 2008, SIGF.

[43]  Phuoc Tran-Gia,et al.  Best Practices for QoE Crowdtesting: QoE Assessment With Crowdsourcing , 2014, IEEE Transactions on Multimedia.

[44]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[45]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[46]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[47]  Yuan Yao,et al.  Statistical ranking and combinatorial Hodge theory , 2008, Math. Program..

[48]  M. Ries,et al.  Impact of screening technique on crowdsourcing QoE assessments , 2012, Proceedings of 22nd International Conference Radioelektronika 2012.

[49]  Christian Keimel,et al.  Challenges in crowd-based video quality assessment , 2012, 2012 Fourth International Workshop on Quality of Multimedia Experience.

[50]  Stella Yu,et al.  Angular Embedding: A Robust Quadratic Criterion , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Tie-Yan Liu,et al.  Adapting ranking SVM to document retrieval , 2006, SIGIR.

[52]  Chin-Laung Lei,et al.  Crowdsourcing Multimedia QoE Evaluation: A Trusted Framework , 2013, IEEE Transactions on Multimedia.

[53]  Qingming Huang,et al.  HodgeRank on Random Graphs for Subjective Video Quality Assessment , 2012, IEEE Transactions on Multimedia.

[54]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[55]  Nir Ailon,et al.  An Active Learning Algorithm for Ranking from Pairwise Preferences with an Almost Optimal Query Complexity , 2010, J. Mach. Learn. Res..

[56]  Mehryar Mohri,et al.  Magnitude-preserving ranking algorithms , 2007, ICML '07.

[57]  Stella X. Yu,et al.  Angular embedding: From jarring intensity differences to perceived luminance , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[59]  João Magalhães,et al.  Crowdsourcing for affective-interaction in computer games , 2013, CrowdMM '13.

[60]  Weisi Lin,et al.  Perceptual visual quality metrics: A survey , 2011, J. Vis. Commun. Image Represent..

[61]  Yannis Sismanis,et al.  How I won the "Chess Ratings - Elo vs the Rest of the World" Competition , 2010, ArXiv.