A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms

Measurement of visual quality is of fundamental importance for numerous image and video processing applications, where the goal of quality assessment (QA) algorithms is to automatically assess the quality of images or videos in agreement with human quality judgments. Over the years, many researchers have taken different approaches to the problem and have contributed significant research in this area and claim to have made progress in their respective domains. It is important to evaluate the performance of these algorithms in a comparative setting and analyze the strengths and weaknesses of these methods. In this paper, we present results of an extensive subjective quality assessment study in which a total of 779 distorted images were evaluated by about two dozen human subjects. The "ground truth" image quality data obtained from about 25 000 individual human quality judgments is used to evaluate the performance of several prominent full-reference image quality assessment algorithms. To the best of our knowledge, apart from video quality studies conducted by the Video Quality Experts Group, the study presented in this paper is the largest subjective image quality study in the literature in terms of number of images, distortion types, and number of human judgments per image. Moreover, we have made the data from the study freely available to the research community . This would allow other researchers to easily report comparative results in the future

[1]  Maliha S. Nash,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 2001, Technometrics.

[2]  Etienne E. Kerre,et al.  Using similarity measures and homogeneity for the comparison of images , 2004, Image Vis. Comput..

[3]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[4]  V. Ralph Algazi,et al.  Objective picture quality scale (PQS) for image coding , 1998, IEEE Trans. Commun..

[5]  修 杉本,et al.  VQEG(Video Quality Experts Group)の動向と関連技術 , 2008 .

[6]  Zhou Wang,et al.  Multi-scale structural similarity for image quality assessment , 2003 .

[7]  Sugato Chakravarty,et al.  Methodology for the subjective assessment of the quality of television pictures , 1995 .

[8]  Andrew B. Watson,et al.  DCTune: A TECHNIQUE FOR VISUAL OPTIMIZATION OF DCT QUANTIZATION MATRICES FOR INDIVIDUAL IMAGES. , 1993 .

[9]  Gary W. Meyer,et al.  Comparison of two image quality models , 1998, Electronic Imaging.

[10]  D. Sheskin Handbook of parametric and nonparametric statistical procedures, 2nd ed. , 2000 .

[11]  Hocine Cherifi,et al.  A comparison of image quality models and metrics based on human visual sensitivity , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[12]  Jerome R. Cox,et al.  Experimental evaluation of psychophysical distortion metrics for JPEG-encoded images , 1995, J. Electronic Imaging.

[13]  Wilson S. Geisler,et al.  Image quality assessment based on a degradation model , 2000, IEEE Trans. Image Process..

[14]  Alan C. Bovik,et al.  Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Jean-Bernard Martens,et al.  Quality asessment of coded images using numerical category scaling , 1995, Other Conferences.

[16]  Paul S. Fisher,et al.  Image quality measures and their performance , 1995, IEEE Trans. Commun..

[17]  Gustavo de Veciana,et al.  An information fidelity criterion for image quality assessment using natural scene statistics , 2005, IEEE Transactions on Image Processing.

[18]  Scott J. Daly,et al.  Visible differences predictor: an algorithm for the assessment of image fidelity , 1992, Electronic Imaging.

[19]  Bülent Sankur,et al.  Statistical evaluation of image quality measures , 2002, J. Electronic Imaging.

[20]  Philip Corriveau,et al.  Video Quality Experts Group , 2005 .

[21]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2002, The Kluwer International Series in Engineering and Computer Science.

[22]  D. Sheskin Handbook of Parametric and Nonparametric Statistical Procedures: Third Edition , 2000 .

[23]  Zhou Wang,et al.  Stimulus synthesis for efficient evaluation and refinement of perceptual image quality metrics , 2004, IS&T/SPIE Electronic Imaging.