Scalable image quality assessment with 2D mel-cepstrum and machine learning approach

Measurement of image quality is of fundamental importance to numerous image and video processing applications. Objective image quality assessment (IQA) is a two-stage process: (a) extracting the important information and discarding what is redundant, and (b) pooling the detected features using appropriate weights. Both stages are difficult to tackle because of the complex nature of the human visual system (HVS). In this paper, we first investigate image features based on the two-dimensional (2D) mel-cepstrum for the purpose of IQA. We show that these features are effective because they can represent structural information, which is crucial for IQA. They are also beneficial in a reduced-reference scenario, where only partial reference image information is used for quality assessment. We address the second stage by exploiting machine learning. In our opinion, the well-established methodology of machine learning and pattern recognition has not been adequately used for IQA so far; we believe that it will be an effective tool for feature pooling, since the required weights and parameters can be determined in a more convincing way by training against ground truth obtained from subjective scores. This helps to overcome the limitations of existing pooling methods, which tend to be oversimplistic and to lack theoretical justification. We therefore propose a new metric by formulating IQA as a pattern recognition problem. Extensive experiments on six publicly available image databases (3211 images in total, with diverse distortions) and one video database (78 video sequences) demonstrate the effectiveness and efficiency of the proposed metric in comparison with seven relevant existing metrics.
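The two stages described above can be illustrated with a minimal Python sketch. It is not the metric proposed in the paper. The log-warped band grid, the function names (`mel_cepstrum_2d`, `quality_features`, `fit_pooling`, `predict_quality`), the band count, and the use of ridge regression as the learned pooling step are all assumptions made for illustration; the abstract does not specify the filter grid or the learning machine.

```python
import numpy as np

def _band_index(n, n_bands):
    """Map each DFT bin along one axis to a log-warped band (assumed grid)."""
    f = np.abs(np.fft.fftfreq(n))               # |frequency| in [0, 0.5]
    warped = np.log1p(f * n) / np.log1p(0.5 * n)  # fine at low f, coarse at high f
    return np.minimum((warped * n_bands).astype(int), n_bands - 1)

def mel_cepstrum_2d(image, n_bands=8):
    """Simplified 2D mel-cepstrum-style feature vector (illustrative only)."""
    img = np.asarray(image, dtype=float)
    mag = np.abs(np.fft.fft2(img))              # 2D magnitude spectrum
    r = _band_index(img.shape[0], n_bands)
    c = _band_index(img.shape[1], n_bands)
    energy = np.zeros((n_bands, n_bands))
    # Accumulate spectral magnitude into the non-uniform band grid.
    np.add.at(energy, (r[:, None], c[None, :]), mag)
    log_energy = np.log(energy + 1e-8)          # small offset avoids log(0)
    cepstrum = np.real(np.fft.ifft2(log_energy))
    return cepstrum.ravel()

def quality_features(reference, distorted, n_bands=8):
    """Stage (a): per-coefficient difference between the two cepstra."""
    return np.abs(mel_cepstrum_2d(reference, n_bands)
                  - mel_cepstrum_2d(distorted, n_bands))

def fit_pooling(features, scores, ridge=1e-3):
    """Stage (b): learn pooling weights from subjective scores.

    Ridge regression is a stand-in chosen for brevity, not the paper's learner.
    """
    X = np.hstack([features, np.ones((len(features), 1))])  # bias column
    A = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ np.asarray(scores, dtype=float))

def predict_quality(features, weights):
    """Apply the learned weights to new feature vectors."""
    X = np.hstack([features, np.ones((len(features), 1))])
    return X @ weights
```

In use, one would stack `quality_features(ref, dist)` for each training pair into a matrix, call `fit_pooling` with the corresponding subjective scores, and then score unseen pairs with `predict_quality`. Only the reference cepstrum, rather than the full reference image, is needed at scoring time, which is the sense in which such features suit a reduced-reference setting.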
