Low-Complexity, Nonintrusive Speech Quality Assessment

Monitoring of speech quality in emerging heterogeneous networks is of great interest to network operators. The most efficient way to satisfy such a need is through nonintrusive, objective speech quality assessment. In this paper, we describe a low-complexity algorithm for monitoring the speech quality over a network. The features used in the proposed algorithm can be computed from commonly used speech-coding parameters. Reconstruction and perceptual transformation of the signal is not performed. The critical advantage of the approach lies in generating quality assessment ratings without explicit distortion modeling. The results from the performed experiments indicate that the proposed nonintrusive objective quality measure performs better than the ITU-T P.563 standard

[1]  A. W. Rix,et al.  Quality VoIP — An Engineering Challenge , 2001 .

[2]  Doh-Suk Kim,et al.  Perceptual model for non-intrusive speech quality assessment , 2006, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  John G. Beerends,et al.  A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .

[4]  Steven Kay,et al.  Fundamentals Of Statistical Signal Processing , 2001 .

[5]  Thomas Eriksson,et al.  Time evolution in LPC spectrum coding , 2004, IEEE Transactions on Speech and Audio Processing.

[6]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[8]  Stephen D. Voran,et al.  A simplified version of the ITU algorithm for objective measurement of speech codec quality , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[9]  M. Goldstein,et al.  Classification of methods used for assessment of text-to-speech systems according to the demands placed on the listener , 1995, Speech Commun..

[10]  METHODS FOR SUBJECTIVE DETERMINATION OF TRANSMISSION QUALITY Summary , 2022 .

[11]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[12]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Josef Kittler,et al.  Pattern recognition : a statistical approach , 1982 .

[14]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[15]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[16]  Vijay Parsa,et al.  Bayesian model based non-intrusive speech quality evaluation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[17]  Stephen D. Voran,et al.  Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique , 1999, IEEE Trans. Speech Audio Process..

[18]  W. Bastiaan Kleijn,et al.  Spectral dynamics is more important than spectral distortion , 1995, ICASSP.

[19]  Robert F. Kubichek,et al.  Output-based objective speech quality , 1994, Proceedings of IEEE Vehicular Technology Conference (VTC).

[20]  Peter Vary,et al.  Quality control for AMR speech channels in GSM networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Stephen D. Voran,et al.  Objective estimation of perceived speech quality .II. Evaluation of the measuring normalizing block technique , 1999, IEEE Trans. Speech Audio Process..

[22]  Doh-Suk Kim,et al.  ANIQUE: An Auditory Model for Single-Ended Speech Quality Estimation , 2005, IEEE Trans. Speech Audio Process..

[23]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[24]  W. Bastiaan Kleijn,et al.  A 5.85 kbits CELP algorithm for cellular applications , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  Robert B. Dunn,et al.  Speech enhancement based on auditory spectral change , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[26]  Rajiv Laroia,et al.  Robust and efficient quantization of speech LSP parameters using structured vector quantizers , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[27]  J. Berger,et al.  P.563—The ITU-T Standard for Single-Ended Speech Quality Assessment , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[28]  Mike P. Hollier,et al.  Non-intrusive speech-quality assessment using vocal-tract models , 2000 .

[29]  James W. Beauchamp,et al.  Synthesis by Spectral Amplitude and "Brightness" Matching of Analyzed Musical Instrument Tones , 1981 .

[30]  Sugato Chakravarty,et al.  Method for the subjective assessment of intermedi-ate quality levels of coding systems , 2001 .

[31]  Andrew Sekey,et al.  An Objective Measure for Predicting Subjective Quality of Speech Coders , 1992, IEEE J. Sel. Areas Commun..

[32]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[33]  Josef Kittler,et al.  Floating search methods for feature selection with nonmonotonic criterion functions , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[34]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[35]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[36]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[37]  Tiago H. Falk,et al.  Non-intrusive GMM-based speech quality measurement , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[38]  O. C. Au,et al.  A novel output-based objective speech quality measure for wireless communication , 1998, ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344).

[39]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[40]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .