Distribution Preserving Quantization

In the lossy coding of perceptually relevant signals, such as sound and images, the ultimate goal is to achieve good perceived quality of the reconstructed signal, under a constraint on the bit-rat ...

[1]  M. Rosenblatt Remarks on a Multivariate Transformation , 1952 .

[2]  Bin Yu,et al.  Perceptual audio coding using adaptive pre- and post-filters and lossless compression , 2002, IEEE Trans. Speech Audio Process..

[3]  Jeroen Breebaart,et al.  ADVANCES IN PARAMETRIC CODING FOR HIGH-QUALITY AUDIO , 2003 .

[4]  Chi-Min Liu,et al.  Compression Artifacts in Perceptual Audio Coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  H. McKean Geometry of Differential Space , 1973 .

[6]  Michael M. Goodwin,et al.  Matching pursuit with damped sinusoids , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  W. Bastiaan Kleijn,et al.  Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[8]  W. Bastiaan Kleijn,et al.  Speech Quality Assessment , 2008 .

[9]  Manuel Rosa-Zurera,et al.  Transient modeling by matching pursuits with a wavelet dictionary for parametric audio coding , 2004, IEEE Signal Processing Letters.

[10]  Uri Erez,et al.  Achieving 1/2 log (1+SNR) on the AWGN channel with lattice encoding and decoding , 2004, IEEE Transactions on Information Theory.

[11]  Markus Erne Perceptual Audio Coders "What to listen for" , 2001 .

[12]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. I. , 1962 .

[13]  Alan McCree,et al.  Low-Bit-Rate Speech Coding , 2008 .

[14]  Tony Ezzat,et al.  Spectro-temporal analysis of speech using 2-d Gabor filters , 2007, INTERSPEECH.

[15]  Simon Litsyn,et al.  Lattices which are good for (almost) everything , 2005, IEEE Transactions on Information Theory.

[16]  Andrew P. Bradley,et al.  A wavelet visible difference predictor , 1999, IEEE Trans. Image Process..

[17]  Gregory Poltyrev,et al.  On coding without restrictions for the AWGN channel , 1993, IEEE Trans. Inf. Theory.

[18]  Stefan Winkler,et al.  Issues in vision modeling for perceptual video quality assessment , 1999, Signal Process..

[19]  P. F. Panter,et al.  Quantization distortion in pulse-count modulation with nonuniform spacing of levels , 1951, Proceedings of the IRE.

[20]  Mounya Elhilali,et al.  A spectro-temporal modulation index (STMI) for assessment of speech intelligibility , 2003, Speech Commun..

[21]  Sae-Young Chung,et al.  Sphere-bound-achieving coset codes and multilevel coset codes , 2000, IEEE Trans. Inf. Theory.

[22]  Vivek K. Goyal,et al.  Transform coding with backward adaptive updates , 2000, IEEE Trans. Inf. Theory.

[23]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[24]  C. Krumhansl Concerning the applicability of geometric models to similarity data: The interrelationship between similarity and spatial density. , 1978 .

[25]  Tamás Linder,et al.  High-Resolution Source Coding for Non-Difference Distortion Measures: Multidimensional Companding , 1999, IEEE Trans. Inf. Theory.

[26]  Vladimir Cuperman,et al.  Matching pursuits sinusoidal speech coding , 2003, IEEE Trans. Speech Audio Process..

[27]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[28]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[29]  James Glass,et al.  Research Developments and Directions in Speech Recognition and Understanding, Part 1 , 2009 .

[30]  T. Dau,et al.  A computational model of human auditory signal processing and perception. , 2008, The Journal of the Acoustical Society of America.

[31]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[32]  Kenneth Mullen,et al.  A multidimensional stochastic theory of similiarity , 1988 .

[33]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[34]  N. Jayant,et al.  Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[35]  W. Bastiaan Kleijn,et al.  Quantization with Constrained Relative Entropy and Its Application to Audio Coding , 2009 .

[36]  P. Noll,et al.  MPEG digital audio coding , 1997, IEEE Signal Process. Mag..

[37]  Alexander Kain,et al.  Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[38]  Jeffrey Scott Vitter,et al.  Arithmetic coding for data compression , 1994 .

[39]  W. Bastiaan Kleijn,et al.  A Low-Delay Audio Coder with Constrained-Entropy Quantization , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[40]  Philip A. Chou,et al.  Entropy-constrained vector quantization , 1989, IEEE Trans. Acoust. Speech Signal Process..

[41]  W. Bastiaan Kleijn,et al.  A Bayesian approach to non-intrusive quality assessment of speech , 2009, INTERSPEECH.

[42]  W. Bastiaan Kleijn,et al.  KLT-based adaptive classified VQ of the speech signal , 2004, IEEE Transactions on Speech and Audio Processing.

[43]  Barry Vercoe,et al.  Structured audio: creation, transmission, and rendering of parametric sound representations , 1998, Proc. IEEE.

[44]  M. Abramowitz,et al.  Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[45]  Joel A. Tropp,et al.  Just relax: convex programming methods for identifying sparse signals in noise , 2006, IEEE Transactions on Information Theory.

[46]  R. Nosofsky Attention, similarity, and the identification-categorization relationship. , 1986 .

[47]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[48]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[49]  Ahmed H. Tewfik,et al.  Low bit rate high quality audio coding with combined harmonic and wavelet representations , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[50]  Simone Santini,et al.  Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[52]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[53]  Richard Heusdens,et al.  A new psychoacoustical masking model for audio coding applications , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[54]  Frank Hartung,et al.  Multimedia watermarking techniques , 1999, Proc. IEEE.

[55]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[56]  Doh-Suk Kim,et al.  ANIQUE+: A new American national standard for non-intrusive estimation of narrowband speech quality , 2007, Bell Labs Technical Journal.

[57]  T. Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. , 1996, The Journal of the Acoustical Society of America.

[58]  Lawrence G. Roberts,et al.  Picture coding using pseudo-random noise , 1962, IRE Trans. Inf. Theory.

[59]  M. Hirano Objective evaluation of the human voice: clinical aspects. , 1989, Folia phoniatrica.

[60]  Hans-Andrea Loeliger,et al.  Averaging bounds for lattices and linear codes , 1997, IEEE Trans. Inf. Theory.

[61]  A. Tversky,et al.  Similarity of rectangles: An analysis of subjective dimensions , 1975 .

[62]  Robert M. Gray,et al.  High-resolution quantization theory and the vector quantizer advantage , 1989, IEEE Trans. Inf. Theory.

[63]  W. Bastiaan Kleijn,et al.  Flexcode - flexible audio coding , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[64]  Toby Berger,et al.  Lossy Source Coding , 1998, IEEE Trans. Inf. Theory.

[65]  A. Tversky,et al.  Similarity, Separability, and the Triangle Inequality , 1982 .

[66]  Peter Jax,et al.  Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding? , 2006, IEEE Communications Magazine.

[67]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[68]  Avideh Zakhor,et al.  Very low bit-rate video coding based on matching pursuits , 1997, IEEE Trans. Circuits Syst. Video Technol..

[69]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[70]  Christof Faller,et al.  Binaural cue coding-Part II: Schemes and applications , 2003, IEEE Trans. Speech Audio Process..

[71]  Alan C. Bovik,et al.  Image information and visual quality , 2006, IEEE Trans. Image Process..

[72]  W. Bastiaan Kleijn,et al.  An efficient stochastically excited linear predictive coding algorithm for high quality low bit rate transmission of speech , 1988, Speech Commun..

[73]  J. Pearl,et al.  Comparison of the cosine and Fourier transforms of Markov-1 signals , 1976 .

[74]  Per Hedelin A tone oriented voice excited vocoder , 1981, ICASSP.

[75]  W. Bastiaan Kleijn,et al.  Low-Complexity, Nonintrusive Speech Quality Assessment , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[76]  R. Zamir Lattices are everywhere , 2009, 2009 Information Theory and Applications Workshop.

[77]  Robert A. Wannamaker Psychoacoustically Optimal Noise Shaping , 1992 .

[78]  Yuval Kochman,et al.  Achieving the Gaussian Rate–Distortion Function by Prediction , 2007, IEEE Transactions on Information Theory.

[79]  W. Bastiaan Kleijn,et al.  Rate-distortion optimized quantization in multistage audio coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[80]  Brockway McMillan,et al.  Two inequalities implied by unique decipherability , 1956, IRE Trans. Inf. Theory.

[81]  R. Nelsen An Introduction to Copulas , 1998 .

[82]  Oded Ghitza,et al.  Objective Assessment of Speech and Audio Quality - Technology and Applications , 2006, IEEE Trans. Speech Audio Process..

[83]  Markus Flierl,et al.  A Motion-Compensated Orthogonal Transform with Energy-Concentration Constraint , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[84]  Alex Pentland,et al.  Coding, Analysis, Interpretation, and Recognition of Facial Expressions , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[85]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[86]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[87]  Robert M. Gray,et al.  Dithered quantizers , 1993, IEEE Trans. Inf. Theory.

[88]  C. Shannon Coding Theorems for a Discrete Source With a Fidelity Criterion-Claude , 2009 .

[89]  B. Kollmeier,et al.  Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. , 1997, The Journal of the Acoustical Society of America.

[90]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[91]  Tiago H. Falk,et al.  Single-Ended Speech Quality Measurement Using Machine Learning Methods , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[92]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[93]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[94]  Vinay A. Vaishampayan,et al.  Design of multiple description scalar quantizers , 1993, IEEE Trans. Inf. Theory.

[95]  B. Julesz,et al.  Spatial-frequency masking in vision: critical bands and spread of masking. , 1972, Journal of the Optical Society of America.

[96]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[97]  Tamás Linder,et al.  A Lagrangian formulation of Zador's entropy-constrained quantization theorem , 2002, IEEE Trans. Inf. Theory.

[98]  F. Gregory Ashby,et al.  Toward a Unified Theory of Similarity and Recognition , 1988 .

[99]  Robert Ulichney,et al.  Digital Halftoning , 1987 .

[100]  Marc Schröder,et al.  Emotional speech synthesis: a review , 2001, INTERSPEECH.

[101]  Okamoto Keiji,et al.  A signal transmission. , 1993 .

[102]  Zhou Wang,et al.  Modern Image Quality Assessment , 2006, Modern Image Quality Assessment.

[103]  Weisi Lin,et al.  Non-intrusive Speech Quality Assessment with Support Vector Regression , 2010, MMM.

[104]  James Arvo,et al.  A framework for realistic image synthesis , 1997, SIGGRAPH.

[105]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[106]  Herbert Gish,et al.  Asymptotically efficient quantizing , 1968, IEEE Trans. Inf. Theory.

[107]  Edward J. Delp,et al.  Moment preserving quantization [signal processing] , 1991, IEEE Trans. Commun..

[108]  Christof Faller,et al.  Binaural cue coding-Part I: psychoacoustic fundamentals and design principles , 2003, IEEE Trans. Speech Audio Process..

[109]  Kenneth Rose,et al.  A mapping approach to rate-distortion computation and analysis , 1994, IEEE Trans. Inf. Theory.

[110]  A. Ng Feature selection, L1 vs. L2 regularization, and rotational invariance , 2004, Twenty-first international conference on Machine learning - ICML '04.

[111]  S. Mallat A wavelet tour of signal processing , 1998 .

[112]  Joel A. Tropp,et al.  Greed is good: algorithmic results for sparse approximation , 2004, IEEE Transactions on Information Theory.

[113]  W.B. Kleijn,et al.  Flexible Quantization of Audio and Speech based on the Autoregressive Model , 2007, 2007 Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers.

[114]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[115]  Joel Max,et al.  Quantizing for minimum distortion , 1960, IRE Trans. Inf. Theory.

[116]  P. Schultheiss,et al.  Block Quantization of Correlated Gaussian Random Variables , 1963 .

[117]  Torsten Dau,et al.  Auditory processing models , 2008 .

[118]  Zhifeng Zhang,et al.  Adaptive time-frequency decompositions , 1994 .

[119]  Lawrence R. Rabiner,et al.  Perceptual evaluation of the effects of dither on low bit rate PCM systems , 1972 .

[120]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[121]  L. Schuchman Dither Signals and Their Effect on Quantization Noise , 1964 .

[122]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[123]  L. L. Elliott Backward and Forward Masking , 1971 .

[124]  Roch Lefebvre,et al.  A wideband speech and audio codec at 16/24/32 kbit/s using hybrid ACELP/TCX techniques , 1999, 1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351).

[125]  T Dau,et al.  A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[126]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[127]  Christof Faller Parametric multichannel audio coding: synthesis of coherence cues , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[128]  B. Kollmeier,et al.  Modeling auditory processing of amplitude modulation. II. Spectral and temporal integration. , 1997, The Journal of the Acoustical Society of America.

[129]  Alan C. Bovik,et al.  Automatic prediction of perceptual quality of multimedia signals—a survey , 2010, Multimedia Tools and Applications.

[130]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[131]  Doaa Mohammed Image Compression Using Block Truncation Coding , 2011 .

[132]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[133]  Minjie Xie,et al.  ITU-T G.722.1 Annex C: A New Low-Complexity 14 KHZ Audio Coding Standard , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[134]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[135]  Tamás Linder,et al.  On the asymptotic tightness of the Shannon lower bound , 1994, IEEE Trans. Inf. Theory.

[136]  Jonathan Baxter,et al.  The Canonical Distortion Measure for Vector Quantization and Function Approximation , 1997, ICML.

[137]  Renato Vicente,et al.  An information-theoretic approach to statistical dependence: Copula information , 2009, ArXiv.

[138]  R. Tibshirani,et al.  An introduction to the bootstrap , 1993 .

[139]  R. Gray,et al.  Dithered Quantizers , 1993, Proceedings. 1991 IEEE International Symposium on Information Theory.

[140]  Donald P. Greenberg,et al.  A model of visual masking for computer graphics , 1997, SIGGRAPH.

[141]  Kiyoharu Aizawa,et al.  Model-based image coding advanced video coding techniques for very low bit-rate applications , 1995, Proc. IEEE.

[142]  E. Holman Monotonic models for asymmetric proximities , 1979 .

[143]  T J Sejnowski,et al.  Learning the higher-order structure of a natural sound. , 1996, Network.

[144]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[145]  Edward Jones,et al.  Audio quality assessment techniques - A review, and recent developments , 2009, Signal Process..

[146]  Meir Feder,et al.  On universal quantization by randomized uniform/lattice quantizers , 1992, IEEE Trans. Inf. Theory.

[147]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[148]  Torsten Dau,et al.  Masking patterns for sinusoidal and narrow-band noise maskers. , 1998, The Journal of the Acoustical Society of America.

[149]  Matti Karjalainen,et al.  A new auditory model for the evaluation of sound quality of audio systems , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[150]  Paul L. Zador,et al.  Asymptotic quantization error of continuous signals and the quantization dimension , 1982, IEEE Trans. Inf. Theory.

[151]  W. R. Bennett,et al.  Spectra of quantized signals , 1948, Bell Syst. Tech. J..

[152]  Allen Gersho,et al.  Adaptive postfiltering for quality enhancement of coded speech , 1995, IEEE Trans. Speech Audio Process..

[153]  Birger Kollmeier,et al.  PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[154]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[155]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.

[156]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[157]  P. Billingsley,et al.  Ergodic theory and information , 1966 .

[158]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[159]  James R. Glass,et al.  Developments and directions in speech recognition and understanding, Part 1 [DSP Education] , 2009, IEEE Signal Processing Magazine.

[160]  R. L. Wegel,et al.  The Auditory Masking of One Pure Tone by Another and its Probable Relation to the Dynamics of the Inner Ear , 1924 .

[161]  S. S. Stevens,et al.  The Masking of Pure Tones and of Speech by White Noise , 1950 .

[162]  W. Bastiaan Kleijn,et al.  Predictive audio coding using rate-distortion-optimal pre- and post-filtering , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[163]  Rüdiger L. Urbanke,et al.  Lattice Codes Can Achieve Capacity on the AWGN Channel , 1998, IEEE Trans. Inf. Theory.

[164]  R. Vafin,et al.  Sinusoidal modeling using psychoacoustic-adaptive matching pursuits , 2002, IEEE Signal Processing Letters.

[165]  D. Huffman A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[166]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[167]  R. Zamir,et al.  On lattice quantization noise , 1994, Proceedings of 1994 IEEE International Symposium on Information Theory.

[168]  Alan C. Bovik,et al.  No-reference quality assessment using natural scene statistics: JPEG2000 , 2005, IEEE Transactions on Image Processing.

[169]  Jordi Ribas-Corbera,et al.  Rate control in DCT video coding for low-delay communications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[170]  R. Gallager Information Theory and Reliable Communication , 1968 .

[171]  P. Mabilleau,et al.  Fast CELP coding based on algebraic codes , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[172]  Henrique S. Malvar,et al.  The LOT: transform coding without blocking effects , 1989, IEEE Trans. Acoust. Speech Signal Process..

[173]  Robert M. Gray,et al.  Asymptotic Performance of Vector Quantizers with a Perceptual Distortion Measure , 1997, IEEE Trans. Inf. Theory.

[174]  Robert M. Gray,et al.  Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[175]  Annie Cuyt,et al.  Gamma function and related functions , 2008 .

[176]  W. Bastiaan Kleijn,et al.  Distribution Preserving Quantization With Dithering and Transformation , 2010, IEEE Signal Processing Letters.

[177]  Adrian Segall Bit allocation and encoding for vector sources , 1976, IEEE Trans. Inf. Theory.

[178]  Allen Gersho,et al.  Globally optimal vector quantizer design by stochastic relaxation , 1992, IEEE Trans. Signal Process..

[179]  W. Bastiaan Kleijn,et al.  The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[180]  N. Ruiz Reyes,et al.  Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[181]  Tai-Shih Chi,et al.  Perception-based objective speech quality assessment , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[182]  David J. Sakrison,et al.  A geometric treatment of the source encoding of a Gaussian random variable , 1968, IEEE Trans. Inf. Theory.

[183]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[184]  R. Shepard,et al.  Toward a universal law of generalization for psychological science. , 1987, Science.

[185]  Vivek K. Goyal,et al.  Theoretical foundations of transform coding , 2001, IEEE Signal Process. Mag..

[186]  Thomas M. Cover,et al.  Elements of information theory (2. ed.) , 2006 .

[187]  Michael Randolph Garey,et al.  The complexity of the generalized Lloyd - Max problem , 1982, IEEE Trans. Inf. Theory.

[188]  Toby Berger,et al.  Rate distortion theory : a mathematical basis for data compression , 1971 .

[189]  James Hu,et al.  DVQ: A digital video quality metric based on human vision , 2001 .

[190]  Thomas P. Barnwell,et al.  MCCREE AND BARNWELL MIXED EXCITAmON LPC VOCODER MODEL LPC SYNTHESIS FILTER 243 SYNTHESIZED SPEECH-PERIODIC PULSE TRAIN-1 PERIODIC POSITION JITTER PULSE 4 , 2004 .

[191]  Kristofer Kjörling,et al.  Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[192]  Peter Jax,et al.  Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1 , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[193]  Olivier Verscheure,et al.  Perceptual quality measure using a spatiotemporal model of the human visual system , 1996, Electronic Imaging.

[194]  John Princen,et al.  Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[195]  Meir Feder,et al.  Information rates of pre/post-filtered dithered quantizers , 1993, IEEE Trans. Inf. Theory.

[196]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[197]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[198]  A. Tversky Features of Similarity , 1977 .

[199]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..