论文信息 - Distribution Preserving Quantization

Distribution Preserving Quantization

In the lossy coding of perceptually relevant signals, such as sound and images, the ultimate goal is to achieve good perceived quality of the reconstructed signal, under a constraint on the bit-rat ...

Minyue Li | Minyue Li

[1] M. Rosenblatt. Remarks on a Multivariate Transformation , 1952 .

[2] Bin Yu,et al. Perceptual audio coding using adaptive pre- and post-filters and lossless compression , 2002, IEEE Trans. Speech Audio Process..

[3] Jeroen Breebaart,et al. ADVANCES IN PARAMETRIC CODING FOR HIGH-QUALITY AUDIO , 2003 .

[4] Chi-Min Liu,et al. Compression Artifacts in Perceptual Audio Coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[5] H. McKean. Geometry of Differential Space , 1973 .

[6] Michael M. Goodwin,et al. Matching pursuit with damped sinusoids , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7] W. Bastiaan Kleijn,et al. Encoding speech using prototype waveforms , 1993, IEEE Trans. Speech Audio Process..

[8] W. Bastiaan Kleijn,et al. Speech Quality Assessment , 2008 .

[9] Manuel Rosa-Zurera,et al. Transient modeling by matching pursuits with a wavelet dictionary for parametric audio coding , 2004, IEEE Signal Processing Letters.

[10] Uri Erez,et al. Achieving 1/2 log (1+SNR) on the AWGN channel with lattice encoding and decoding , 2004, IEEE Transactions on Information Theory.

[11] Markus Erne. Perceptual Audio Coders "What to listen for" , 2001 .

[12] R. Shepard. The analysis of proximities: Multidimensional scaling with an unknown distance function. I. , 1962 .

[13] Alan McCree,et al. Low-Bit-Rate Speech Coding , 2008 .

[14] Tony Ezzat,et al. Spectro-temporal analysis of speech using 2-d Gabor filters , 2007, INTERSPEECH.

[15] Simon Litsyn,et al. Lattices which are good for (almost) everything , 2005, IEEE Transactions on Information Theory.

[16] Andrew P. Bradley,et al. A wavelet visible difference predictor , 1999, IEEE Trans. Image Process..

[17] Gregory Poltyrev,et al. On coding without restrictions for the AWGN channel , 1993, IEEE Trans. Inf. Theory.

[18] Stefan Winkler,et al. Issues in vision modeling for perceptual video quality assessment , 1999, Signal Process..

[19] P. F. Panter,et al. Quantization distortion in pulse-count modulation with nonuniform spacing of levels , 1951, Proceedings of the IRE.

[20] Mounya Elhilali,et al. A spectro-temporal modulation index (STMI) for assessment of speech intelligibility , 2003, Speech Commun..

[21] Sae-Young Chung,et al. Sphere-bound-achieving coset codes and multilevel coset codes , 2000, IEEE Trans. Inf. Theory.

[22] Vivek K. Goyal,et al. Transform coding with backward adaptive updates , 2000, IEEE Trans. Inf. Theory.

[23] Yair Shoham,et al. Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[24] C. Krumhansl. Concerning the applicability of geometric models to similarity data: The interrelationship between similarity and spatial density. , 1978 .

[25] Tamás Linder,et al. High-Resolution Source Coding for Non-Difference Distortion Measures: Multidimensional Companding , 1999, IEEE Trans. Inf. Theory.

[26] Vladimir Cuperman,et al. Matching pursuits sinusoidal speech coding , 2003, IEEE Trans. Speech Audio Process..

[27] J. M. Foley,et al. Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[28] Thomas F. Quatieri,et al. Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[29] James Glass,et al. Research Developments and Directions in Speech Recognition and Understanding, Part 1 , 2009 .

[30] T. Dau,et al. A computational model of human auditory signal processing and perception. , 2008, The Journal of the Acoustical Society of America.

[31] R. Nosofsky. Attention, similarity, and the identification-categorization relationship. , 1986, Journal of experimental psychology. General.

[32] Kenneth Mullen,et al. A multidimensional stochastic theory of similiarity , 1988 .

[33] Robert M. Gray,et al. An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[34] N. Jayant,et al. Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[35] W. Bastiaan Kleijn,et al. Quantization with Constrained Relative Entropy and Its Application to Audio Coding , 2009 .

[36] P. Noll,et al. MPEG digital audio coding , 1997, IEEE Signal Process. Mag..

[37] Alexander Kain,et al. Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[38] Jeffrey Scott Vitter,et al. Arithmetic coding for data compression , 1994 .

[39] W. Bastiaan Kleijn,et al. A Low-Delay Audio Coder with Constrained-Entropy Quantization , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[40] Philip A. Chou,et al. Entropy-constrained vector quantization , 1989, IEEE Trans. Acoust. Speech Signal Process..

[41] W. Bastiaan Kleijn,et al. A Bayesian approach to non-intrusive quality assessment of speech , 2009, INTERSPEECH.

[42] W. Bastiaan Kleijn,et al. KLT-based adaptive classified VQ of the speech signal , 2004, IEEE Transactions on Speech and Audio Processing.

[43] Barry Vercoe,et al. Structured audio: creation, transmission, and rendering of parametric sound representations , 1998, Proc. IEEE.

[44] M. Abramowitz,et al. Handbook of Mathematical Functions With Formulas, Graphs and Mathematical Tables (National Bureau of Standards Applied Mathematics Series No. 55) , 1965 .

[45] Joel A. Tropp,et al. Just relax: convex programming methods for identifying sparse signals in noise , 2006, IEEE Transactions on Information Theory.

[46] R. Nosofsky. Attention, similarity, and the identification-categorization relationship. , 1986 .

[47] R. Hellman. Asymmetry of masking between noise and tone , 1972 .

[48] Abraham Lempel,et al. A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[49] Ahmed H. Tewfik,et al. Low bit rate high quality audio coding with combined harmonic and wavelet representations , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[50] Simone Santini,et al. Similarity Measures , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[51] Abraham Lempel,et al. Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[52] Thomas Sporer,et al. PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[53] Richard Heusdens,et al. A new psychoacoustical masking model for audio coding applications , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[54] Frank Hartung,et al. Multimedia watermarking techniques , 1999, Proc. IEEE.

[55] A. Spanias,et al. Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[56] Doh-Suk Kim,et al. ANIQUE+: A new American national standard for non-intrusive estimation of narrowband speech quality , 2007, Bell Labs Technical Journal.

[57] T. Dau,et al. A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. , 1996, The Journal of the Acoustical Society of America.

[58] Lawrence G. Roberts,et al. Picture coding using pseudo-random noise , 1962, IRE Trans. Inf. Theory.

[59] M. Hirano. Objective evaluation of the human voice: clinical aspects. , 1989, Folia phoniatrica.

[60] Hans-Andrea Loeliger,et al. Averaging bounds for lattices and linear codes , 1997, IEEE Trans. Inf. Theory.

[61] A. Tversky,et al. Similarity of rectangles: An analysis of subjective dimensions , 1975 .

[62] Robert M. Gray,et al. High-resolution quantization theory and the vector quantizer advantage , 1989, IEEE Trans. Inf. Theory.

[63] W. Bastiaan Kleijn,et al. Flexcode - flexible audio coding , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[64] Toby Berger,et al. Lossy Source Coding , 1998, IEEE Trans. Inf. Theory.

[65] A. Tversky,et al. Similarity, Separability, and the Triangle Inequality , 1982 .

[66] Peter Jax,et al. Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding? , 2006, IEEE Communications Magazine.

[67] Thomas Baer,et al. A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[68] Avideh Zakhor,et al. Very low bit-rate video coding based on matching pursuits , 1997, IEEE Trans. Circuits Syst. Video Technol..

[69] Nathalie Virag,et al. Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[70] Christof Faller,et al. Binaural cue coding-Part II: Schemes and applications , 2003, IEEE Trans. Speech Audio Process..

[71] Alan C. Bovik,et al. Image information and visual quality , 2006, IEEE Trans. Image Process..

[72] W. Bastiaan Kleijn,et al. An efficient stochastically excited linear predictive coding algorithm for high quality low bit rate transmission of speech , 1988, Speech Commun..

[73] J. Pearl,et al. Comparison of the cosine and Fourier transforms of Markov-1 signals , 1976 .

[74] Per Hedelin. A tone oriented voice excited vocoder , 1981, ICASSP.

[75] W. Bastiaan Kleijn,et al. Low-Complexity, Nonintrusive Speech Quality Assessment , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[76] R. Zamir. Lattices are everywhere , 2009, 2009 Information Theory and Applications Workshop.

[77] Robert A. Wannamaker. Psychoacoustically Optimal Noise Shaping , 1992 .

[78] Yuval Kochman,et al. Achieving the Gaussian Rate–Distortion Function by Prediction , 2007, IEEE Transactions on Information Theory.

[79] W. Bastiaan Kleijn,et al. Rate-distortion optimized quantization in multistage audio coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[80] Brockway McMillan,et al. Two inequalities implied by unique decipherability , 1956, IRE Trans. Inf. Theory.

[81] R. Nelsen. An Introduction to Copulas , 1998 .

[82] Oded Ghitza,et al. Objective Assessment of Speech and Audio Quality - Technology and Applications , 2006, IEEE Trans. Speech Audio Process..

[83] Markus Flierl,et al. A Motion-Compensated Orthogonal Transform with Energy-Concentration Constraint , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[84] Alex Pentland,et al. Coding, Analysis, Interpretation, and Recognition of Facial Expressions , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[85] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[86] C. E. SHANNON,et al. A mathematical theory of communication , 1948, MOCO.

[87] Robert M. Gray,et al. Dithered quantizers , 1993, IEEE Trans. Inf. Theory.

[88] C. Shannon. Coding Theorems for a Discrete Source With a Fidelity Criterion-Claude , 2009 .

[89] B. Kollmeier,et al. Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers. , 1997, The Journal of the Acoustical Society of America.

[90] N. Ahmed,et al. Discrete Cosine Transform , 1996 .

[91] Tiago H. Falk,et al. Single-Ended Speech Quality Measurement Using Machine Learning Methods , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[92] S. Kay. Fundamentals of statistical signal processing: estimation theory , 1993 .

[93] Henrique S. Malvar. Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[94] Vinay A. Vaishampayan,et al. Design of multiple description scalar quantizers , 1993, IEEE Trans. Inf. Theory.

[95] B. Julesz,et al. Spatial-frequency masking in vision: critical bands and spread of masking. , 1972, Journal of the Optical Society of America.

[96] Manfred R. Schroeder,et al. Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[97] Tamás Linder,et al. A Lagrangian formulation of Zador's entropy-constrained quantization theorem , 2002, IEEE Trans. Inf. Theory.

[98] F. Gregory Ashby,et al. Toward a Unified Theory of Similarity and Recognition , 1988 .

[99] Robert Ulichney,et al. Digital Halftoning , 1987 .

[100] Marc Schröder,et al. Emotional speech synthesis: a review , 2001, INTERSPEECH.

[101] Okamoto Keiji,et al. A signal transmission. , 1993 .

[102] Zhou Wang,et al. Modern Image Quality Assessment , 2006, Modern Image Quality Assessment.

[103] Weisi Lin,et al. Non-intrusive Speech Quality Assessment with Support Vector Regression , 2010, MMM.

[104] James Arvo,et al. A framework for realistic image synthesis , 1997, SIGGRAPH.

[105] Alan C. Bovik,et al. Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[106] Herbert Gish,et al. Asymptotically efficient quantizing , 1968, IEEE Trans. Inf. Theory.

[107] Edward J. Delp,et al. Moment preserving quantization [signal processing] , 1991, IEEE Trans. Commun..

[108] Christof Faller,et al. Binaural cue coding-Part I: psychoacoustic fundamentals and design principles , 2003, IEEE Trans. Speech Audio Process..

[109] Kenneth Rose,et al. A mapping approach to rate-distortion computation and analysis , 1994, IEEE Trans. Inf. Theory.

[110] A. Ng. Feature selection, L1 vs. L2 regularization, and rotational invariance , 2004, Twenty-first international conference on Machine learning - ICML '04.

[111] S. Mallat. A wavelet tour of signal processing , 1998 .

[112] Joel A. Tropp,et al. Greed is good: algorithmic results for sparse approximation , 2004, IEEE Transactions on Information Theory.

[113] W.B. Kleijn,et al. Flexible Quantization of Audio and Speech based on the Autoregressive Model , 2007, 2007 Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers.

[114] Andries P. Hekstra,et al. Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[115] Joel Max,et al. Quantizing for minimum distortion , 1960, IRE Trans. Inf. Theory.

[116] P. Schultheiss,et al. Block Quantization of Correlated Gaussian Random Variables , 1963 .

[117] Torsten Dau,et al. Auditory processing models , 2008 .

[118] Zhifeng Zhang,et al. Adaptive time-frequency decompositions , 1994 .

[119] Lawrence R. Rabiner,et al. Perceptual evaluation of the effects of dither on low bit rate PCM systems , 1972 .

[120] Michael C. Hout,et al. Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[121] L. Schuchman. Dither Signals and Their Effect on Quantization Noise , 1964 .

[122] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[123] L. L. Elliott. Backward and Forward Masking , 1971 .

[124] Roch Lefebvre,et al. A wideband speech and audio codec at 16/24/32 kbit/s using hybrid ACELP/TCX techniques , 1999, 1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351).

[125] T Dau,et al. A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. , 1996, The Journal of the Acoustical Society of America.

[126] Nasser M. Nasrabadi,et al. Pattern Recognition and Machine Learning , 2006, Technometrics.

[127] Christof Faller. Parametric multichannel audio coding: synthesis of coherence cues , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[128] B. Kollmeier,et al. Modeling auditory processing of amplitude modulation. II. Spectral and temporal integration. , 1997, The Journal of the Acoustical Society of America.

[129] Alan C. Bovik,et al. Automatic prediction of perceptual quality of multimedia signals—a survey , 2010, Multimedia Tools and Applications.

[130] Thomas M. Cover,et al. Elements of Information Theory , 2005 .

[131] Doaa Mohammed. Image Compression Using Block Truncation Coding , 2011 .

[132] Guillermo Sapiro,et al. Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[133] Minjie Xie,et al. ITU-T G.722.1 Annex C: A New Low-Complexity 14 KHZ Audio Coding Standard , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[134] Eero P. Simoncelli,et al. Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[135] Tamás Linder,et al. On the asymptotic tightness of the Shannon lower bound , 1994, IEEE Trans. Inf. Theory.

[136] Jonathan Baxter,et al. The Canonical Distortion Measure for Vector Quantization and Function Approximation , 1997, ICML.

[137] Renato Vicente,et al. An information-theoretic approach to statistical dependence: Copula information , 2009, ArXiv.

[138] R. Tibshirani,et al. An introduction to the bootstrap , 1993 .

[139] R. Gray,et al. Dithered Quantizers , 1993, Proceedings. 1991 IEEE International Symposium on Information Theory.

[140] Donald P. Greenberg,et al. A model of visual masking for computer graphics , 1997, SIGGRAPH.

[141] Kiyoharu Aizawa,et al. Model-based image coding advanced video coding techniques for very low bit-rate applications , 1995, Proc. IEEE.

[142] E. Holman. Monotonic models for asymmetric proximities , 1979 .

[143] T J Sejnowski,et al. Learning the higher-order structure of a natural sound. , 1996, Network.

[144] Michael S. Lewicki,et al. Efficient coding of natural sounds , 2002, Nature Neuroscience.

[145] Edward Jones,et al. Audio quality assessment techniques - A review, and recent developments , 2009, Signal Process..

[146] Meir Feder,et al. On universal quantization by randomized uniform/lattice quantizers , 1992, IEEE Trans. Inf. Theory.

[147] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[148] Torsten Dau,et al. Masking patterns for sinusoidal and narrow-band noise maskers. , 1998, The Journal of the Acoustical Society of America.

[149] Matti Karjalainen,et al. A new auditory model for the evaluation of sound quality of audio systems , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[150] Paul L. Zador,et al. Asymptotic quantization error of continuous signals and the quantization dimension , 1982, IEEE Trans. Inf. Theory.

[151] W. R. Bennett,et al. Spectra of quantized signals , 1948, Bell Syst. Tech. J..

[152] Allen Gersho,et al. Adaptive postfiltering for quality enhancement of coded speech , 1995, IEEE Trans. Speech Audio Process..

[153] Birger Kollmeier,et al. PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[154] Robert J. Safranek,et al. Signal compression based on models of human perception , 1993, Proc. IEEE.

[155] Jr. J.P. Campbell,et al. Speaker recognition: a tutorial , 1997, Proc. IEEE.

[156] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[157] P. Billingsley,et al. Ergodic theory and information , 1966 .

[158] Allen Gersho,et al. Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[159] James R. Glass,et al. Developments and directions in speech recognition and understanding, Part 1 [DSP Education] , 2009, IEEE Signal Processing Magazine.

[160] R. L. Wegel,et al. The Auditory Masking of One Pure Tone by Another and its Probable Relation to the Dynamics of the Inner Ear , 1924 .

[161] S. S. Stevens,et al. The Masking of Pure Tones and of Speech by White Noise , 1950 .

[162] W. Bastiaan Kleijn,et al. Predictive audio coding using rate-distortion-optimal pre- and post-filtering , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[163] Rüdiger L. Urbanke,et al. Lattice Codes Can Achieve Capacity on the AWGN Channel , 1998, IEEE Trans. Inf. Theory.

[164] R. Vafin,et al. Sinusoidal modeling using psychoacoustic-adaptive matching pursuits , 2002, IEEE Signal Processing Letters.

[165] D. Huffman. A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[166] R. Shepard. The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[167] R. Zamir,et al. On lattice quantization noise , 1994, Proceedings of 1994 IEEE International Symposium on Information Theory.

[168] Alan C. Bovik,et al. No-reference quality assessment using natural scene statistics: JPEG2000 , 2005, IEEE Transactions on Image Processing.

[169] Jordi Ribas-Corbera,et al. Rate control in DCT video coding for low-delay communications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[170] R. Gallager. Information Theory and Reliable Communication , 1968 .

[171] P. Mabilleau,et al. Fast CELP coding based on algebraic codes , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[172] Henrique S. Malvar,et al. The LOT: transform coding without blocking effects , 1989, IEEE Trans. Acoust. Speech Signal Process..

[173] Robert M. Gray,et al. Asymptotic Performance of Vector Quantizers with a Perceptual Distortion Measure , 1997, IEEE Trans. Inf. Theory.

[174] Robert M. Gray,et al. Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[175] Annie Cuyt,et al. Gamma function and related functions , 2008 .

[176] W. Bastiaan Kleijn,et al. Distribution Preserving Quantization With Dithering and Transformation , 2010, IEEE Signal Processing Letters.

[177] Adrian Segall. Bit allocation and encoding for vector sources , 1976, IEEE Trans. Inf. Theory.

[178] Allen Gersho,et al. Globally optimal vector quantizer design by stochastic relaxation , 1992, IEEE Trans. Signal Process..

[179] W. Bastiaan Kleijn,et al. The Sensitivity Matrix: Using Advanced Auditory Models in Speech and Audio Processing , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[180] N. Ruiz Reyes,et al. Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[181] Tai-Shih Chi,et al. Perception-based objective speech quality assessment , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[182] David J. Sakrison,et al. A geometric treatment of the source encoding of a Gaussian random variable , 1968, IEEE Trans. Inf. Theory.

[183] J. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[184] R. Shepard,et al. Toward a universal law of generalization for psychological science. , 1987, Science.

[185] Vivek K. Goyal,et al. Theoretical foundations of transform coding , 2001, IEEE Signal Process. Mag..

[186] Thomas M. Cover,et al. Elements of information theory (2. ed.) , 2006 .

[187] Michael Randolph Garey,et al. The complexity of the generalized Lloyd - Max problem , 1982, IEEE Trans. Inf. Theory.

[188] Toby Berger,et al. Rate distortion theory : a mathematical basis for data compression , 1971 .

[189] James Hu,et al. DVQ: A digital video quality metric based on human vision , 2001 .

[190] Thomas P. Barnwell,et al. MCCREE AND BARNWELL MIXED EXCITAmON LPC VOCODER MODEL LPC SYNTHESIS FILTER 243 SYNTHESIZED SPEECH-PERIODIC PULSE TRAIN-1 PERIODIC POSITION JITTER PULSE 4 , 2004 .

[191] Kristofer Kjörling,et al. Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[192] Peter Jax,et al. Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1 , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[193] Olivier Verscheure,et al. Perceptual quality measure using a spatiotemporal model of the human visual system , 1996, Electronic Imaging.

[194] John Princen,et al. Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[195] Meir Feder,et al. Information rates of pre/post-filtered dithered quantizers , 1993, IEEE Trans. Inf. Theory.

[196] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[197] Gunnar Karlsson,et al. Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[198] A. Tversky. Features of Similarity , 1977 .

[199] Jerome M. Shapiro,et al. Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..