Adaptive nonuniform polar quantization application to high quality speech compression

The logarithmic companding technique has shown to be extremely useful in speech quantization with rate of 8 bits/sample. However, for lower bit rates it is not the ideal solution for high quality speech coding. Because of that, in this paper we establish source coding scheme which enables better spectrum efficiency for input that has a large dynamic range. Since our wish is also to improve signal quality in comparison with quality defined with standards G.711 and G.712, we opt for adaptive technique application to the speech coding. Our research shows that proper design of forward gain-adaptive polar quantization can enable compression of about 1 bit/sample as well as significantly better quality than in case of using coder designed according to standard G.711. Furthermore, performances can be sustained over the whole speech dynamic range. Also, if the requisite speech quality is not supposed to be lower than G.712 standard quality, the achieved compression can be almost 1.5 bits/sample. Besides, we propose knew simple encoding rule which can additionally reduce bit rate for 0.1 bit/sample.

[1]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[2]  Olivier Hersent,et al.  Beyond VoIP Protocols: Understanding Voice Technology and Networking Techniques for IP Telephony , 2005 .

[3]  Peter F. Swaszek,et al.  Asymptotic performance of unrestricted polar quantizers , 1986, IEEE Trans. Inf. Theory.

[4]  Allen Gersho,et al.  Advances in speech and audio compression , 1994, Proc. IEEE.

[5]  James A. Bucklew,et al.  Two-dimensional quantization of bivariate circularly symmetric densities , 1979, IEEE Trans. Inf. Theory.

[6]  Peter F. Swaszek,et al.  Multidimensional spherical coordinates quantization , 1983, IEEE Trans. Inf. Theory.

[7]  Ahmet M. Kondoz,et al.  Digital Speech: Coding for Low Bit Rate Communication Systems , 1995 .

[8]  Takeshi Mori,et al.  A G.711 Embedded Wideband Speech Coding for VoIP Conferences , 2006, IEICE Trans. Inf. Syst..

[9]  Jon Hamkins,et al.  Gaussian source coding with spherical codes , 2002, IEEE Trans. Inf. Theory.

[10]  Kenneth C. Pohlmann,et al.  Principles of Digital Audio , 1986 .

[11]  N. Jayant,et al.  Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[12]  Thomas R. Fischer,et al.  Geometric source coding and vector quantization , 1989, IEEE Trans. Inf. Theory.

[13]  W. Pearlman,et al.  Polar Quantization of a Complex Gaussian Random Variable , 1979, IEEE Trans. Commun..

[14]  Peter F. Swaszek Uniform Spherical Coordinate Quantization of Spherically Symmetric Sources , 1985, IEEE Trans. Commun..

[15]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[16]  Bhumip Khasnabish Implementing Voice Over IP , 2003 .

[17]  Jelena Nikolic,et al.  Lloyd-Max's Algorithm Implementation in Speech Coding Algorithm Based on Forward Adaptive Technique , 2008, Informatica.

[18]  Wai C. Chu,et al.  Speech Coding Algorithms , 2003 .

[19]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .