Speech coding using Fourier-Bessel expansion of speech signals

Coding of speech signals using Bessel functions as orthogonal signals in the Fourier-Bessel (FB) expansion has been explored. It has been found that a reasonable quality of speech can be reconstructed using a set of 15 to 30 coefficients in the FB expansion of each frame of speech. At 80 frames per second and eight bits per coefficient, this corresponds to a bit rate of as low as 9600 bits/second when predetermined sequence of coefficients are used. The speech quality and the bit rate increase when higher number or a selected set of coefficients are used. Comparable results in perceptual speech quality and frame-to-frame signal-to-noise were observed for both male and female speakers.

[1]  Andreas Spanias,et al.  Speech coding: a tutorial review , 1994, Proc. IEEE.

[2]  Kaliappan Gopalan,et al.  A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion , 1999, IEEE Trans. Speech Audio Process..

[3]  C. Chen,et al.  Speech signal analysis and synthesis via Fourier-Bessel representation , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Harold J. Manley Analysis‐Synthesis of Connected Speech in Terms of Orthogonalized Exponentially Damped Sinusoids , 1962 .

[5]  K. Gopalan Speech modification by selective Fourier-Bessel series expansion of speech signals , 1999, 1999 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM 1999). Conference Proceedings (Cat. No.99CH36368).

[6]  L. Dolansky Choice of base signals in speech signal analysis , 1960 .

[7]  Robert E. Yantorno,et al.  Performance of the modified Bark spectral distortion as an objective speech quality measure , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).