Paper information - Sine-wave phase coding at low data rates

Sine-wave phase coding at low data rates

In the context of a sinusoidal representation for speech waveforms, it is shown that synthetic speech of high quality can be obtained using a parametric model for the sine-wave phases, hence obviating the need to code the phases at low data rates. It was found that if a synthetic linear phase term was computed based on the time of occurrence of an artificially generated sequence of pitch pulses, then high-quality voiced speech reconstruction was possible. For unvoiced speech, the modeling study showed that the sine-wave phases were essentially uniformly distributed random variables.
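The two phase rules named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the sign convention of the linear phase, and the use of a single onset time per frame are assumptions made here for illustration, and any further terms the paper's full phase model may include are omitted.

```python
import math
import random


def model_phases(freqs_hz, onset_time_s, voiced, rng=None):
    """Return one model phase (radians) per sine-wave frequency.

    Hypothetical helper: the name and signature are illustrative only.
    """
    rng = rng or random.Random(0)
    if voiced:
        # Assumed convention: a linear phase -2*pi*f*t0 delays every sine
        # wave so that all of them peak at the pitch-pulse onset time t0.
        return [-2.0 * math.pi * f * onset_time_s for f in freqs_hz]
    # Unvoiced: independent phases drawn uniformly from [-pi, pi].
    return [rng.uniform(-math.pi, math.pi) for _ in freqs_hz]


def synthesize(amps, freqs_hz, phases, n_samples, sample_rate_hz):
    """Sum of cosines with the given amplitudes, frequencies, and phases."""
    return [
        sum(
            a * math.cos(2.0 * math.pi * f * n / sample_rate_hz + p)
            for a, f, p in zip(amps, freqs_hz, phases)
        )
        for n in range(n_samples)
    ]


# Three harmonics of a 100 Hz pitch, sampled at 8 kHz (illustrative values).
amps = [1.0, 1.0, 1.0]
freqs = [100.0, 200.0, 300.0]
onset = 0.0025  # seconds, i.e. sample 20 at 8 kHz

voiced_frame = synthesize(amps, freqs, model_phases(freqs, onset, True), 80, 8000)
unvoiced_phases = model_phases(freqs, onset, False, random.Random(1))
```

With the linear phase, the harmonics add coherently at the onset sample, which is what produces a pulse-like voiced waveform. With random phases that alignment is lost and the waveform becomes noise-like.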

Thomas F. Quatieri | R. J. McAulay

[1] Thomas F. Quatieri,et al. Magnitude-only reconstruction using a sinusoidal speech model , 1984, ICASSP.

[2] T. Quatieri,et al. Phase modelling and its application to sinusoidal transform coding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] O. Fujimura. An approximation to voice aperiodicity , 1968 .

[4] R. McAulay,et al. Mid-rate coding based on a sinusoidal representation of speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Jae S. Lim,et al. Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[6] Luís B. Almeida,et al. Harmonic coding at 4.8 kb/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7] M. Sabin,et al. Sine-Wave Amplitude Coding at Low Data Rates , 1991 .

[8] J. Makhoul,et al. A mixed‐source model for speech compression and synthesis , 1978 .

[9] Thomas F. Quatieri,et al. Phase coherence in speech reconstruction for enhancement and coding applications , 1989, International Conference on Acoustics, Speech, and Signal Processing.

[10] Thomas F. Quatieri,et al. Pitch estimation and voicing detection based on a sinusoidal speech model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[11] Thomas F. Quatieri,et al. Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process.