Sine-wave phase coding at low data rates

In the context of a sinusoidal representation for speech waveforms, it is shown that synthetic speech of high quality can be obtained using a parametric model for the sine-wave phases, hence obviating the need to code the phases at low data rates. It was found that if a synthetic linear phase term was computed based on the time of occurrence of an artificially generated sequence of pitch pulses, then high-quality voiced speech reconstruction was possible. For unvoiced speech, the modeling study showed that the sine-wave phases were essentially uniformly distributed random variables.<<ETX>>

[1]  Thomas F. Quatieri,et al.  Magnitude-only reconstruction using a sinusoidal speech modelMagnitude-only reconstruction using a sinusoidal speech model , 1984, ICASSP.

[2]  T. Quatieri,et al.  Phase modelling and its application to sinusoidal transform coding , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  O. Fujimura An approximation to voice aperiodicity , 1968 .

[4]  R. McAulay,et al.  Mid-rate coding based on a sinusoidal representation of speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[6]  Luís B. Almeida,et al.  Harmonic coding at 4.8 kb/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7]  M. Sabin,et al.  Sine-Wave Amplitude Coding at Low Data Rates , 1991 .

[8]  J. Makhoul,et al.  A mixed‐source model for speech compression and synthesis , 1978 .

[9]  Thomas F. Quatieri,et al.  Phase coherence in speech reconstruction for enhancement and coding applications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[10]  Thomas F. Quatieri,et al.  Pitch estimation and voicing detection based on a sinusoidal speech model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[11]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..