Selective-LPC based representation of STRAIGHT spectrum and its applications in spectral smoothing

In this paper we propose a new method to represent STRAIGHT spectrum. The new method provides STRAIGHT spectral parameters with the capability of interpolation and quantization, which is needed for most speech manipulation, especially for spectral smoothing. The proposed method estimates 2-band selective-LPC whose spectral envelope fits the given STRAIGHT spectrum. With the interpolation properties of LSP, the estimated selective-LPC could be converted to LSP and then simply interpolated. We apply this representation in our spectral smoothing experiments and the results show that this method can get smooth spectral envelope over the segment boundaries. Listening tests prove that this algorithm effectively smooth speech boundaries with little quality degradation. . Index Terms: speech synthesis, STRAIGHT, selective-LPC

[1]  Roy D. Patterson,et al.  Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity , 1999, EUROSPEECH.

[2]  Hideki Kawahara,et al.  Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds , 1999, Speech Commun..

[3]  Hideki Kawahara,et al.  Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  John H. L. Hansen,et al.  An auditory-based measure for improved phone segment concatenation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Kuldip K. Paliwal,et al.  Interpolation properties of linear prediction parametric representations , 1995, EUROSPEECH.

[6]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[7]  John H. L. Hansen,et al.  A comparison of spectral smoothing methods for segment concatenation based speech synthesis , 2002, Speech Commun..

[8]  Takao Kobayashi,et al.  Speech coding based on adaptive mel-cepstral analysis , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[10]  Bishnu S. Atal,et al.  Speech synthesis by linear interpolation of spectral parameters between dyad boundaries , 1979 .

[11]  John Makhoul,et al.  Spectral linear prediction: Properties and applications , 1975 .