Paper information - Real-Time Implementation of Time Domain Harmonic Scaling of Speech for Rate Modification and Coding


Time domain harmonic scaling (TDHS) has been realized in real time on the Bell Laboratories digital signal processing (DSP) integrated circuit. TDHS is an algorithm that can expand or compress the bandwidth and sampling rate of speech by taking advantage of the pitch structure in the speech signal. As such, it is useful in a variety of speech applications, including speech coding, speech enhancement, and rate modification. A single DSP can perform compression, and a second DSP can perform expansion. Both operations require pitch information to be supplied with the input speech. The system includes a real-time pitch/periodicity detector, which has also been implemented on a single DSP. Its design is based on a novel modification of the autocorrelation-function type of pitch detector. This paper presents details of both the TDHS and pitch detector implementations and discusses their performance. In particular, we discuss a 2:1 compression and expansion system that has been used as part of a 9.6 kbit/s speech coder. TDHS was previously thought to require a much larger buffer than the RAM available in the DSP. We show that, for all the compression/expansion ratios of interest, the buffer size needed is twice the maximum pitch period.
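To make the 2:1 compression case concrete, the following is a minimal Python sketch, not the DSP implementation described in the paper. It assumes a fixed, already-known pitch period (the real system receives time-varying pitch information from the detector) and a simple linear cross-fade window, which is an illustrative choice. The function name `tdhs_compress_2to1` and the test signal are hypothetical. Each block of two pitch periods is blended into one, so the working buffer in this sketch holds two pitch periods of input at a time.

```python
import math


def tdhs_compress_2to1(x, pitch_period):
    """Blend each block of 2*P input samples into P output samples.

    Assumes a constant pitch period P (illustrative simplification).
    """
    p = pitch_period
    out = []
    # Process only complete blocks of two pitch periods;
    # a trailing partial block is dropped in this sketch.
    for start in range(0, len(x) - 2 * p + 1, 2 * p):
        for n in range(p):
            # Linear fade: weight 1 at the start of the block,
            # decreasing toward 0 at the end of the first period.
            w = 1.0 - n / p
            first = x[start + n]        # sample from the first pitch period
            second = x[start + p + n]   # same phase, second pitch period
            out.append(w * first + (1.0 - w) * second)
    return out


# A perfectly periodic test signal with a period of P = 8 samples.
P = 8
signal = [math.sin(2 * math.pi * n / P) for n in range(4 * P)]
compressed = tdhs_compress_2to1(signal, P)
```

For this perfectly periodic input, the two samples being blended are equal up to rounding, so the output has half as many samples as the input while preserving the waveform of each pitch period.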

Ronald E. Crochiere | Richard V. Cox | James David Johnston
