Mandarin speech coding using a modified RPE-LTP technique

This paper proposes a novel speech codec named the Chinese RPE-LTP (CRPE-LTP), which exploits some of the unique characteristics of Mandarin in speech in order to improve speech quality, for Mandarin speakers. Although the codec is based on the proven principles of the GSM06.10 RPE-LTP coder; its performance is better than that of GSM for coding Mandarin speech, and is designed to at least match GSM performance for coding English speech.

[1]  Lin-Shan Lee,et al.  Voice dictation of Mandarin Chinese , 1997, IEEE Signal Process. Mag..

[2]  Sin-Horng Chen,et al.  Vector quantization of pitch information in Mandarin speech , 1990, IEEE Trans. Commun..

[3]  Jialu Zhang Phonetic and linguistic features of spoken Chinese , 1994, Proceedings of ICSIPNN '94. International Conference on Speech, Image Processing and Neural Networks.

[4]  Ian McLoughlin,et al.  Proposal of standards for intelligibility tests of Chinese speech , 2000 .

[5]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[6]  Amir Dembo,et al.  A unified framework for LPC excitation representation in residual speech coders , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7]  H. Levitt,et al.  Predicting consonant confusions from acoustic analysis. , 1981, The Journal of the Acoustical Society of America.

[8]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[9]  Andreas Spanias,et al.  Cepstrum-based pitch detection using a new statistical V/UV classification algorithm , 1999, IEEE Trans. Speech Audio Process..