Extraction of Speech Pitch and Formant Frequencies using Discrete Wavelet Transform

Extraction of pitch and formant frequencies is an important issue in speech processing. Pitch frequency is the fundamental frequency of the speech signal, and formant frequencies are essentially resonance frequencies of the vocal tract. These frequencies vary among different persons and words, but they are within certain frequency range. Practically, the first three formants are enough for coding and other processes. The most common methods for estimating formants are cepstrum and linear predictive coding. In this study, a wavelet based method using filter bank concepts is presented to estimate these frequencies.