论文信息 - Adaptive-order fractional Fourier transform features for speech recognition

Adaptive-order fractional Fourier transform features for speech recognition

We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). Since the transform order is critical for the performance of FrFT, we use the ambiguity function to adaptively determine the optimal orders of FrFT for each frame. The performance of the proposed feature is compared with traditional MFCCs on recognizing speech of isolated and connected digits under both clean and noisy backgrounds. The recognition results and detailed confusion matrices are given and analyzed, which implies that the proposed feature is promising in certain speech processing fields.

Jingming Kuang | Hui Yin | Xiang Xie

[1] Luis Weruaga,et al. High-resolution noise-robust spectral-based pitch estimation , 2005, INTERSPEECH.

[2] Ran Tao,et al. Research progress of the fractional Fourier transform in signal processing , 2006, Science in China Series F.

[3] Qilin,et al. Detection and parameter estimation of multicomponent LFM signal based on the fractional Fourier transform , 2004 .

[4] V. Namias. The Fractional Order Fourier Transform and its Application to Quantum Mechanics , 1980 .

[5] Petros Maragos,et al. On amplitude and frequency demodulation using energy operators , 1993, IEEE Trans. Signal Process..

[6] H. M. Teager,et al. Evidence for Nonlinear Sound Production Mechanisms in the Vocal Tract , 1990 .

[7] Zhao Xing. Chirp Signal Detection and Multiple Parameter Estimation Using Radon-Ambiguity and Fractional Fourier Transform , 2003 .