FPGA Implementation of Zero Frequency Filter

Epoch is the instant of significant excitation during the production of a speech signal. Due to time varying nature of the excitation source and the vocal tract system, accurate detection of epochs from the speech remains a challenging area of research. Over the years several algorithms have been proposed for the detection of epochs. Among different techniques proposed in the literature, the zero frequency filter (ZFF) approach provides better performance for clean and degraded speech. The filter design originally proposed in ZFF has an infinite impulse response (IIR) filter followed by two detrenders. Later, the IIR implementation is simplified to finite impulse response(FIR) realization. In this paper, we have designed the efficient architecture for IIR, FIR realization of ZFF and implemented these two realizations on field programmable gate array (FPGA).

[1]  Bayya Yegnanarayana,et al.  Determination of instants of significant excitation in speech using group delay function , 1995, IEEE Trans. Speech Audio Process..

[2]  Kishore Prahallad,et al.  An FIR Implementation of Zero Frequency Filtering of Speech Signals , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Puli Kishore Kumar,et al.  A fast carry chain adder using Instantiation design entry on Virtex-5 FPGA , 2016, 2016 IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics Engineering (UPCON).

[4]  Bayya Yegnanarayana,et al.  Epoch extraction from linear prediction residual , 1978, ICASSP.

[5]  Puli Kishore Kumar,et al.  An efficient hardware architecture for detection of vowel-like regions in speech signal , 2018, Integr..

[6]  Bayya Yegnanarayana,et al.  Epoch Extraction From Speech Signals , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  S. R. Mahadeva Prasanna,et al.  Foreground Speech Segmentation using Zero Frequency Filtered Signal , 2012, INTERSPEECH.

[8]  Mike Brookes,et al.  The DYPSA algorithm for estimation of glottal closure instants in voiced speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  L. H. Anauer,et al.  Speech Analysis and Synthesis by Linear Prediction of the Speech Wave , 2000 .

[10]  D. Ponta,et al.  A novel tool to introduce FPGA in digital design laboratory , 2012, 2012 9th International Conference on Remote Engineering and Virtual Instrumentation (REV).

[11]  B. Yegnanarayana,et al.  Fast prosody modification using instants of significant excitation , 2010 .

[12]  H. Strube Determination of the instant of glottal closure from the speech wave. , 1974, The Journal of the Acoustical Society of America.

[13]  S. R. Mahadeva Prasanna,et al.  Speaker Verification by Vowel and Nonvowel Like Segmentation , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Puli Kishore Kumar,et al.  Low latency architecture design and implementation for short-time fourier transform algorithm on FPGA , 2017, 2017 IEEE International Conference on Microwaves, Antennas, Communications and Electronic Systems (COMCAS).

[15]  Bayya Yegnanarayana,et al.  Performance of an Event-Based Instantaneous Fundamental Frequency Estimator for Distant Speech Signals , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Bayya Yegnanarayana,et al.  Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs , 2010, IEEE Signal Processing Letters.