A Hybrid Descent Method for Optimal Sigmoid Filter Design

In this letter, a hybrid descent method is used to determine a set of filter parameters for a sigmoid filter which attempts to work under various SNR conditions. It overcomes the limitations of the current sigmoid filters that performs effectively only at a single SNR. Results show that significant improvement in terms of better speech qualities can be achieved by the proposed sigmoid filter when working under various SNR conditions.

[1]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[2]  Yi Hu,et al.  Use of a sigmoidal-shaped function for noise attenuation in cochlear implants. , 2007, The Journal of the Acoustical Society of America.

[3]  Shrikanth S. Narayanan,et al.  Robust Voice Activity Detection Using Long-Term Signal Variability , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Sven Nordholm,et al.  Optimization and evaluation of sigmoid function with a priori SNR estimate for real-time speech enhancement , 2013, Speech Commun..

[5]  Paul R. White,et al.  Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors , 2009, Speech Commun..

[6]  Eric Plourde,et al.  Generalized Bayesian Estimators of the Spectral Amplitude for Speech Enhancement , 2009, IEEE Signal Processing Letters.

[7]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Kuldip K. Paliwal,et al.  Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator , 2012, Speech Commun..

[9]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[10]  R. Fletcher Practical Methods of Optimization , 1988 .

[11]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[12]  Rainer Martin,et al.  Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.