Speech Enhancement Employing Modified a Priori SNR Estimation

In order to improve the performance of a speech enhancement system, Plapous introduced a novel method called two-step noise reduction (TSNR) technique to refine the a priori SNR estimation of the decision-directed (DD) approach. However, the performance of this method depends on the choice of gain function. In this paper, we propose a modified approach for the a priori SNR estimation in DCT domain with two steps like the TSNR method. While in the second step, the proposed approach computes directly the square of clean speech component using the estimated a priori SNR of the DD approach, its result is not restricted on the gain function, and thus the drawback of the TSNR method is removed while the advantages are kept. A number of objective tests under various conditions are provided, and the results show the improved performance of our approach.

[1]  Ivan Stojmenovic,et al.  Honeycomb Networks: Topological Properties and Communication Algorithms , 1997, IEEE Trans. Parallel Distributed Syst..

[2]  Soo Ngee Koh,et al.  Noisy speech enhancement using discrete cosine transform , 1998, Speech Commun..

[3]  Arnold L. Rosenberg,et al.  Group Action Graphs and Parallel Architectures , 1990, SIAM J. Comput..

[4]  Sheldon B. Akers,et al.  A Group-Theoretic Model for Symmetric Interconnection Networks , 1989, IEEE Trans. Computers.

[5]  Ivan Stojmenovic,et al.  Addressing and Routing in Hexagonal Networks with Applications for Tracking Mobile Users and Connection Rerouting in Cellular Networks , 2002, IEEE Trans. Parallel Distributed Syst..

[6]  Behrooz Parhami,et al.  Swapped interconnection networks: Topological, performance, and robustness attributes , 2005, J. Parallel Distributed Comput..

[7]  Wenjun Xiao,et al.  Further mathematical properties of Cayley digraphs applied to hexagonal and honeycomb meshes , 2007, Discret. Appl. Math..

[8]  Abdel Elah Al-Ayyoub,et al.  Topological Properties of OTIS-Networks , 2002, IEEE Trans. Parallel Distributed Syst..

[9]  Behrooz Parhami,et al.  Swapped networks: unifying the architectures and algorithms of a wide class of hierarchical parallel processors , 1996, Proceedings of 1996 International Conference on Parallel and Distributed Systems.

[10]  S C Esener,et al.  Optical transpose interconnection system architectures. , 1993, Optics letters.

[11]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[12]  Pascal Scalart,et al.  Improved Signal-to-Noise Ratio Estimation for Speech Enhancement , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Wenjun Xiao,et al.  Some mathematical properties of cayley digraphs with applications to interconnection network design , 2005, Int. J. Comput. Math..

[14]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[15]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[16]  Md. Kamrul Hasan,et al.  DCT speech enhancement with hard and soft thresholding criteria , 2002 .

[17]  Olivier Cappé,et al.  Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor , 1994, IEEE Trans. Speech Audio Process..

[18]  Marie-Claude Heydemann,et al.  Cayley graphs and interconnection networks , 1997 .

[19]  Behrooz Parhami,et al.  A Unified Formulation of Honeycomb and Diamond Networks , 2001, IEEE Trans. Parallel Distributed Syst..