A Kalman Smoothing Algorithm for Speech Enhancement Based on the Properties of Vocal Tract Varying Slowly

The linear prediction coefficients obtained from noisy speech have an important impact on improving the quality of the enhanced speech in the speech enhancement algorithm based on the Kalman Smoother. According to the properties of the slow changes of the vocal tract, this paper proposes a novel Kalman smoothing algorithm for speech enhancement based on vocal tract parameters smoother. Firstly, the linear prediction coefficients are converted into the line spectrum frequency parameters. Then, these parameters of the adjacent frames are smoothed before they transform into state transition matrix. Experimental results indicate that the proposed Kalman smoothing algorithm for speech enhancement based on vocal tract parameters smoother can suppress the sudden changes of residual noise energy and improve the quality of enhanced speech. The quality of the enhanced speech is evaluated by means of segmental SNR and ITU-PESQ scores. Experimental results indicate that the proposed algorithm achieves obvious improvements compared with conventional Wiener filter.

[1]  Philipos C. Loizou,et al.  A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Rajeev Rastogi,et al.  Topology discovery in heterogeneous IP networks: the NetInventory system , 2004, IEEE/ACM Transactions on Networking.

[3]  Shawn A. Routhier Management Information Base for the Internet Protocol (IP) , 2006, RFC.

[4]  Donald F. Towsley,et al.  Multicast topology inference from measured end-to-end loss , 2002, IEEE Trans. Inf. Theory.

[5]  Christian Pérez,et al.  A network topology description model for grid application deployment , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[6]  Fangzhe Chang,et al.  Topology discovery for public IPv6 networks , 2003, CCRV.

[7]  W. Bastiaan Kleijn,et al.  On causal algorithms for speech enhancement , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  W. Bastiaan Kleijn,et al.  Spectral dynamics is more important than spectral distortion , 1995, ICASSP.

[9]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[10]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[11]  Shang-Juh Kao,et al.  Topology Discovery for Coexisting IPv6 and IPv4 Networks , 2006, 5th IEEE/ACIS International Conference on Computer and Information Science and 1st IEEE/ACIS International Workshop on Component-Based Software Engineering,Software Architecture and Reuse (ICIS-COMSAR'06).

[12]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[13]  Shang-Juh Kao,et al.  Managing the co-existing network of IPv6 and IPv4 under various transition mechanisms , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[14]  Mark Crovella,et al.  Improved Algorithms for Network Topology Discovery , 2005, PAM.

[15]  Qingxian Wang,et al.  Discovering IPv6 network topology , 2005, IEEE International Symposium on Communications and Information Technology, 2005. ISCIT 2005..

[16]  Hong Shen,et al.  Hamming distance and hop count based classification for multicast network topology inference , 2005, 19th International Conference on Advanced Information Networking and Applications (AINA'05) Volume 1 (AINA papers).

[17]  Jerry D. Gibson,et al.  Filtering of colored noise for speech enhancement and coding , 1991, IEEE Trans. Signal Process..

[18]  Hui Tian Network topology discovery and its applications , 2006 .

[19]  M.S. Scordilis,et al.  Speech enhancement by Kalman filtering with residual noise clipping , 2005, Proceedings. IEEE SoutheastCon, 2005..

[20]  Bill Fenner,et al.  Management Information Base for the User Datagram Protocol (UDP) , 2005, RFC.

[21]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[22]  Bill Fenner,et al.  Management Information Base for the Transmission Control Protocol (TCP) , 2001 .

[23]  Olivier Festor,et al.  A hierarchical topology discovery service for IPv6 networks , 2002, NOMS 2002. IEEE/IFIP Network Operations and Management Symposium. ' Management Solutions for the New Communications World'(Cat. No.02CH37327).

[24]  Hui Tian,et al.  Analysis on binary loss tree classification with hop count for multicast topology discovery , 2004, First IEEE Consumer Communications and Networking Conference, 2004. CCNC 2004..

[25]  Ratul Mahajan,et al.  Measuring ISP topologies with rocketfuel , 2002, TNET.

[26]  kc claffy,et al.  Otter: A general-purpose network visualization tool , 1999 .

[27]  Kuldip K. Paliwal,et al.  A speech enhancement method based on Kalman filtering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[28]  Damien Magoni,et al.  Analysis of the autonomous system network topology , 2001, CCRV.

[29]  Chunjian Li,et al.  Integrating Kalman filtering and multi-pulse coding for speech enhancement with a non-stationary model of the speech signal , 2004, Conference Record of the Thirty-Eighth Asilomar Conference on Signals, Systems and Computers, 2004..

[30]  Ibrahim Matta,et al.  On the origin of power laws in Internet topologies , 2000, CCRV.