论文信息 - The optimized wavelet filters for speech compression

The optimized wavelet filters for speech compression

In this paper, optimized wavelet filters for speech compression are proposed whose wavelet filter coefficients are derived with different window techniques such as Kaiser and Blackman windows via simple linear optimization. When the developed wavelet filters are exploited for speech compression, they not only give better compression ratio but also yield good fidelity parameters as compared to other wavelet filters. A comparative study of performance of different existing wavelet filters and the proposed wavelet filters is made in terms of compression ratio (CR), signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR) and normalized root-mean square error (NRMSE) at different thresholding levels. The simulation result included in this paper shows increased efficacy and improved performance of the proposed filters in the field of speech signal processing.

[1] R. S. Anand,et al. Near Perfect Reconstruction Quadrature Mirror Filter , 2008 .

[2] M. Mangoud,et al. Speech Coding , 2005 .

[3] M. Vetterli,et al. Wavelets, subband coding, and best bases , 1996, Proc. IEEE.

[4] Allen Gersho,et al. Hybrid coding: combined harmonic and waveform coding of speech at 4 kb/s , 2001, IEEE Trans. Speech Audio Process..

[5] R. S. Anand,et al. Turning point algorithm for speech signal compression , 2012, Int. J. Speech Technol..

[6] I. Daubechies. Ten Lectures on Wavelets , 1992 .

[7] Nikos Fakotakis,et al. Modeling the Temporal Evolution of Acoustic Parameters for Speech Emotion Recognition , 2012, IEEE Transactions on Affective Computing.

[8] David Malah,et al. Design of uniform DFT filter banks optimized for subband coding of speech , 1989, IEEE Trans. Acoust. Speech Signal Process..

[9] I. Daubechies. Orthonormal bases of compactly supported wavelets , 1988 .

[10] J.D. Gibson,et al. Speech coding methods, standards, and applications , 2005, IEEE Circuits and Systems Magazine.

[11] S. A. Alfandi,et al. Multimedia speech compression techniques , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[12] R. Young,et al. An introduction to nonharmonic Fourier series , 1980 .

[13] Moshe Porat,et al. On color transforms and bit allocation for optimal subband image compression , 2007, Signal Process. Image Commun..

[14] Vassilis Anastassopoulos,et al. Morphological waveform coding for writer identification , 2000, Pattern Recognit..

[15] Palaniandavar Venkateswaran,et al. An Efficient Time Domain Speech Compression Algorithm Based on LPC and Sub-Band Coding Techniques , 2009, J. Commun..

[16] W. Robertson,et al. Comparing audio compression using wavelets with other audio compression schemes , 1999, Engineering Solutions for the Next Millennium. 1999 IEEE Canadian Conference on Electrical and Computer Engineering (Cat. No.99TH8411).

[17] Johnson I. Agbinya,et al. Discrete wavelet transform techniques in speech processing , 1996, Proceedings of Digital Processing Applications (TENCON '96).

[18] Rabul Hussain Laskar,et al. A pitch synchronous approach to design voice conversion system using source-filter correlation , 2012, Int. J. Speech Technol..

[19] N. Ahmed,et al. Speech and Image Compression Using Discrete Wavelet Transform , 2005, IEEE/Sarnoff Symposium on Advances in Wired and Wireless Communication, 2005..

[20] Allen Gersho,et al. Combined harmonic and waveform coding of speech at low bit rates , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[21] Mahmoud. A. Osman,et al. Speech compression using LPC and wavelet , 2010, 2010 2nd International Conference on Computer Engineering and Technology.

[22] Ismail Shahin,et al. Speaker identification investigation and analysis in unbiased and biased emotional talking environments , 2012, International Journal of Speech Technology.

[23] V. Prakash,et al. Speech compression using discreet wavelet transform , 2003, 4th National Conference of Telecommunication Technology, 2003. NCTT 2003 Proceedings..

[24] Stéphane Mallat,et al. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[25] Tong Zhang,et al. Using Lossless Data Compression in Data Storage Systems: Not for Saving Space , 2011, IEEE Transactions on Computers.

[26] James L. Flanagan,et al. Speech Compression by Polynomial Approximation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[27] Shijo M Joseph. Spoken digit compression using Wavelet Packet , 2010, 2010 International Conference on Signal and Image Processing.

[28] Darryl Stewart,et al. Subband correlation and robust speech recognition , 2005, IEEE Transactions on Speech and Audio Processing.

[29] Jelena Kovacevic,et al. Wavelets and Subband Coding , 2013, Prentice Hall Signal Processing Series.

[30] R. Crochiere,et al. Speech Coding , 1979, IEEE Transactions on Communications.

[31] A. Gersho. Advances in speech and audio compression : Data compression , 1994 .

[32] A. Nejat Ince,et al. Digital Speech Processing , 1992 .

[33] Chip-Hong Chang,et al. Bayesian Separation With Sparsity Promotion in Perceptual Wavelet Domain for Speech Enhancement and Hybrid Speech Recognition , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.