Various speech processing techniques for multimedia applications

In this paper, various speech processing techniques in time, time-frequency and timescale domains for the purposes of recognition and compression are displayed. The examination of these representations in a variety of work that have been accomplished in that direction is included. In particular, we emphasize the advantages of Wavelet Transforms in recognizing and compressing speech signals.

[1]  Randy K. Young Wavelet theory and its applications , 1993, The Kluwer international series in engineering and computer science.

[2]  Fabrizio Durante,et al.  Copula Theory and Its Applications , 2010 .

[3]  Jalal Karam,et al.  A new approach in wavelet based speech compression , 2008 .

[4]  Lawrence R. Rabiner,et al.  An algorithm for determining the endpoints of isolated utterances , 1975, Bell Syst. Tech. J..

[5]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[6]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[7]  Alan V. Oppenheim,et al.  Discrete-time Signal Processing. Vol.2 , 2001 .

[8]  Dennis Gabor,et al.  Theory of communication , 1946 .

[9]  R. F. Favero Compound wavelets: wavelets for speech recognition , 1994, Proceedings of IEEE-SP International Symposium on Time- Frequency and Time-Scale Analysis.

[10]  Carl Taswell Speech compression with cosine and wavelet packet near-best bases , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[11]  Richard Kronland-Martinet,et al.  Analysis of Sound Patterns through Wavelet transforms , 1987, Int. J. Pattern Recognit. Artif. Intell..

[12]  I. Daubechies Ten Lectures on Wavelets , 1992 .

[13]  Jalal Karam,et al.  The Effect of Different Compression Schemes on Speech Signals , 2008 .

[14]  Joseph Picone,et al.  Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[15]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[16]  Truong Q. Nguyen,et al.  Wavelets and filter banks , 1996 .

[17]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[18]  R. W. King,et al.  Wavelet parameterization for speech recognition: variations in translation and scale parameters , 1994, Proceedings of ICSIPNN '94. International Conference on Speech, Image Processing and Neural Networks.

[19]  H. Goldstein,et al.  Formant tracking using the wavelet-based DST , 1994, Proceedings of COMSIG '94 - 1994 South African Symposium on Communications and Signal Processing.

[20]  William J. Phillips,et al.  New low rate wavelet models for the recognition of single spoken digits , 2000, 2000 Canadian Conference on Electrical and Computer Engineering. Conference Proceedings. Navigating to a New Era (Cat. No.00TH8492).

[21]  Climent Nadeu,et al.  Wavelet transforms for non-uniform speech recognition systems , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[22]  William J. Phillips,et al.  Speech recognition techniques using RBF networks , 1995, IEEE WESCANEX 95. Communications, Power, and Computing. Conference Proceedings.

[23]  Amara Lynn Graps,et al.  An introduction to wavelets , 1995 .

[24]  Richard Kronland-Martinet,et al.  The Wavelet Transform for Analysis, Synthesis, and Processing of Speech and Music Sounds , 1988 .

[25]  Hong-Ye Gao,et al.  Wavelet analysis [for signal processing] , 1996 .

[26]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[27]  M.M. Artimy,et al.  Automatic detection of acoustic sub-word boundaries for single digit recognition , 1999, Engineering Solutions for the Next Millennium. 1999 IEEE Canadian Conference on Electrical and Computer Engineering (Cat. No.99TH8411).