Improved method for real-time speech stretching

An algorithm for real-time speech stretching is presented. It was designed to modify input signal dependently on its content and on its relation with the historical input data. The proposed algorithm is a combination of speech signal analysis algorithms, i.e. voice, vowels/consonants, stuttering detection and SOLA (Synchronous-Overlap-and-Add) based speech stretching algorithm. This approach enables stretching input speech signal in real-time with high quality and provides "global" synchronization of the input and output signals. Finally, the effectiveness of the engineered algorithms as well as the quality of the processed speech are discussed.

[1]  Thilo Pfau,et al.  Estimating the speaking rate by vowel detection , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2]  R. Venkatesha Prasad,et al.  Comparison of voice activity detection algorithms for VoIP , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[3]  Lawrence R. Rabiner,et al.  An algorithm for determining the endpoints of isolated utterances , 1975, Bell Syst. Tech. J..

[4]  Gail D. Chermak,et al.  Central Auditory Processing Disorders: New Perspectives , 1997 .

[5]  G. Gates,et al.  Hearing in the elderly--the Framingham cohort, 1983-1985: Part II. Prevalence of central auditory processing disorders. , 1991, Ear and hearing.

[6]  Andrzej Czyzewski,et al.  Time-scale modification of speech signals for supporting hearing impaired schoolchildren , 2009, Signal Processing Algorithms, Architectures, Arrangements, and Applications SPA 2009.

[7]  Jean Laroche,et al.  Improved phase vocoder time-scale modification of audio , 1999, IEEE Trans. Speech Audio Process..

[8]  Dongsuk Yook,et al.  Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds , 2009 .

[9]  Andrzej Czyzewski,et al.  Real-Time Speech-Rate Modification Experiments , 2010 .

[10]  Mohammad Hossein Moattar,et al.  A new approach for robust realtime Voice Activity Detection using spectral pattern , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[11]  H.S. Jamadagni,et al.  Second and third order adaptable threshold for VAD in VoIP , 2002, 6th International Conference on Signal Processing, 2002..