Automatic detection of prolongations and repetitions using LPCC

Stuttering is a speech disorder in which the normal flow of speech is disrupted by occurrences of dysfluencies, such as repetitions, interjection and so on. There are high proportion of repetitions and prolongations in stuttered speech, usually at the beginning of sentences. Consequently, acoustic analysis can be used to classify the stuttered events. This paper describes particular stuttering events to be located as repetitions and prolongations in stuttered speech with feature extraction algorithm. Linear Predictive Cepstral Coefficient (LPCC) feature extraction is implemented to test its effectiveness in recognizing prolongations and repetitions in stuttered speech. In this work, two classifiers, Linear Discriminant Analysis classifier (LDA) and k-nearest neighbors (k-NN) are employed. Result shows that the LPCC and classifier (LDA and k-NN) can be used for the recognition of repetitions and prolongations in stuttered speech with the best accuracy of 89.77%.

[1]  S. S. Awad The application of digital speech processing to stuttering therapy , 1997, IEEE Instrumentation and Measurement Technology Conference Sensing, Processing, Networking. IMTC Proceedings.

[2]  Peter Howell,et al.  The University College London Archive of Stuttered Speech (UCLASS). , 2009, Journal of speech, language, and hearing research : JSLHR.

[3]  Cary Hector,et al.  Martz, John D. (ed.) The Dynamics of Change in Latin America, 2nd ed., Prentice-Hall, Englewood Cliffs, New Jersey, x + 395 p. , 1972 .

[4]  Sazali Yaacob,et al.  Overview of Automatic Stuttering Recognition System , 2009 .

[5]  Bhavani M. Thuraisingham,et al.  Face Recognition Using Multiple Classifiers , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[6]  Chee-Ming Ting,et al.  Application of Malay speech technology in Malay Speech Therapy Assistance Tools , 2007, 2007 International Conference on Intelligent and Advanced Systems.

[7]  Oliver Chiu-sing Choy,et al.  An efficient MFCC extraction method in speech recognition , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[8]  Peter Howell,et al.  The UCLASS archive of stuttered speech , 2009 .

[9]  Peter A. Heeman,et al.  Using a Uniform-Weight Grammar to Model Disfluencies in Stuttered Read Speech : A Pilot Study , 2004 .

[10]  Peter Howell,et al.  Facilities to assist people to research into stammered speech. , 2004, Stammering research : an on-line journal published by the British Stammering Association.

[11]  S. T. Alexander,et al.  Analytical finite precision results for Burg's algorithm and the autocorrelation method for linear prediction , 1987, IEEE Trans. Acoust. Speech Signal Process..

[12]  E. Wong,et al.  Comparison of linear prediction cepstrum coefficients and mel-frequency cepstrum coefficients for language identification , 2001, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489).

[13]  K. M. Ravikumar,et al.  Automatic Detection of Syllable Repetition in Read Speech for Objective Assessment of Stuttered Disfluencies , 2008 .

[14]  H. C. Nagaraj,et al.  An Approach for Objective Assessment of Stuttered Speech Using MFCC Features , 2009 .

[15]  P Howell,et al.  Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers. , 1997, Journal of speech, language, and hearing research : JSLHR.

[16]  S. T. Alexander,et al.  An analysis of finite precision effects for the autocorrelation method and Burg's method of linear prediction , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Jasmy Yunus,et al.  Overview of a Computer-based Stuttering Therapy , 2006 .

[18]  Elmar Nöth,et al.  Automatic stuttering recognition using hidden Markov models , 2000, INTERSPEECH.

[19]  Eric Achten,et al.  fMRI of developmental stuttering: A pilot study , 2003, Brain and Language.

[20]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.