Pattern search in dysfluent speech

Pattern recognition in time series is often used in data mining and in bioinformatics. Speech can be considered only as a different type of signal and processed as a time series. Stuttered speech is rich in events also known as dysfluencies, typically repetitions. This paper describes a new method for enumerating complex repetitions. Classical approaches to stuttered speech analyzed dysfluencies in very short intervals, which were sufficient for recognizing simple repetitions of phonemes. However, the problem of repetitions of syllables or words was typically ignored due to high computational demands of classical methods for analysis of longer intervals. Our approach uses a method adopted from data mining and bioinformatics, together with efficient representation of speech signal, which simplifies processing of speech enough to enable analysis of longer intervals. Results show applicability of the proposed method.

[1]  Peter Howell,et al.  The University College London Archive of Stuttered Speech (UCLASS). , 2009, Journal of speech, language, and hearing research : JSLHR.

[2]  Ronald W. Schafer,et al.  Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..

[3]  Eugene W. Myers,et al.  Suffix arrays: a new method for on-line string searches , 1993, SODA '90.

[4]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[5]  M. Hariharan,et al.  MFCC based recognition of repetitions and prolongations in stuttered speech using k-NN and LDA , 2009, 2009 IEEE Student Conference on Research and Development (SCOReD).

[6]  Wieslawa Kuniszyk-Józkowiak,et al.  Speech nonfluency detection using Kohonen networks , 2009, Neural Computing and Applications.

[7]  Thomas S. Huang,et al.  Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8]  M. Hariharan,et al.  Automatic detection of prolongations and repetitions using LPCC , 2009, 2009 International Conference for Technical Postgraduates (TECHPOS).

[9]  K. M. Ravikumar,et al.  Automatic Detection of Syllable Repetition in Read Speech for Objective Assessment of Stuttered Disfluencies , 2008 .

[10]  Sergios Theodoridis,et al.  Introduction to Pattern Recognition: A Matlab Approach , 2010 .

[11]  Christos Faloutsos,et al.  Fast Time Sequence Indexing for Arbitrary Lp Norms , 2000, VLDB.

[12]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.