Support Vector Machines for Thai Phoneme Recognition

The Support Vector Machine (SVM) has recently been introduced as a new pattern classification technique. It learns the boundary regions between samples belonging to two classes by mapping the input samples into a high dimensional space, and seeking a separating hyperplane in this space. This paper describes an application of SVMs to two phoneme recognition problems: 5 Thai tones, and 12 Thai vowels spoken in isolation. The best results on tone recognition are 96.09% and 90.57% for the inside test and outside test, respectively, and on vowel recognition are 95.51% and 87.08% for the inside test and outside test, respectively.

[1]  Xiaoyan Zhu,et al.  An approach to smooth fundamental frequencies in tone recognition , 1998, ICCT'98. 1998 International Conference on Communication Technology. Proceedings (IEEE Cat. No.98EX243).

[2]  Bernhard Schölkopf,et al.  Support vector learning , 1997 .

[3]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[4]  Pedro J. Moreno,et al.  On the use of support vector machines for phonetic classification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5]  Pak-Chung Ching,et al.  Tone recognition of isolated Cantonese syllables , 1995, IEEE Trans. Speech Audio Process..

[6]  M. Ross,et al.  Average magnitude difference function pitch extractor , 1974 .

[7]  Herbert Gish,et al.  Speaker identification via support vector classifiers , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[8]  Linkai Bu,et al.  Perceptual speech processing and phonetic feature mapping for robust vowel recognition , 2000, IEEE Trans. Speech Audio Process..

[9]  Nakarin Satthamnuwong,et al.  Effects of Speaking Rate on Thai Tones , 1999, Phonetica.

[10]  Partha Niyogi,et al.  Distinctive feature detection using support vector machines , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[11]  Joseph Picone,et al.  Support vector machines for speech recognition , 1998, ICSLP.

[12]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[13]  Sudaporn Luksaneeyanawin,et al.  Intonation in Thai. , 1983 .

[14]  Siripong Potisuk,et al.  Tonal Coarticulation in Thai , 1994 .