Comparison of different multiclass SVM methods for speaker independent phoneme recognition

Four multiclass Support Vector Machines (SVMs) methods were designed for the task of speaker independent phoneme recognition. These are the All-at-once, One-against-all, One-against-one, and the Directed Acyclic Graph SVM (DAGSVM). The Discrete Wavelet Transform (DWT) 8 frequency band power percentages are used for feature extraction. All tests were carried out on the TIMIT database. Comparable recognition rates were obtained from all designed systems. However, the One-against-One method performed best, achieving an accuracy of 53.70% for multi-speaker unlimited vocabulary speech. The phoneme recognition system, adopting the DWT and the One-against-one method, are intended to be implemented on a dedicated chip. The dedicated chip will improve the speed performance by approximately 100 times when comparing the hardware setup with the software implementation. This is obtained by providing the hardware parallelism, which accommodates the algorithms that have been used.

[1]  Pedro J. Moreno,et al.  On the use of support vector machines for phonetic classification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[2]  Carmen Peláez-Moreno,et al.  SVMs for Automatic Speech Recognition: A Survey , 2005, WNSP.

[3]  Tetsunori Kobayashi,et al.  A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification , 2010, IEEE Journal of Selected Topics in Signal Processing.

[4]  S. Sathiya Keerthi,et al.  Which Is the Best Multiclass SVM Method? An Empirical Study , 2005, Multiple Classifier Systems.

[5]  O. Casha,et al.  Neural network architectures for speaker independent phoneme recognition , 2011, 2011 7th International Symposium on Image and Signal Processing and Analysis (ISPA).

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[8]  Simon King,et al.  Framewise phone classification using support vector machines , 2002, INTERSPEECH.

[9]  Hyrum S. Anderson,et al.  Training a support vector machine to classify signals in a real environment given clean training data , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Gert Cauwenberghs,et al.  Forward Decoding Kernel Machines: A Hybrid HMM/SVM Approach to Sequence Recognition , 2002, SVM.

[11]  Václav Hlavác,et al.  Multi-class support vector machine , 2002, Object recognition supported by user interaction for service robots.

[12]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[13]  Zhang Xue-ying,et al.  Speech Recognition Based on Support Vector Machine and Error Correcting Output Codes , 2010, 2010 First International Conference on Pervasive Computing, Signal Processing and Applications.