Hardware-based support vector machine for phoneme classification

This paper presents the design of a digital hardware implementation based on Support Vector Machines (SVMs), for the task of multi-speaker phoneme recognition. The One-against-one multiclass SVM method, with the Radial Basis Function (RBF) kernel was considered. Furthermore, a priority scheme was also included in the architecture, in order to forecast the three most likely phonemes. The designed system was synthesised on a Xilinx Virtex-II XC2V3000 FPGA, and evaluated with the TIMIT corpus. This phoneme recognition system is intended to be implemented on a dedicated chip, along with the Discrete Wavelet Transforms (DWTs) for feature extraction, to further improve the resultant performance.

[1]  Minyue Fu,et al.  The use of wavelet transforms in phoneme recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  Boonserm Kijsirikul,et al.  Support Vector Machines for Thai Phoneme Recognition , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[3]  Ahmed Ben Hamida,et al.  CDHMM parameters selection for speaker-independent phone recognition in continuous speech system , 2010, MELECON 2010.

[4]  Carmen Peláez-Moreno,et al.  SVMs for Automatic Speech Recognition: A Survey , 2005, WNSP.

[5]  Edward Gatt,et al.  Phoneme recognition using neural networks , 2008, 2008 15th IEEE International Conference on Electronics, Circuits and Systems.

[6]  Ivan Grech,et al.  Support Vector Machines with the priorities method for speaker independent phoneme recognition , 2011, 2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[7]  Linda Dailey Paulson Speech Recognition Moves from Software to Hardware , 2006, Computer.

[8]  William M. Pottenger,et al.  Hardware-based support vector machine classification in logarithmic number systems , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[9]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[10]  Ivan Grech,et al.  Comparison of different multiclass SVM methods for speaker independent phoneme recognition , 2012, 2012 5th International Symposium on Communications, Control and Signal Processing.

[11]  Andreas Wendemuth,et al.  Speech recognition with support vector machines in a hybrid system , 2005, INTERSPEECH.

[12]  William M. Pottenger,et al.  Finite precision analysis of support vector machine classification in logarithmic number systems , 2004, Euromicro Symposium on Digital System Design, 2004. DSD 2004..

[13]  Niraj K. Jha,et al.  Energy efficiency of handheld computer interfaces: limits, characterization and practice , 2005, MobiSys '05.

[14]  Davide Anguita,et al.  A digital architecture for support vector machines: theory, algorithm, and FPGA implementation , 2003, IEEE Trans. Neural Networks.

[15]  Hao Tang,et al.  An initial attempt for phoneme recognition using Structured Support Vector Machine (SVM) , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[16]  B. Venkataramani,et al.  Design of a real time automatic speech recognition system using Modified One Against All SVM classifier , 2011, Microprocess. Microsystems.

[17]  Simon King,et al.  Framewise phone classification using support vector machines , 2002, INTERSPEECH.

[18]  William M. Pottenger,et al.  Finite precision analysis of support vector machine classification in logarithmic number systems , 2004 .

[19]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .

[20]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[21]  Zhang Xue-yingb Speech Recognition Based on Support Vector Machine and Error Correcting Output Codes , 2011 .

[22]  B. Venkataramani,et al.  FPGA Implementation of Support Vector Machine Based Isolated Digit Recognition System , 2009, 2009 22nd International Conference on VLSI Design.

[23]  Eric A. Brewer,et al.  Hardware speech recognition for user interfaces in low cost, low power devices , 2005, Proceedings. 42nd Design Automation Conference, 2005..