Paper Information - Speech Recognition Based on Open Source Speech Processing Software

Speech Recognition Based on Open Source Speech Processing Software

Creating a speech recognition application requires advanced speech processing techniques implemented in specialized speech processing software. Speech recognition research can be advanced by using frameworks built on open source speech processing software. The article presents the possibility of using open source speech processing software to build one's own speech recognition application.

Piotr Klosowski | Jacek Izydorczyk | Adam Dustor | Jan Kotas | Jacek Slimok

[1] J.A. Bilmes, et al. Graphical model architectures for speech recognition, 2005, IEEE Signal Processing Magazine.

[2] Jonathan G. Fiscus, et al. DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CD-ROM, 1993, NIST.

[3] Kadri Hacioglu, et al. Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task, 2003, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03).

[4] Piotr Klosowski. Speech Processing Application Based on Phonetics and Phonology of the Polish Language, 2010, CN.

[5] S. S. Stevens, et al. The Relation of Pitch to Frequency: A Revised Scale, 1940.

[6] H. Hermansky, et al. Perceptual linear predictive (PLP) analysis of speech, 1990, The Journal of the Acoustical Society of America.

[7] B. Ziółko, et al. Application of HTK to the Polish language, 2008, International Conference on Audio, Language and Image Processing.

[8] Reinhold Orglmeister, et al. CMU Sphinx4 speech recognizer in a Service-oriented Computing style, 2011, IEEE International Conference on Service-Oriented Computing and Applications (SOCA).

[9] Piotr Klosowski, et al. Biometric Voice Identification Based on Fuzzy Kernel Classifier, 2013, CN.

[10] P. Kłosowski. Improving speech processing based on phonetics and phonology of Polish language, 2013.

[11] Ronald W. Schafer, et al. Introduction to Digital Speech Processing, 2007, Found. Trends Signal Process.

[12] Driss Matrouf, et al. State-of-the-Art Performance in Text-Independent Speaker Verification Through Open-Source Software, 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[13] Jean-François Bonastre, et al. ALIZE, a free toolkit for speaker recognition, 2005, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05).

[14] Steve Young, et al. The HTK book, 1995.

[15] S. Howard Bartley, et al. The relation of pitch to frequency, 1950.

[16] Geoffrey Zweig, et al. The graphical models toolkit: An open source software system for speech and time-series processing, 2002, IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17] Piotr Klosowski, et al. Automatic Speech Segmentation for Automatic Speech Translation, 2013, CN.