An automatic speech recognition system with speaker-independent identification support

The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.

[1]  Tamás Nagy,et al.  Low-cost photoplethysmograph solutions using the Raspberry Pi , 2013, 2013 IEEE 14th International Symposium on Computational Intelligence and Informatics (CINTI).

[2]  Ronald W. Schafer,et al.  Introduction to Digital Speech Processing , 2007, Found. Trends Signal Process..

[3]  Ellen-Louise Bleeker,et al.  Creating a Raspberry Pi-Based Beowulf Cluster , 2017 .

[4]  Lawrence R. Rabiner,et al.  Applications of speech recognition in the area of telecommunications , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[5]  Wayne H. Ward,et al.  Speech recognition , 1997 .

[6]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[7]  Michiel Bacchiani,et al.  Restoring punctuation and capitalization in transcribed speech , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Kai-Fu Lee,et al.  Automatic Speech Recognition , 1989 .

[9]  Fung Po Tso,et al.  The Glasgow Raspberry Pi Cloud: A Scale Model for Cloud Computing Infrastructures , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops.