Amazigh Speech Recognition Embedded System

This paper investigates the Amazigh speech recognition and its usage for controlling external devices. We describe our experience to design a speech system founded on hidden Markov Models (HMMs), Gaussian mixture models (GMMs), Mel frequency spectral coefficients (MFCCs) and optimization of parameters in order to have a portability in resource limited embedded system. Our objective is developing a control Amazigh speech recognition system through a Raspberry Pi board, as well as achieving the best solution with a higher automatic speech recognition parametrization for lowcost minicomputers on a speaker-independent approach. The designed speech system was implemented on the open-source platform. The system achieves the best performance of 90.43% when trained by using 3 HMMs and 16 GMMs.

[1]  Hafedh Abid,et al.  Remote control of a domestic equipment from an Android application based on Raspberry pi card , 2014, 2014 15th International Conference on Sciences and Techniques of Automatic Control and Computer Engineering (STA).

[2]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[3]  P Ravi Babu,et al.  A Smart Home Automation technique with Raspberry Pi using IoT , 2015, 2015 International Conference on Smart Sensors and Systems (IC-SSS).

[4]  Khalid Satori,et al.  Speech Coding Effect on Amazigh Alphabet Speech Recognition Performance , 2019 .

[5]  Anant Vaibhav,et al.  Raspberry Pi based interactive home automation system through E-mail , 2014, 2014 International Conference on Reliability Optimization and Information Technology (ICROIT).

[6]  Fatima El Haoussi,et al.  Investigation Amazigh speech recognition using CMU tools , 2014, Int. J. Speech Technol..

[7]  Khalid Satori,et al.  Amazigh digits through interactive speech recognition system in noisy environment , 2020, Int. J. Speech Technol..

[8]  Frank Raffaeli,et al.  Portable low-cost platform for embedded speech analysis and synthesis , 2016, 2016 12th International Computer Engineering Conference (ICENCO).

[9]  Hamidi Mohamed,et al.  Interactive Voice Response Server Voice Network Administration Using Hidden Markov Model Speech Recognition System , 2018, 2018 Second World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4).

[10]  M. Picheny,et al.  Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences , 2017 .

[11]  Sanjay B. Deshmukh,et al.  Raspberry Pi for automation of water treatment plant , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[12]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[13]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[14]  Fung Po Tso,et al.  The Glasgow Raspberry Pi Cloud: A Scale Model for Cloud Computing Infrastructures , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops.

[15]  Marwan Al-Zabibi An acoustic-phonetic approach in automatic arabic speech recognition , 1990 .

[16]  B. S. Sreeja,et al.  International Journal of Emerging Technology in Computer Science & Electronics ( IJETCSE , 2014 .

[17]  Khalid Satori,et al.  Voice comparison between smokers and non-smokers using HMM speech recognition system , 2017, Int. J. Speech Technol..

[18]  Khalid Satori,et al.  Voice pathology assessment based on automatic speech recognition using Amazigh digits , 2018, ICSDE'18.

[19]  Youssef Es Saady,et al.  AMHCD: A Database for Amazigh Handwritten Character Recognition Research , 2011 .

[20]  Khalid Satori,et al.  Speech Recognition for Moroccan Dialects: Feature Extraction and Classification Methods , 2019 .

[21]  Yao Liang,et al.  Raspberry Pi: An Effective Vehicle in Teaching the Internet of Things in Computer Science and Engineering , 2016 .

[22]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[23]  Khalid Satori,et al.  Vocal parameters analysis of smoker using Amazigh language , 2018, Int. J. Speech Technol..

[24]  Ondrej Krejcar,et al.  Voice Recognition Software on Embedded Devices , 2018, ACIIDS.