Paper information - Amazigh Speech Recognition Embedded System

Amazigh Speech Recognition Embedded System

This paper investigates Amazigh speech recognition and its use for controlling external devices. We describe our experience designing a speech system based on hidden Markov models (HMMs), Gaussian mixture models (GMMs), and Mel-frequency cepstral coefficients (MFCCs), with parameters optimized so that the system can be ported to resource-limited embedded systems. Our objective is to develop an Amazigh speech recognition control system on a Raspberry Pi board and to find the best automatic speech recognition parameterization for low-cost minicomputers under a speaker-independent approach. The designed speech system was implemented on an open-source platform. The system achieves its best performance, 90.43%, when trained using 3 HMMs and 16 GMMs.
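
The abstract does not spell out how an HMM/GMM recognizer scores an utterance, so the following is only a minimal sketch of the general technique, not the authors' implementation. It assumes each word has a left-to-right HMM whose states emit feature frames from diagonal-covariance GMMs, and it picks the word whose model gives the highest forward-algorithm log-likelihood. The word labels, the 1-D "features", the 3-state topology, and the single Gaussian per state are all hypothetical toy choices made for brevity; a real system would use MFCC vectors and trained parameters.

```python
import math

NEG_INF = float("-inf")

def safe_log(p):
    """log(p) that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0 else NEG_INF

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))

def gmm_log_likelihood(x, weights, means, variances):
    """Log density of feature vector x under a diagonal-covariance GMM."""
    terms = []
    for w, mu, var in zip(weights, means, variances):
        log_gauss = sum(
            -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
            for xi, m, v in zip(x, mu, var)
        )
        terms.append(safe_log(w) + log_gauss)
    return log_sum_exp(terms)

def hmm_log_likelihood(frames, log_start, log_trans, state_gmms):
    """Forward algorithm in the log domain: log P(frames | word model)."""
    n = len(log_start)
    alpha = [
        log_start[s] + gmm_log_likelihood(frames[0], *state_gmms[s])
        for s in range(n)
    ]
    for x in frames[1:]:
        alpha = [
            log_sum_exp([alpha[p] + log_trans[p][s] for p in range(n)])
            + gmm_log_likelihood(x, *state_gmms[s])
            for s in range(n)
        ]
    return log_sum_exp(alpha)

def make_word_model(state_means):
    """Hypothetical toy model: 3 left-to-right states, one 1-D Gaussian each."""
    log_start = [safe_log(p) for p in (1.0, 0.0, 0.0)]
    log_trans = [
        [safe_log(p) for p in row]
        for row in ((0.5, 0.5, 0.0), (0.0, 0.5, 0.5), (0.0, 0.0, 1.0))
    ]
    # Each state's GMM is (weights, means, variances) with a single component.
    state_gmms = [([1.0], [[m]], [[1.0]]) for m in state_means]
    return log_start, log_trans, state_gmms

# Two made-up word models and a made-up rising feature sequence.
models = {
    "word_a": make_word_model([0.0, 2.0, 4.0]),
    "word_b": make_word_model([4.0, 2.0, 0.0]),
}
frames = [[0.1], [1.9], [2.2], [3.8]]
best = max(models, key=lambda w: hmm_log_likelihood(frames, *models[w]))
print(best)  # word_a: its state means follow the rising frames
```

Working in the log domain avoids numerical underflow on long utterances, which matters on any hardware and is cheap enough for a small board; increasing the number of states or mixture components raises both memory use and per-frame cost roughly linearly.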

[1] Hafedh Abid,et al. Remote control of a domestic equipment from an Android application based on Raspberry pi card , 2014, 2014 15th International Conference on Sciences and Techniques of Automatic Control and Computer Engineering (STA).

[2] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[3] P Ravi Babu,et al. A Smart Home Automation technique with Raspberry Pi using IoT , 2015, 2015 International Conference on Smart Sensors and Systems (IC-SSS).

[4] Khalid Satori,et al. Speech Coding Effect on Amazigh Alphabet Speech Recognition Performance , 2019 .

[5] Anant Vaibhav,et al. Raspberry Pi based interactive home automation system through E-mail , 2014, 2014 International Conference on Reliability Optimization and Information Technology (ICROIT).

[6] Fatima El Haoussi,et al. Investigation Amazigh speech recognition using CMU tools , 2014, Int. J. Speech Technol..

[7] Khalid Satori,et al. Amazigh digits through interactive speech recognition system in noisy environment , 2020, Int. J. Speech Technol..

[8] Frank Raffaeli,et al. Portable low-cost platform for embedded speech analysis and synthesis , 2016, 2016 12th International Computer Engineering Conference (ICENCO).

[9] Hamidi Mohamed,et al. Interactive Voice Response Server Voice Network Administration Using Hidden Markov Model Speech Recognition System , 2018, 2018 Second World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4).

[10] M. Picheny,et al. Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences , 2017 .

[11] Sanjay B. Deshmukh,et al. Raspberry Pi for automation of water treatment plant , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[12] Alex Waibel,et al. Readings in speech recognition , 1990 .

[13] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .

[14] Fung Po Tso,et al. The Glasgow Raspberry Pi Cloud: A Scale Model for Cloud Computing Infrastructures , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems Workshops.

[15] Marwan Al-Zabibi. An acoustic-phonetic approach in automatic arabic speech recognition , 1990 .

[16] B. S. Sreeja,et al. International Journal of Emerging Technology in Computer Science & Electronics ( IJETCSE , 2014 .

[17] Khalid Satori,et al. Voice comparison between smokers and non-smokers using HMM speech recognition system , 2017, Int. J. Speech Technol..

[18] Khalid Satori,et al. Voice pathology assessment based on automatic speech recognition using Amazigh digits , 2018, ICSDE'18.

[19] Youssef Es Saady,et al. AMHCD: A Database for Amazigh Handwritten Character Recognition Research , 2011 .

[20] Khalid Satori,et al. Speech Recognition for Moroccan Dialects: Feature Extraction and Classification Methods , 2019 .

[21] Yao Liang,et al. Raspberry Pi: An Effective Vehicle in Teaching the Internet of Things in Computer Science and Engineering , 2016 .

[22] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[23] Khalid Satori,et al. Vocal parameters analysis of smoker using Amazigh language , 2018, Int. J. Speech Technol..

[24] Ondrej Krejcar,et al. Voice Recognition Software on Embedded Devices , 2018, ACIIDS.