Interactive Voice Application-Based Amazigh Speech Recognition

This paper aims to build an interactive speaker-independent automatic Amazigh speech recognition system. The proposed system offers a methodology to extract data remotely from a distance database using the combined interactive voice response (IVR) and automatic speech recognition (ASR) technologies. We describe our experience to design an interactive speech system based on hidden Markov models (HMMs), Gaussian mixture models (GMMs) and Mel frequency spectral coefficients (MFCCs) based on ten first Amazigh digits and six Amazigh words. The best-obtained performance is 89.64% by using 3 HMMs and 16 GMMs.

[1]  Hamidi Mohamed,et al.  Interactive Voice Response Server Voice Network Administration Using Hidden Markov Model Speech Recognition System , 2018, 2018 Second World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4).

[2]  Carl E. Rasmussen,et al.  Factorial Hidden Markov Models , 1997 .

[3]  Joyanta Basu,et al.  Real Time Challenges to Handle the Telephonic Speech Recognition System , 2013 .

[4]  Khalid Satori,et al.  Voice comparison between smokers and non-smokers using HMM speech recognition system , 2017, Int. J. Speech Technol..

[5]  William Yurcik,et al.  Multiple design patterns for voice over IP (VoIP) security , 2006, 2006 IEEE International Performance Computing and Communications Conference.

[6]  Khalid Satori,et al.  Voice pathology assessment based on automatic speech recognition using Amazigh digits , 2018, ICSDE'18.

[7]  Fatima El Haoussi,et al.  Investigation Amazigh speech recognition using CMU tools , 2014, Int. J. Speech Technol..

[8]  Khalid Satori,et al.  Vocal parameters analysis of smoker using Amazigh language , 2018, Int. J. Speech Technol..

[9]  Volker Steinbiss,et al.  The Philips automatic train timetable information system , 1995, Speech Commun..

[10]  Chitralekha Bhat,et al.  Deploying usable speech enabled IVR systems for mass use , 2013, 2013 International Conference on Human Computer Interactions (ICHCI).

[11]  Khalid Satori,et al.  Speech Recognition for Moroccan Dialects: Feature Extraction and Classification Methods , 2019 .

[12]  Jim Van Meggelen,et al.  Asterisk: The Definitive Guide , 2005 .

[13]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[14]  Satya Prakash Ghrera,et al.  A novel approach for security issues in VoIP networks in Virtualization with IVR , 2012, ArXiv.

[15]  Muhammad Ali Akbar,et al.  Evaluating DoS Attacks against Sip-Based VoIP Systems , 2009, GLOBECOM 2009 - 2009 IEEE Global Telecommunications Conference.

[16]  Khalid Satori,et al.  Speech Coding Effect on Amazigh Alphabet Speech Recognition Performance , 2019 .