Algerian Modern Colloquial Arabic Speech Corpus (AMCASC): regional accents recognition within complex socio-linguistic environments

AbstractThe Algerian linguistic situation is very intricate due to the ethnic, geographical and colonial occupation influences which have lead to a complex sociolinguistic environment. As a result of the contact between different languages and accents, the Algerian speech community has acquired a distinctive sociolinguistic situation. In addition to the intra- and inter- lingual variations describing day-to-day linguistic behavior of the Algerian speakers, their speech is characterized by the presence of many linguistic phenomena such as bilingualism and code switching. The study of automatic regional accent recognition in such a type of environment is a new idea in the field of automatic languages, dialect and accent recognition especially that previous studies were conducted using monolingual evaluation data. The assessment of the effectiveness of GMM-UBM and i-vectors frameworks for accent recognition approaches through the use of the Algerian Modern Colloquial Arabic Speech Corpus (AMCASC), which is a linguistic resource collected for this purpose, shows that not only the recording conditions mismatch, channels mismatch, recordings length mismatch and the amplitude clipping which have a non-desirable effect on the effectiveness of these acoustic approaches but also language contact phenomena are other perturbation sources which should be taken into consideration especially in real life applications .

[1]  Hugo Van hamme,et al.  Accent recognition using i-vector, Gaussian Mean Supervector and Gaussian posterior probability supervector for spontaneous telephone speech , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Martine Adda-Decker,et al.  Characterisation and identi cation of non-native French accents , 2012 .

[3]  John H. L. Hansen,et al.  Dialect/Accent Classification Using Unrestricted Audio , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Hayet Bagui,et al.  ASPECTS OF DIGLOSSIC CODE SWITCHING SITUATIONS: A SOCIOLINGUISTIC INTERPRETATION , 2014 .

[5]  Bin Ma,et al.  Spoken Language Recognition: From Fundamentals to Practice , 2013, Proceedings of the IEEE.

[6]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[7]  Peter Ladefoged,et al.  Vowels and Consonants , 2000, Manchu Grammar.

[8]  Alice Bardiaux,et al.  Perception of French, Belgian and Swiss accents by French and Belgian listeners , 2011, ExLing.

[9]  Rong Tong,et al.  Chinese Dialect Identification Using Tone Features Based on Pitch Flux , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Mahmoud Al-Ayyoub,et al.  Spoken Arabic dialects identification: The case of Egyptian and Jordanian dialects , 2014, 2014 5th International Conference on Information and Communication Systems (ICICS).

[11]  Fayez A. Alhargan,et al.  Saudi accented Arabic voice bank , 2008, ExLing.

[12]  Anu Khosla,et al.  Automatic identification of gender & accent in spoken Hindi utterances with regional Indian accents , 2008, 2008 IEEE Spoken Language Technology Workshop.

[13]  Wen-Whei Chang,et al.  Discriminative training of Gaussian mixture bigram models with application to Chinese dialect identification , 2002, Speech Commun..

[14]  Lukás Burget,et al.  Discriminatively trained Probabilistic Linear Discriminant Analysis for speaker verification , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  James R. Glass,et al.  Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[16]  W. Idsardi,et al.  Perceptual and Phonetic Experiments on American English Dialect Identification , 1999 .

[17]  Tomi Kinnunen,et al.  Factors affecting i-vector based foreign accent recognition: A case study in spoken Finnish , 2015, Speech Commun..

[18]  E. Amalsaleh,et al.  The Identity of Patients in 12th Century Medical Text in Persian: a Sociolinguistic Perspective , 2013 .

[19]  Kanae Amino,et al.  Native vs. non-native accent identification using Japanese spoken telephone numbers , 2014, Speech Commun..

[20]  Douglas A. Reynolds,et al.  Language Recognition via i-vectors and Dimensionality Reduction , 2011, INTERSPEECH.

[21]  Tomi Kinnunen,et al.  Foreign accent detection from spoken Finnish using i-vectors , 2013, INTERSPEECH.

[22]  Sid-Ahmed Selouani,et al.  ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION , 2010 .

[23]  Bahdja Medjdoub Meriem A Sociolinguistic Profile of French in Algeria: The Case of Tlemcen Speech Community , 2014 .

[24]  A. Akbari,et al.  A classifier combination approach for Farsi accents recognition , 2012, 20th Iranian Conference on Electrical Engineering (ICEE2012).

[25]  Nizar Habash,et al.  Spoken Arabic Dialect Identification Using Phonotactic Modeling , 2009, SEMITIC@EACL.

[26]  Yun Lei,et al.  Factor analysis-based information integration for Arabic dialect identification , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Treebank Penn,et al.  Linguistic Data Consortium , 1999 .

[28]  Joseph P. Campbell,et al.  Informative dialect recognition using context-dependent pronunciation modeling , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[29]  M. A. Kohler,et al.  Language identification using shifted delta cepstra , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[30]  Zhansheng Feng,et al.  A corpus for forensic identification of dialect speaker in some area , 2011, 2011 IEEE 3rd International Conference on Communication Software and Networks.

[31]  Joseph P. Campbell,et al.  A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[32]  Bozhao Tan,et al.  An automatic non-native speaker recognition system , 2010, 2010 IEEE International Conference on Technologies for Homeland Security (HST).

[33]  M. Shamsi,et al.  The contribution of prosody to the identification of Persian regional accents , 2012, 2012 IEEE Symposium on Industrial Electronics and Applications.

[34]  Pietro Laface,et al.  Analysis of Large-Scale SVM Training Algorithms for Language and Speaker Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[35]  Patrick Kenny,et al.  Joint Factor Analysis Versus Eigenchannels in Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[36]  K. Versteegh,et al.  Encyclopedia of Arabic Language and Linguistics , 2006 .

[37]  Sébastien Marcel,et al.  Swiss French Regional Accent Identification , 2014, Odyssey.

[38]  Simon King,et al.  The voice bank corpus: Design, collection and data analysis of a large regional accent speech database , 2013, 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE).

[39]  John H. L. Hansen,et al.  Unsupervised Discriminative Training With Application to Dialect Classification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[40]  Rong Tong,et al.  Spoken Language Recognition Using Ensemble Classifiers , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[41]  Stephen J. Cox,et al.  Iterative classification of regional British accents in i-vector space , 2012, MLSLP.

[42]  Hugo Van hamme,et al.  Rapid speaker adaptation in latent speaker space with non-negative matrix factorization , 2013, Speech Commun..

[43]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[44]  Bin Ma,et al.  Shifted-Delta MLP Features for Spoken Language Recognition , 2013, IEEE Signal Processing Letters.

[45]  Martin J. Russell,et al.  Human and computer recognition of regional accents and ethnic groups from British English speech , 2013, Comput. Speech Lang..

[46]  David A. van Leeuwen,et al.  NFI-FRITS: A forensic speaker recognition database and some first experiments , 2014, Odyssey.

[47]  Joseph P. Campbell,et al.  Large-scale analysis of formant frequency estimation variability in conversational telephone speech , 2009, INTERSPEECH.

[48]  Lukás Burget,et al.  Prosodic speaker verification using subspace multinomial models with intersession compensation , 2010, INTERSPEECH.

[49]  Jirí Navrátil,et al.  Spoken language recognition-a step toward multilinguality in speech processing , 2001, IEEE Trans. Speech Audio Process..

[50]  Stephen Cox,et al.  A comparison of two unsupervised approaches to accent identification , 1998, ICSLP.

[51]  Joseph P. Campbell,et al.  Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[52]  Julia Hirschberg,et al.  Automatic Dialect and Accent Recognition and its Application to Speech Recognition , 2011 .

[53]  B. Youcef,et al.  The phonological interlanguage of the undergraduate students at the department of English, university of Mentouri, Constantine , 2017 .