BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification

Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) model to tackle both country-level and province-level MSA/DA identification. The latter MTL model consists of a shared Bidirectional Encoder Representation Transformers (BERT) encoder, two task-specific attention layers, and two classifiers. Our key idea is to leverage both the task-discriminative and the inter-task shared features for country and province MSA/DA identification. The obtained results show that our MTL model outperforms single-task models on most subtasks.

[1]  Chris Callison-Burch,et al.  Arabic Dialect Identification , 2014, CL.

[2]  Muhammad Abdul-Mageed,et al.  ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic , 2020, ACL.

[3]  Nizar Habash,et al.  Conventional Orthography for Dialectal Arabic , 2012, LREC.

[4]  Nizar Habash,et al.  Introduction to Arabic Natural Language Processing , 2010, Introduction to Arabic Natural Language Processing.

[5]  Motaz Saad,et al.  ArbDialectID at MADAR Shared Task 1: Language Modelling and Ensemble Learning for Fine Grained Arabic Dialect Identification , 2019, WANLP@ACL 2019.

[6]  Karima Meftouh,et al.  The SMarT Classifier for Arabic Fine-Grained Dialect Identification , 2019, WANLP@ACL 2019.

[7]  Ibrahim A. Al-Kharashi,et al.  Arabic morphological analysis techniques: A comprehensive survey , 2004, J. Assoc. Inf. Sci. Technol..

[8]  Ahmed Khoumsi,et al.  Weighted combination of BERT and N-GRAM features for Nuanced Arabic Dialect Identification , 2020, WANLP.

[9]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[10]  Muhammad Abdul-Mageed,et al.  Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments , 2020, EMNLP.

[11]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[12]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[13]  Nizar Habash,et al.  NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task , 2021, WANLP.

[14]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[15]  Houda Bouamor,et al.  Fine-Grained Arabic Dialect Identification , 2018, COLING.

[16]  Ibraheem Tuffaha,et al.  Multi-dialect Arabic BERT for Country-level Dialect Identification , 2020, WANLP.

[17]  Nizar Habash,et al.  The MADAR Shared Task on Arabic Fine-Grained Dialect Identification , 2019, WANLP@ACL 2019.

[18]  Nizar Habash,et al.  NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task , 2020, WANLP.