论文信息 - Clustering-based Sequence to Sequence Model for Generative Question Answering in a Low-resource Language

Clustering-based Sequence to Sequence Model for Generative Question Answering in a Low-resource Language

Despite the impressive success of sequence to sequence models for generative question answering, they need a vast amount of question-answer pairs during training, which is hard and expensive to obtain, especially for low-resource languages. In this article, we present a framework that exploits the semantic clusters among the question-answer pairs to compensate for the lack of enough training data. In the training phase, the question-answer pairs are clustered, and a cluster predictor is trained to identify the cluster each question belongs to. Then, a sequence to sequence model is trained, where there is a different generator for each cluster in the decoder component. During the test phase, the cluster of the input question is first identified using the trained cluster predictor, and the appropriate decoder is exploited. Our experiments on a Persian religious dataset show that the proposed method outperforms the standard sequence to sequence model by a large margin in terms of ROUGE and BLEU scores. This is traced back to the lower number of words in each cluster, leading to a reduction in the number of effective parameters each generator needs to learn, which help the model learn from fewer training data with less overfitting.

Hossein Amirkhani | A. Bidgoly | Razieh Baradaran

[1] Yingjie Deng,et al. Multi-level retrieval with semantic Axiomatic Fuzzy Set clustering for question answering , 2021, Appl. Soft Comput..

[2] Xianghua Fu,et al. Lexicon-Enhanced Transformer with Pointing for Domains Specific Generative Question Answering , 2020, ICA3PP.

[3] Pavel Smrz,et al. Rethinking the Objectives of Extractive Question Answering , 2020, MRQA.

[4] Valeriia Baranova-Bolotova,et al. Multi-Document Answer Generation for Non-Factoid Questions , 2020, SIGIR.

[5] Madian Khabsa,et al. To Pretrain or Not to Pretrain: Examining the Benefits of Pretrainng on Resource Rich Tasks , 2020, ACL.

[6] Arantxa Otegi,et al. Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque , 2020, LREC.

[7] Shafaatunnur Hasan,et al. A question answering system in hadith using linguistic knowledge , 2020, Comput. Speech Lang..

[8] Makoto Nakatsuji,et al. Conclusion-Supplement Answer Generation for Non-Factoid Questions , 2019, AAAI.

[9] DEEPAK GUPTA,et al. A Deep Neural Network Framework for English Hindi Question Answering , 2019, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[10] Raj Dabre,et al. Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation , 2019, EMNLP.

[11] Gerard de Melo,et al. A Robust Self-Learning Framework for Cross-Lingual Text Classification , 2019, EMNLP.

[12] Tie-Yan Liu,et al. Machine Translation With Weakly Paired Documents , 2019, EMNLP.

[13] Seung-won Hwang,et al. Learning with Limited Data for Multilingual Reading Comprehension , 2019, EMNLP.

[14] Talaat Khalil,et al. Cross-lingual intent classification in a low resource industrial setting , 2019, EMNLP.

[15] Petr Motlicek,et al. Abstract Text Summarization: A Low Resource Challenge , 2019, EMNLP.

[16] Zhang Yue,et al. Cross-Lingual Dependency Parsing Using Code-Mixed TreeBank , 2019, Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP).

[17] Ming Yan,et al. Incorporating External Knowledge into Machine Reading for Generative Question Answering , 2019, EMNLP.

[18] Willie Brink,et al. Towards Automating Healthcare Question Answering in a Noisy Multilingual Low-Resource Setting , 2019, ACL.

[19] Ludovic Denoyer,et al. Unsupervised Question Answering by Cloze Translation , 2019, ACL.

[20] Kyomin Jung,et al. A Compare-Aggregate Model with Latent Clustering for Answer Selection , 2019, CIKM.

[21] A Saradha,et al. A framework for intelligent question answering system using semantic context-specific document clustering and Wordnet , 2019, Sādhanā.

[22] Adriane Boyd,et al. Using Wikipedia Edits in Low Resource Grammatical Error Correction , 2018, NUT@EMNLP.

[23] Bernardo Magnini,et al. Exploring Named Entity Recognition As an Auxiliary Task for Slot Filling in Conversational Language Understanding , 2018, SCAI@EMNLP.

[24] Mike Lewis,et al. Generative Question Answering: Learning to Answer the Whole Question , 2018, ICLR.

[25] Jun Zhao,et al. Curriculum Learning for Natural Answer Generation , 2018, IJCAI.

[26] Luiz Chaimowicz,et al. Learning Transferable Features For Open-Domain Question Answering , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[27] Percy Liang,et al. Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.

[28] Yansong Feng,et al. Natural Answer Generation with Heterogeneous Memory , 2018, NAACL.