Clustering-based Sequence to Sequence Model for Generative Question Answering in a Low-resource Language

Despite the impressive success of sequence to sequence models for generative question answering, they need a vast amount of question-answer pairs during training, which is hard and expensive to obtain, especially for low-resource languages. In this article, we present a framework that exploits the semantic clusters among the question-answer pairs to compensate for the lack of enough training data. In the training phase, the question-answer pairs are clustered, and a cluster predictor is trained to identify the cluster each question belongs to. Then, a sequence to sequence model is trained, where there is a different generator for each cluster in the decoder component. During the test phase, the cluster of the input question is first identified using the trained cluster predictor, and the appropriate decoder is exploited. Our experiments on a Persian religious dataset show that the proposed method outperforms the standard sequence to sequence model by a large margin in terms of ROUGE and BLEU scores. This is traced back to the lower number of words in each cluster, leading to a reduction in the number of effective parameters each generator needs to learn, which help the model learn from fewer training data with less overfitting.

[1]  Yingjie Deng,et al.  Multi-level retrieval with semantic Axiomatic Fuzzy Set clustering for question answering , 2021, Appl. Soft Comput..

[2]  Xianghua Fu,et al.  Lexicon-Enhanced Transformer with Pointing for Domains Specific Generative Question Answering , 2020, ICA3PP.

[3]  Pavel Smrz,et al.  Rethinking the Objectives of Extractive Question Answering , 2020, MRQA.

[4]  Valeriia Baranova-Bolotova,et al.  Multi-Document Answer Generation for Non-Factoid Questions , 2020, SIGIR.

[5]  Madian Khabsa,et al.  To Pretrain or Not to Pretrain: Examining the Benefits of Pretrainng on Resource Rich Tasks , 2020, ACL.

[6]  Arantxa Otegi,et al.  Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque , 2020, LREC.

[7]  Shafaatunnur Hasan,et al.  A question answering system in hadith using linguistic knowledge , 2020, Comput. Speech Lang..

[8]  Makoto Nakatsuji,et al.  Conclusion-Supplement Answer Generation for Non-Factoid Questions , 2019, AAAI.

[9]  DEEPAK GUPTA,et al.  A Deep Neural Network Framework for English Hindi Question Answering , 2019, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[10]  Raj Dabre,et al.  Exploiting Multilingualism through Multistage Fine-Tuning for Low-Resource Neural Machine Translation , 2019, EMNLP.

[11]  Gerard de Melo,et al.  A Robust Self-Learning Framework for Cross-Lingual Text Classification , 2019, EMNLP.

[12]  Tie-Yan Liu,et al.  Machine Translation With Weakly Paired Documents , 2019, EMNLP.

[13]  Seung-won Hwang,et al.  Learning with Limited Data for Multilingual Reading Comprehension , 2019, EMNLP.

[14]  Talaat Khalil,et al.  Cross-lingual intent classification in a low resource industrial setting , 2019, EMNLP.

[15]  Petr Motlicek,et al.  Abstract Text Summarization: A Low Resource Challenge , 2019, EMNLP.

[16]  Zhang Yue,et al.  Cross-Lingual Dependency Parsing Using Code-Mixed TreeBank , 2019, Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP).

[17]  Ming Yan,et al.  Incorporating External Knowledge into Machine Reading for Generative Question Answering , 2019, EMNLP.

[18]  Willie Brink,et al.  Towards Automating Healthcare Question Answering in a Noisy Multilingual Low-Resource Setting , 2019, ACL.

[19]  Ludovic Denoyer,et al.  Unsupervised Question Answering by Cloze Translation , 2019, ACL.

[20]  Kyomin Jung,et al.  A Compare-Aggregate Model with Latent Clustering for Answer Selection , 2019, CIKM.

[21]  A Saradha,et al.  A framework for intelligent question answering system using semantic context-specific document clustering and Wordnet , 2019, Sādhanā.

[22]  Adriane Boyd,et al.  Using Wikipedia Edits in Low Resource Grammatical Error Correction , 2018, NUT@EMNLP.

[23]  Bernardo Magnini,et al.  Exploring Named Entity Recognition As an Auxiliary Task for Slot Filling in Conversational Language Understanding , 2018, SCAI@EMNLP.

[24]  Mike Lewis,et al.  Generative Question Answering: Learning to Answer the Whole Question , 2018, ICLR.

[25]  Jun Zhao,et al.  Curriculum Learning for Natural Answer Generation , 2018, IJCAI.

[26]  Luiz Chaimowicz,et al.  Learning Transferable Features For Open-Domain Question Answering , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[27]  Percy Liang,et al.  Know What You Don’t Know: Unanswerable Questions for SQuAD , 2018, ACL.

[28]  Yansong Feng,et al.  Natural Answer Generation with Heterogeneous Memory , 2018, NAACL.

[29]  Ming Zhou,et al.  S-Net: From Answer Extraction to Answer Synthesis for Machine Reading Comprehension , 2018, AAAI.

[30]  Bhuwan Dhingra,et al.  Simple and Effective Semi-Supervised Question Answering , 2018, NAACL.

[31]  Tao Zhong,et al.  Multitask learning for neural generative question answering , 2018, Machine Vision and Applications.

[32]  Kyomin Jung,et al.  Learning to Rank Question-Answer Pairs Using Hierarchical Recurrent Encoder with Latent Topic Clustering , 2017, NAACL.

[33]  Jun Zhao,et al.  Generating Natural Answers by Incorporating Copying and Retrieving Mechanisms in Sequence-to-Sequence Learning , 2017, ACL.

[34]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[35]  Rajarshi Das,et al.  Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks , 2017, ACL.

[36]  P Deepak,et al.  MixKMeans: Clustering Question-Answer Archives , 2016, EMNLP.

[37]  Amit Mishra,et al.  A survey on question answering systems with classification , 2016, J. King Saud Univ. Comput. Inf. Sci..

[38]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[39]  Xin Jiang,et al.  Neural Generative Question Answering , 2015, IJCAI.

[40]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[41]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[42]  Rivindu Perera,et al.  IPedagogy: Question Answering System Based on Web Information Clustering , 2012, 2012 IEEE Fourth International Conference on Technology for Education.

[43]  Dietrich Klakow,et al.  A word clustering approach for language model-based sentence retrieval in question answering systems , 2009, CIKM.

[44]  Ankush Mittal,et al.  Question Processing and Clustering in INDOC: A Biomedical Question Answering System , 2007, EURASIP J. Bioinform. Syst. Biol..

[45]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[46]  Xin Liu,et al.  Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[47]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[48]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[49]  Zhe Gan,et al.  Cluster-Former: Clustering-based Sparse Transformer for Question Answering , 2021, FINDINGS.

[50]  Hossein Amirkhani,et al.  FarsTail: A Persian Natural Language Inference Dataset , 2020, ArXiv.

[51]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[52]  Yang Zhang,et al.  Natural Answer Generation With Attention Over Instances , 2019, IEEE Access.

[53]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .