A Domain-Specific Non-Factoid Question Answering System based on Terminology Mining and Siamese Neural Network

The non-factoid question answering system (QAS) responds to an input question by fetching an answer from a question answering (QA) database. The existing non-factoid QASs still cannot well adapt to specific professional domains due to the lack of domain knowledge. Aiming at this problem, this paper proposes a domain-specific non-factoid QAS by combining information retrieval technique and deep neural network. First, it extracts professional terms from the domain-specific documents. The professional terms can be used as an important source of domain knowledge. Second, it trains a deep Siamese neural network for semantically matching the questions. Finally, it queries and ranks the candidate answers based on the professional terms and the deep Siamese neural network. We conducted experiments based on two real domain-specific QA databases, and the experiment results have demonstrated the effectiveness of the proposed QAS.

[1]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[2]  K. Srinathan,et al.  Automatic keyphrase extraction from scientific documents using N-gram filtration technique , 2008, ACM Symposium on Document Engineering.

[3]  Katja Hose,et al.  Retrieving Textual Evidence for Knowledge Graph Facts , 2019, ESWC.

[4]  Zhiyuan Liu,et al.  Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search , 2018, WSDM.

[5]  M. Abulaish,et al.  A SUPERVISED LEARNING APPROACH FOR AUTOMATIC KEYPHRASE EXTRACTION , 2012 .

[6]  W. Bruce Croft,et al.  Answer Interaction in Non-factoid Question Answering Systems , 2019, CHIIR.

[7]  Wanxiang Che,et al.  Pre-Training with Whole Word Masking for Chinese BERT , 2019, ArXiv.

[8]  Xindong Wu,et al.  Document-Specific Keyphrase Extraction Using Sequential Patterns with Wildcards , 2014, 2014 IEEE International Conference on Data Mining.

[9]  Qingbo Yang,et al.  Question Answering System Based on Web , 2012, 2012 Fifth International Conference on Intelligent Computation Technology and Automation.

[10]  Byeong Ho Kang,et al.  Open-Domain Question Answering Framework Using Wikipedia , 2016, Australasian Conference on Artificial Intelligence.

[11]  Xuanjing Huang,et al.  Convolutional Neural Tensor Network Architecture for Community-Based Question Answering , 2015, IJCAI.

[12]  Qingtang Liu,et al.  Design and Implementation of Automatic Question Answering System in Information Retrieval , 2018, IEEA '18.

[13]  Jin Liu,et al.  The Hot Keyphrase Extraction Based on TF*PDF , 2011, 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications.

[14]  Lee-Feng Chien,et al.  PAT-tree-based keyword extraction for Chinese information retrieval , 1997, SIGIR '97.

[15]  Denis Savenkov Ranking Answers and Web Passages for Non-factoid Question Answering: Emory University at TREC LiveQA , 2015, TREC.

[16]  Juan Zhou,et al.  A SOA Based Domain-Specific Chinese QA System , 2010, 2010 Ninth International Conference on Grid and Cloud Computing.

[17]  Ali Farhadi,et al.  Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.

[18]  Yang Xiang,et al.  A Chinese question answering system based on web search , 2014, 2014 International Conference on Machine Learning and Cybernetics.

[19]  Tingting He,et al.  Learning semantic representation with neural networks for community question answering retrieval , 2016, Knowl. Based Syst..

[20]  Rakesh Chandra Balabantaray,et al.  Question answering system for factoid based question , 2016, 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I).

[21]  Hajime Morita,et al.  Question answering system using Q & A site corpus Query expansion and answer candidate evaluation , 2013, SpringerPlus.

[22]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[23]  Huan Wang,et al.  Incorporating External Knowledge to Boost Machine Comprehension Based Question Answering , 2019, ECIR.

[24]  Tie-Yan Liu,et al.  Dual Learning for Machine Translation , 2016, NIPS.

[25]  W. Bruce Croft,et al.  Beyond Factoid QA: Effective Methods for Non-factoid Answer Sentence Retrieval , 2016, ECIR.

[26]  Yann LeCun,et al.  Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[27]  Jimmy J. Lin,et al.  Data-Intensive Question Answering , 2001, TREC.

[28]  Jonas Mueller,et al.  Siamese Recurrent Architectures for Learning Sentence Similarity , 2016, AAAI.

[29]  Jamie Callan,et al.  Deeper Text Understanding for IR with Contextual Neural Language Modeling , 2019, SIGIR.

[30]  Ahmad Aghaebrahimian,et al.  Open-domain Factoid Question Answering via Knowledge Graph Search , 2016 .

[31]  Arpita Das,et al.  Together we stand: Siamese Networks for Similar Question Retrieval , 2016, ACL.

[32]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.