Chinese medical question answer selection via hybrid models based on CNN and GRU

Question answer selection in the Chinese medical field is very challenging since it requires effective text representations to capture the complex semantic relationships between Chinese questions and answers. Recent approaches on deep learning, e.g., CNN and RNN, have shown their potential in improving the selection quality. However, these existing methods can only capture a part or one-side of semantic relationships while ignoring the other rich and sophisticated ones, leading to limited performance improvement. In this paper, a series of neural network models are proposed to address Chinese medical question answer selection issue. In order to model the complex relationships between questions and answers, we develop both single and hybrid models with CNN and GRU to combine the merits of different neural network architectures. This is different from existing works that can onpy capture partial relationships by utilizing a single network structure. Extensive experimental results on cMedQA dataset demonstrate that the proposed hybrid models, especially BiGRU-CNN, significantly outperform the state-of-the-art methods. The source codes of our models are available in the GitHub ( https://github.com/zhangyuteng/MedicalQA-CNN-BiGRU ).

[1]  Huimin Lu,et al.  FDCNet: filtering deep convolutional network for marine organism classification , 2018, Multimedia Tools and Applications.

[2]  George Hripcsak,et al.  Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians , 2007, J. Biomed. Informatics.

[3]  LeeYue-Shi,et al.  A support vector machine-based context-ranking model for question answering , 2013 .

[4]  Yue-Shi Lee,et al.  A support vector machine-based context-ranking model for question answering , 2013, Inf. Sci..

[5]  Huimin Lu,et al.  Multi-scale deep context convolutional neural networks for semantic segmentation , 2017, World Wide Web.

[6]  Heyan Huang,et al.  Feature Words Selection for Knowledge-based Word Sense Disambiguation with Syntactic Parsing , 2012 .

[7]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[8]  Daniele Bonadiman,et al.  Convolutional Neural Networks vs. Convolution Kernels: Feature Engineering for Answer Sentence Reranking , 2016, NAACL.

[9]  ZaragozaHugo,et al.  The Probabilistic Relevance Framework , 2009 .

[10]  Wenpeng Lu Word sense disambiguation based on dependency constraint knowledge , 2018, Cluster Computing.

[11]  Alessandro Moschitti,et al.  Linguistic kernels for answer re-ranking in question answering systems , 2011, Inf. Process. Manag..

[12]  Jing Wang,et al.  An answer recommendation algorithm for medical community question answering systems , 2016, 2016 IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI).

[13]  Huimin Lu,et al.  Learning unified binary codes for cross-modal retrieval via latent semantic hashing , 2016, Neurocomputing.

[14]  Zhaoyun Ding,et al.  Chinese Medical Question Answer Matching Using End-to-End Character-Level Multi-Scale CNNs , 2017 .

[15]  Yue Zhang,et al.  Character-Level Chinese Dependency Parsing , 2014, ACL.

[16]  Yuan Ling,et al.  Improving Clinical Diagnosis Inference through Integration of Structured and Unstructured Knowledge , 2017 .

[17]  Heng Tao Shen,et al.  Video Captioning With Attention-Based LSTM and Semantic Consistency , 2017, IEEE Transactions on Multimedia.

[18]  Xuanjing Huang,et al.  Convolutional Neural Tensor Network Architecture for Community-Based Question Answering , 2015, IJCAI.

[19]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[20]  Yann LeCun,et al.  Signature Verification Using A "Siamese" Time Delay Neural Network , 1993, Int. J. Pattern Recognit. Artif. Intell..

[21]  Bowen Zhou,et al.  ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs , 2015, TACL.

[22]  Fu Jie Huang,et al.  A Tutorial on Energy-Based Learning , 2006 .

[23]  Xuelong Li,et al.  Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval , 2017, IEEE Transactions on Image Processing.

[24]  Noah A. Smith,et al.  Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions , 2010, NAACL.

[25]  Bowen Zhou,et al.  LSTM-based Deep Learning Models for non-factoid answer selection , 2015, ArXiv.

[26]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[27]  Huimin Lu,et al.  Low illumination underwater light field images reconstruction using deep convolutional neural networks , 2018, Future Gener. Comput. Syst..

[28]  Huimin Lu,et al.  Deep adversarial metric learning for cross-modal retrieval , 2019, World Wide Web.

[29]  Tat-Seng Chua,et al.  Discovering high quality answers in community question answering archives using a hierarchy of classifiers , 2014, Inf. Sci..

[30]  Xiaoyong Du,et al.  Analogical Reasoning on Chinese Morphological and Semantic Relations , 2018, ACL.

[31]  Huimin Lu,et al.  Dual Learning for Visual Question Generation , 2018, 2018 IEEE International Conference on Multimedia and Expo (ICME).

[32]  Huimin Lu,et al.  Brain Intelligence: Go beyond Artificial Intelligence , 2017, Mobile Networks and Applications.

[33]  Chris Callison-Burch,et al.  Answer Extraction as Sequence Tagging with Tree Edit Distance , 2013, NAACL.

[34]  Hao Wu,et al.  An Empirical Study of Classifier Combination Based Word Sense Disambiguation , 2018, IEICE Trans. Inf. Syst..

[35]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[36]  Yuhua Zhang,et al.  A Chinese Question Answering Approach Integrating Count-Based and Embedding-Based Features , 2016, NLPCC/ICCPOL.

[37]  Pierre Zweigenbaum,et al.  Medical question answering: translating medical questions into sparql queries , 2012, IHI '12.

[38]  Longbing Cao,et al.  Inferring Implicit Rules by Learning Explicit and Hidden Item Dependency , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[39]  Longbing Cao,et al.  Training deep neural networks on imbalanced data sets , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[40]  Hyoil Han,et al.  A framework of a logic-based question-answering system for the medical domain (LOQAS-Med) , 2009, SAC '09.

[41]  Lingyun Xiang,et al.  A Word-Embedding-Based Steganalysis Method for Linguistic Steganography via Synonym Substitution , 2018, IEEE Access.

[42]  Wei-Ping Zhu,et al.  Multi-scale context for scene labeling via flexible segmentation graph , 2016, Pattern Recognit..

[43]  Rodney D. Nielsen,et al.  The MiPACQ clinical question answering system. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[44]  Tripti Dodiya,et al.  Rule Based Architecture for Medical Question Answering System , 2012, SocProS.

[45]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[46]  Amel Benazza-Benyahia,et al.  Efficient transform-based texture image retrieval techniques under quantization effects , 2016, Multimedia Tools and Applications.

[47]  Longbing Cao,et al.  Attention-Based Transactional Context Embedding for Next-Item Recommendation , 2018, AAAI.