Cross-language question retrieval with multi-layer representation and layer-wise adversary

Abstract In cross-language question retrieval (CLQR), users employ a new question in one language to search the community question answering (CQA) archives for similar questions in another language. In addition to the ranking problem in monolingual question retrieval, one needs to bridge the language gap in CLQR. Existing adversarial models for cross-language learning normally rely on a single adversarial component. Since natural languages consist of units at different levels of abstraction, we argue that crossing the language gap adaptively at different levels with multiple adversarial components should lead to smoother text representations and better CLQR performance. To this end, we first encode questions into multi-layer representations at different levels of abstraction with a CNN-based model, which enhances conventional models with diverse kernel shapes and a corresponding pooling strategy so as to capture different aspects of a text segment. We then impose a set of adversarial components on different layers of the question representation so as to decide the appropriate levels of abstraction and their roles in performing cross-language mapping. Experimental results on two real-world datasets demonstrate that our model outperforms state-of-the-art models for CLQR and is on par with strong machine translation baselines and most monolingual baselines.
