Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval

We study the potential of supervised learning to rank for FAQ retrieval. Supervised models offer performance improvements for this task. We explored low-effort paraphrase-based data labeling strategies. Paraphrase-based labeling was effective for the best models on two FAQ data collections. We make a new FAQ retrieval data set publicly available.

A frequently asked questions (FAQ) retrieval system improves access to information by allowing users to pose natural language queries over an FAQ collection. From an information retrieval perspective, FAQ retrieval is a challenging task, mainly because of the lexical gap between a query and an FAQ pair, both of which are typically very short. In this work, we explore the use of supervised learning to rank to improve the performance of domain-specific FAQ retrieval. While supervised learning-to-rank models have been shown to yield effective retrieval performance, they require costly human-labeled training data in the form of document relevance judgments or question paraphrases. We investigate how this labeling effort can be reduced using a labeling strategy geared toward the manual creation of query paraphrases rather than the more time-consuming relevance judgments. In particular, we investigate two such strategies and test them by applying supervised ranking models to two domain-specific FAQ retrieval data sets that showcase typical FAQ retrieval scenarios. Our experiments show that supervised ranking models can yield significant improvements in the precision-at-rank-5 measure compared to unsupervised baselines. Furthermore, we show that a supervised model trained on data labeled via a low-effort paraphrase-focused strategy matches the performance of the same model trained on fully labeled data, indicating that the strategy reduces the labeling effort while retaining the performance gains of the supervised approach. To encourage further research on FAQ retrieval, we make our FAQ retrieval data set publicly available.
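
For readers unfamiliar with the precision-at-rank-5 measure mentioned above, the following is a minimal sketch of how it is commonly computed for a single query: the fraction of the five top-ranked FAQ pairs that are relevant. The function name, the FAQ identifiers, and the relevance set are hypothetical and chosen only for illustration; the abstract does not describe the authors' evaluation code.

```python
from typing import Sequence, Set


def precision_at_k(ranked_ids: Sequence[str], relevant_ids: Set[str], k: int = 5) -> float:
    """Fraction of the top-k ranked items that are relevant.

    ranked_ids: item identifiers ordered from best to worst score.
    relevant_ids: identifiers judged relevant for the query.
    The denominator is k even if fewer than k items were returned.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_ids[:k]
    hits = sum(1 for item_id in top_k if item_id in relevant_ids)
    return hits / k


# Hypothetical ranking for one query (illustrative data, not from the paper).
ranking = ["faq7", "faq2", "faq9", "faq4", "faq1", "faq3"]
relevant = {"faq2", "faq4", "faq3"}

# Two of the five top-ranked FAQ pairs (faq2, faq4) are relevant.
print(precision_at_k(ranking, relevant))  # 0.4
```

In an evaluation over many queries, this per-query value would typically be averaged across the query set.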
