Question Classification by Convolutional Neural Networks Embodying Subword Information

Question Classification is a core module of the Question Answering paradigm. The development of classification models based on neural networks has shown that convolutional architectures achieve top results on this task. In particular, this type of approach avoids explicit feature extraction from questions by treating text as a sequence of words and transforming each word into a dense vector, called a word embedding. Among the different techniques for learning word embeddings, a recent approach also takes subword information into account, which could be very useful for morphologically rich languages. In this paper, a Question Classification approach based on word embeddings with subword information and Convolutional Neural Networks is proposed in order to improve classification accuracy. In particular, questions taken from a TREC dataset are considered, and a comparison between the English and Italian languages is reported, highlighting any improvements obtained by initializing word embeddings with advanced vectors that are learned in an unsupervised manner using the skip-gram model and that include character-based information.
