Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks

In hierarchical text classification, we perform a sequence of inference steps to predict the category of a document from the top to the bottom of a given class taxonomy. Most studies have focused on developing novel neural network architectures to deal with the hierarchical structure, but we prefer to look for efficient ways to strengthen a baseline model. We first define the task as a sequence-to-sequence problem. Afterwards, we propose an auxiliary synthetic task of bottom-up classification. Then, from external dictionaries, we retrieve textual definitions for the classes of all the hierarchy's layers and map them into the word vector space. We use the class-definition embeddings both as an additional input to condition the prediction of the next layer and in an adapted beam search. Whereas the modified search did not provide large gains, the combination of the auxiliary task and the additional input of class definitions significantly enhances the classification accuracy. With our efficient approaches, we outperform previous studies on two well-known English datasets while using a drastically reduced number of parameters.
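As an illustration of the formulation described above, the following minimal Python sketch shows how a document with a label path could yield a top-down target sequence and a reversed, bottom-up target for the auxiliary task, and how a class definition could be mapped into the word vector space. The function names (`make_examples`, `definition_embedding`), the toy label path, and the choice of averaging word vectors are assumptions made for illustration; the abstract does not specify how definitions are embedded.

```python
from typing import Dict, List, Sequence, Tuple

Example = Tuple[List[str], List[str]]


def make_examples(tokens: Sequence[str], label_path: Sequence[str]) -> Tuple[Example, Example]:
    """Build seq2seq (source, target) pairs for one document.

    The main task predicts the label path from the top layer to the bottom;
    the auxiliary task predicts the same path in reverse order (bottom-up).
    """
    top_down = (list(tokens), list(label_path))
    bottom_up = (list(tokens), list(reversed(label_path)))
    return top_down, bottom_up


def definition_embedding(
    definition: str, word_vectors: Dict[str, List[float]], dim: int
) -> List[float]:
    """Embed a class definition as the mean of its known word vectors.

    Averaging is an assumption of this sketch, not a detail from the abstract.
    Words missing from the vocabulary are skipped; an all-zero vector is
    returned when no word is known.
    """
    vecs = [word_vectors[w] for w in definition.lower().split() if w in word_vectors]
    if not vecs:
        return [0.0] * dim
    return [sum(column) / len(vecs) for column in zip(*vecs)]


# Hypothetical usage with a toy three-layer label path and 2-d word vectors.
top_down, bottom_up = make_examples(
    ["lasers", "bend", "light"], ["Science", "Physics", "Optics"]
)
toy_vectors = {"study": [1.0, 0.0], "light": [0.0, 1.0]}
optics_vector = definition_embedding("Study of light", toy_vectors, dim=2)
```

In such a setup, the resulting class-definition vector would be supplied to the decoder alongside the previously predicted class when predicting the next layer.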
