Paper information - NTCIR13 MedWeb Task: Multi-label Classification of Tweets using an Ensemble of Neural Networks

This paper describes how we tackled the Medical Natural Language Processing for Web Document (MedWeb) task as participants of NTCIR13. We used multi-language learning to integrate the task's multi-language inputs into a single neural network. We then built two neural networks with multi-language learning, a hierarchical attention network (HAN) and a deep character convolutional neural network (CharCNN), and combined their outputs to exploit the advantages of each. The outputs were combined by ensembling, specifically by bagging. The ensemble that used the negative log-likelihood (NLL) and hinge loss functions produced the best results, with 88.0% accuracy.
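The abstract does not spell out how the two networks' outputs are aggregated, so the following is only a minimal sketch of one common way to combine multi-label predictions: average each model's per-label probabilities and apply a threshold. The function name `ensemble_multilabel`, the 0.5 threshold, the three labels, and all probability values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def ensemble_multilabel(
    prob_sets: Sequence[Sequence[Sequence[float]]],
    threshold: float = 0.5,
) -> List[List[int]]:
    """Average per-label probabilities across models, then threshold.

    prob_sets[m][i][j] is model m's probability that tweet i carries label j.
    Plain averaging is an assumption here, not the paper's confirmed scheme.
    """
    n_models = len(prob_sets)
    n_tweets = len(prob_sets[0])
    n_labels = len(prob_sets[0][0])
    decisions = []
    for i in range(n_tweets):
        row = []
        for j in range(n_labels):
            # Mean probability for label j of tweet i over all models.
            mean_p = sum(model[i][j] for model in prob_sets) / n_models
            row.append(1 if mean_p >= threshold else 0)
        decisions.append(row)
    return decisions


# Hypothetical outputs for 2 tweets and 3 labels from each network.
han_probs = [[0.9, 0.2, 0.6], [0.1, 0.7, 0.4]]
charcnn_probs = [[0.7, 0.4, 0.3], [0.3, 0.9, 0.2]]

predictions = ensemble_multilabel([han_probs, charcnn_probs])
print(predictions)
```

Each label is decided independently, which is what makes the output multi-label: a tweet may receive zero, one, or several positive labels. In a bagging setup, the entries of `prob_sets` would typically come from models trained on different bootstrap resamples of the training data.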
