Learning to Explain Chinese Slang Words

The explosive development of social media has generated a large number of slang words in Chinese social network. The appearance of Chinese slang words has affected the accuracy of reading comprehension and word segmentation tasks. In this paper, we propose explaining Chinese slang word automatically for the first time. Unlike matching words in dictionary, we use a novel neural network called DCEAnn (a Dual Character-level Encoder using Attention-based neural network) for this specific task. One encodes slang word and its phonetics to learn the word representation, the other encodes example sentence containing slang word to enrich the semantic information of the slang word. Besides, we propose a public dataset for the first time to deal with the absence of parallel corpus for training model. Manual evaluation of experimental results shows that our model can generate reasonable explanations. Furthermore, we find that our model has a better performance on the network digital language which only contains numbers. To be specific, we get the state-of-the-art result on Chinese slang words interpretation whose BLEU score is 23.64, 3.59 higher than our baseline, and the state-of-the-art result on network digital language interpretation whose BLEU score is 54.23, 3.18 higher than our baseline.