Evaluating Semantic Rationality of a Sentence: A Sememe-Word-Matching Neural Network based on HowNet

Automatic evaluation of semantic rationality is an important yet challenging task, and current automatic techniques cannot well identify whether a sentence is semantically rational. The methods based on the language model do not measure the sentence by rationality but by commonness. The methods based on the similarity with human written sentences will fail if human-written references are not available. In this paper, we propose a novel model called Sememe-Word-Matching Neural Network (SWM-NN) to tackle semantic rationality evaluation by taking advantage of sememe knowledge base HowNet. The advantage is that our model can utilize a proper combination of sememes to represent the fine-grained semantic meanings of a word within the specific contexts. We use the fine-grained semantic representation to help the model learn the semantic dependency among words. To evaluate the effectiveness of the proposed model, we build a large-scale rationality evaluation dataset. Experimental results on this dataset show that the proposed model outperforms the competitive baselines with a 5.4\% improvement in accuracy.

[1]  Shi Feng,et al.  Knowledge-Based Semantic Embedding for Machine Translation , 2016, ACL.

[2]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[3]  Lucian Vlad Lita,et al.  Learning Within-Sentence Semantic Coherence , 2001, EMNLP.

[4]  Yue Zhang,et al.  Neural Networks for Open Domain Targeted Sentiment , 2015, EMNLP.

[5]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[6]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[7]  Xu Sun,et al.  Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions , 2018, ArXiv.

[8]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[9]  Alaa A. Kharbouch,et al.  Three models for the description of language , 1956, IRE Trans. Inf. Theory.

[10]  Zhiyuan Liu,et al.  Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention , 2018, AAAI.

[11]  Xu Sun,et al.  Duplicate Question Identification by Integrating FrameNet With Neural Networks , 2018, AAAI.

[12]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[13]  Xu Sun,et al.  A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification , 2017, ArXiv.

[14]  Zhiyuan Liu,et al.  Lexical Sememe Prediction via Word Embeddings and Matrix Factorization , 2017, IJCAI.

[15]  Ian H. Witten,et al.  The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.

[16]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[17]  L. Bloomfield A Set of Postulates for the Science of Language , 1926, International Journal of American Linguistics.

[18]  Zhiyuan Liu,et al.  Improved Word Representation Learning with Sememes , 2017, ACL.

[19]  Kenneth Ward Church,et al.  A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams , 1991 .

[20]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[21]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[22]  A. Nadas,et al.  Estimation of probabilities in the language model of the IBM speech recognition system , 1984 .

[23]  Xu Sun,et al.  Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization , 2017, ACL.

[24]  Chris Callison-Burch,et al.  Optimizing Statistical Machine Translation for Text Simplification , 2016, TACL.

[25]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[26]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[27]  Wei Li,et al.  Improving Word Vector with Prior Knowledge in Semantic Dictionary , 2016, NLPCC/ICCPOL.

[28]  Bo Xu,et al.  Word Sense Disambiguation through Sememe Labeling , 2007, IJCAI.

[29]  Zhiyuan Liu,et al.  Incorporating Chinese Characters of Words for Lexical Sememe Prediction , 2018, ACL.

[30]  Qiang Dong,et al.  Hownet and the Computation of Meaning: (With CD-ROM) , 2006 .

[31]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[32]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.