Chinese Syntax Parsing Based on Sliding Match of Semantic String

Different from the current syntax parsing based on deep learning, we present a novel Chinese parsing method, which is based on Sliding Match of Semantic String (SMOSS). (1) Training stage: In a treebank, headwords of tree nodes are represented by semantic codes given in the Synonym Dictionary (Tongyici Cilin). N-gram semantic templates are extracted from every layer of a syntax tree by means of sliding window to establish one N-gram semantic template library. (2) Parsing stage: Words of a sentence, including headwords of chunks, are represented by the semantic codes from Tongyici Cilin. With the sliding window method, N-gram semantic code strings are extracted to match with the templates in the N-gram semantic template library; subsequently, the mapping information of the matched templates is employed to guide the chunking of semantic code strings. The Chinese syntax parsing is completed through continuous matching and chunking. On the same training scale, N-gram semantic template can create favorable conditions for flexible matching and improve the syntax parsing performance. With train and test sets from the Tsinghua Chinese Treebank (TCT), the results are F1-score 99.71% (closed test) and F1-score 70.43% (open test), respectively.

[1]  WENLIANG CHEN,et al.  Improving Shift‐Reduce Phrase‐Structure Parsing with Constituent Boundary Information , 2017, Comput. Intell..

[2]  Yue Zhang,et al.  Enhancing Shift-Reduce Constituent Parsing with Action N-Gram Model , 2016, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[3]  Graeme Hirst,et al.  A Linear-Time Bottom-Up Discourse Parser with Constraints and Post-Editing , 2014, ACL.

[4]  Yue Zhang,et al.  Shift-Reduce Constituent Parsing with Neural Lookahead Features , 2016, TACL.

[5]  Mirella Lapata,et al.  Learning Contextually Informed Representations for Linear-Time Discourse Parsing , 2017, EMNLP.

[6]  Yufeng Chen,et al.  Improved Character-Based Chinese Dependency Parsing by Using Stack-Tree LSTM , 2018, NLPCC.

[7]  Hai Zhao,et al.  Probabilistic Graph-based Dependency Parsing with Convolutional Neural Network , 2016, ACL.

[8]  Houfeng Wang,et al.  Improving Chinese Dependency Parsing with Lexical Semantic Features , 2015, NLPCC.

[9]  Yue Zhang,et al.  Two Local Models for Neural Constituent Parsing , 2018, COLING.

[10]  Timothy Dozat,et al.  Deep Biaffine Attention for Neural Dependency Parsing , 2016, ICLR.

[11]  Jianfeng Gao,et al.  Bi-directional Attention with Agreement for Dependency Parsing , 2016, EMNLP.

[12]  Xiaoqing Zheng Incremental Graph-based Neural Dependency Parsing , 2017, EMNLP.

[13]  Yue Zhang,et al.  Character-Level Chinese Dependency Parsing , 2014, ACL.

[14]  James Cross,et al.  Incremental Parsing with Minimal Features Using Bi-Directional LSTM , 2016, ACL.

[15]  Enhong Chen,et al.  Stack-based Multi-layer Attention for Transition-based Dependency Parsing , 2017, EMNLP.

[16]  Noah A. Smith,et al.  Transition-Based Dependency Parsing with Stack Long Short-Term Memory , 2015, ACL.

[17]  Noah A. Smith,et al.  Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs , 2015, EMNLP.

[18]  Baobao Chang,et al.  Graph-based Dependency Parsing with Bidirectional LSTM , 2016, ACL.

[19]  Nianwen Xue,et al.  Feature Optimization for Constituent Parsing via Neural Networks , 2015, ACL.

[20]  Qun Liu,et al.  A subtree-based factorization of dependency parsing , 2016, COLING.

[21]  Steven P. Abney Partial parsing via finite-state cascades , 1996, Natural Language Engineering.

[22]  Dan Klein,et al.  Neural CRF Parsing , 2015, ACL.

[23]  Hai Zhao,et al.  Seq2seq Dependency Parsing , 2018, COLING.

[24]  Lingling Mu,et al.  Research on Chinese Parsing Based on the Improved Compositional Vector Grammar , 2015, CLSW.

[25]  Slav Petrov,et al.  Improved Transition-Based Parsing and Tagging with Neural Networks , 2015, EMNLP.

[26]  Chris Dyer,et al.  Transition-Based Dependency Parsing with Heuristic Backtracking , 2016, EMNLP.

[27]  Yoshua Bengio,et al.  Straight to the Tree: Constituency Parsing with Neural Syntactic Distance , 2018, ACL.

[28]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[29]  Fuxiang Wu Dependency Parsing with Transformed Feature , 2017, Inf..

[30]  Yue Zhang,et al.  In-Order Transition-based Constituent Parsing , 2017, TACL.

[31]  Mirella Lapata,et al.  Dependency Parsing as Head Selection , 2016, EACL.

[32]  Nianwen Xue,et al.  Joint POS Tagging and Transition-based Constituent Parsing in Chinese with Non-local Features , 2014, ACL.

[33]  Ji Ma,et al.  SyntaxNet Models for the CoNLL 2017 Shared Task , 2017, ArXiv.

[34]  Noah A. Smith,et al.  Recurrent Neural Network Grammars , 2016, NAACL.

[35]  Taro Watanabe,et al.  Transition-based Neural Constituent Parsing , 2015, ACL.

[36]  Eliyahu Kiperwasser,et al.  Easy-First Dependency Parsing with Hierarchical Tree LSTMs , 2016, TACL.

[37]  Haitao Mi,et al.  Shift-Reduce Constituency Parsing with Dynamic Programming and POS Tag Lattice , 2015, NAACL.

[38]  Joachim Bingel,et al.  Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics , 2016 .

[39]  Eliyahu Kiperwasser,et al.  Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations , 2016, TACL.

[40]  Sophia Ananiadou,et al.  Fast Full Parsing by Linear-Chain Conditional Random Fields , 2009, EACL.

[41]  Weiwei Sun,et al.  Parsing Chinese Sentences with Grammatical Relations , 2019, CL.

[42]  Regina Barzilay,et al.  Randomized Greedy Inference for Joint Segmentation, POS Tagging and Dependency Parsing , 2015, HLT-NAACL.

[43]  Carlos Gómez-Rodríguez,et al.  Faster Shift-Reduce Constituent Parsing with a Non-Binary, Bottom-Up Strategy , 2018, Artif. Intell..

[44]  Yu Zhao,et al.  Complete Syntactic Analysis Bases on Multi-level Chunking , 2010, CIPS-SIGHAN.

[45]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[46]  Daisuke Kawahara,et al.  Neural Joint Model for Transition-based Chinese Syntactic Analysis , 2017, ACL.

[47]  Hai Zhao,et al.  Neural Character-level Dependency Parsing for Chinese , 2018, AAAI.