Integration of Multiple Classifiers for Chinese Semantic Dependency Analysis

Semantic Dependency Analysis (SDA) has extensive applications in Natural Language Processing (NLP). In this paper, an integration of multiple classifiers is presented for SDA of Chinese. A Naive Bayesian Classifier, a Decision Tree and a Maximum Entropy classifier are used in a majority wins voting scheme. A portion of the Penn Chinese Treebank was manually annotated with semantic dependency structure. Then each of the three classifiers was trained on the same training data. All three of the classifiers were used to produce candidate relations for test data and the candidate relation that had the majority vote was chosen. The proposed approach achieved an accuracy of 86% in experimentation, which shows that the proposed approach is a promising one for semantic dependency analysis of Chinese.

[1]  Nianwen Xue,et al.  Automatic Semantic Role Labeling for Chinese Verbs , 2005, IJCAI.

[2]  F. Ren,et al.  Automatically determining semantic relations in Chinese sentences , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[3]  Nianwen Xue,et al.  Annotating the Propositions in the Penn Chinese Treebank , 2003, SIGHAN.

[4]  Shingo Kuroiwa,et al.  A Machine Learning Approach to Determine Semantic Dependency Structure in Chinese , 2006, FLAIRS Conference.

[5]  Keh-Jiann Chen,et al.  中文句結構樹資料庫的構建 (Sinica Treebank) [In Chinese] , 1999, ROCLING/IJCLCLP.

[6]  Charles J. Fillmore,et al.  The FrameNet tagset for frame-semantic and syntactic coding of predicate-argument structure , 2000, ANLP.

[7]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[8]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[9]  Chu-Ren Huang,et al.  Sinica Treebank: Design Criteria, Annotation Guidelines, and On-line Interface , 2000, ACL 2000.

[10]  Shingo Kuroiwa,et al.  SEEN: a semantic dependency analyzer for Chinese , 2006 .

[11]  Alessandro Moschitti,et al.  Towards Free-text Semantic Parsing: A Unified Framework Based on FrameNet, VerbNet and PropBank , 2006, Learning Structured Information@EACL.

[12]  Jane J. Robinson Dependency Structures and Transformational Rules , 1970 .

[13]  Adwait Ratnaparkhi,et al.  A Simple Introduction to Maximum Entropy Models for Natural Language Processing , 1997 .

[14]  Nianwen Xue,et al.  Developing Guidelines and Ensuring Consistency for Chinese Text Annotation , 2000, LREC.

[15]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[16]  Juan-Zi Li,et al.  Building a Large Chinese Corpus Annotated with Semantic Dependency , 2003, SIGHAN.

[17]  Ben Hutchinson,et al.  Dependency-based semantic interpretation for answer extraction , 2002 .

[18]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[19]  Basilio Sierra,et al.  A Multiclassifier based Document Categorization System: profiting from the Singular Value Decomposition Dimensionality Reduction Technique , 2006, Learning Structured Information@EACL.

[20]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[21]  Keh-Jiann Chen,et al.  Automatic Semantic Role Assignment for a Tree Structure , 2004, SIGHAN@ACL.

[22]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[23]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[24]  Kok Wee Gan,et al.  Annotating Information Structures in Chinese Texts Using HowNet , 2000, ACL 2000.

[25]  Heidi Fox,et al.  Dependency-Based Statistical Machine Translation , 2005, ACL.

[26]  Torsten Rohlfing,et al.  Performance-based multi-classifier decision fusion for atlas-based segmentation of biomedical images , 2004, 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821).