Chinese Base NP Chunking by Error-driven Combination Classifiers

This paper proposes a hybrid error-driven combination approach to chunking Chinese Base noun phrase(Chinese Base NP),which combines TBL(Transformation-based Learning) model and CRF(Conditional Random Field) model.First,we give an overview of the Chinese and English Base NP chunking,followed by a description of the Chinese Base NP chunking task.In order to analyze the results respectively from the two(TBL-based and CRF-based) classifiers and improve the performance of the Base NP chunkers,an error-driven SVM(Support Vector Machine) based classifier is trained from the classification errors of the two classifiers.According to our experiments,the hybrid method achieves the best results with F-measure of 89.72% and improves by 2.35% in the best case compared with other methods.

[1]  Tong Zhang,et al.  A High-Performance Semi-Supervised Learning Method for Text Chunking , 2005, ACL.

[2]  Qiang Zhou,et al.  Chinese Base-Phrases Chunking , 2002, SIGHAN@COLING.

[3]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[4]  Hitoshi Isahara,et al.  An Empirical Study of Chinese Chunking , 2006, ACL.

[5]  Heng Li,et al.  Transductive HMM based Chinese text chunking , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[6]  Yuji Matsumoto,et al.  Chunking with Support Vector Machines , 2001, NAACL.

[7]  Robert C. Berwick,et al.  Principle-Based Parsing , 1987 .

[8]  Jian Su,et al.  Hybrid Text Chunking , 2000, CoNLL/LLL.

[9]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[10]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[11]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[12]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[13]  Changning Huang,et al.  A Quasi-Dependency Model for Structural Analysis of Chinese BaseNPs , 1998, COLING-ACL.

[14]  Tong Zhang,et al.  Text Chunking using Regularized Winnow , 2001, ACL.

[15]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.