Chinese base phrases chunking based on latent semi-CRF model

In the fields of Chinese natural language processing, recognizing simple and non-recursive base phrases is an important task for natural language processing applications, such as information processing and machine translation. Instead of rule-based model, we adopt the statistical machine learning method, newly proposed Latent semi-CRF model to solve the Chinese base phrase chunking problem. The Chinese base phrases could be treated as the sequence labeling problem, which involve the prediction of a class label for each frame in an unsegmented sequence. The Chinese base phrases have sub-structures which could not be observed in training data. We propose a latent discriminative model called Latent semi-CRF(Latent Semi Conditional Random Fields), which incorporates the advantages of LDCRF(Latent Dynamic Conditional Random Fields) and semi-CRF that model the sub-structure of a class sequence and learn dynamics between class labels, in detecting the Chinese base phrases. Our results demonstrate that the latent dynamic discriminative model compares favorably to Support Vector Machines, Maximum Entropy Model, and Conditional Random Fields(including LDCRF and semi-CRF) on Chinese base phrases chunking.

[1]  Xiao Sun,et al.  Extended super function based Chinese Japanese machine translation , 2009, 2009 International Conference on Natural Language Processing and Knowledge Engineering.

[2]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[3]  Xiao Sun,et al.  Detecting New Words from Chinese Text Using Latent Semi-CRF Models , 2010, IEICE Trans. Inf. Syst..

[4]  William W. Cohen,et al.  Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.

[5]  Xu Sun,et al.  Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference , 2008, COLING.

[6]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[7]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[8]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[9]  Dan Klein,et al.  Discriminative Log-Linear Grammars with Latent Variables , 2007, NIPS.

[10]  Jun'ichi Tsujii,et al.  Probabilistic CFG with Latent Annotations , 2005, ACL.

[11]  Phil Blunsom,et al.  A Discriminative Latent Variable Model for Statistical Machine Translation , 2008, ACL.

[12]  Tong Zhang,et al.  A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , 2005, J. Mach. Learn. Res..

[13]  Sabine Buchholz,et al.  Introduction to the CoNLL-2000 Shared Task Chunking , 2000, CoNLL/LLL.

[14]  Wojciech Skut,et al.  A Linguistically Interpreted Corpus of German Newspaper Text , 1998, LREC.

[15]  Trevor Darrell,et al.  Latent-Dynamic Discriminative Models for Continuous Gesture Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Ren Fuji Super-function based machine translation , 1999 .

[17]  Xiao Sun,et al.  Dual-chain Unequal-state CRF for Chinese new word detection and POS tagging , 2008, 2008 International Conference on Natural Language Processing and Knowledge Engineering.

[18]  Qiang Zhou,et al.  Chinese Base-Phrases Chunking , 2002, SIGHAN@COLING.

[19]  Xu Sun,et al.  Sequential Labeling with Latent Variables: An Exact Inference Algorithm and its Efficient Approximation , 2009, EACL.

[20]  Heng Li,et al.  Transductive HMM based Chinese text chunking , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[21]  Satoshi Sato,et al.  Fast Base NP Chunking with Decision Trees - Experiments on Different POS Tag Settings , 2003, CICLing.

[22]  Changning Huang,et al.  A Quasi-Dependency Model for Structural Analysis of Chinese BaseNPs , 1998, COLING-ACL.

[23]  Tiejun Zhao,et al.  Statistics Based Hybrid Approach to Chinese Base Phrase Identification , 2000, ACL 2000.