Product named entity recognition for Chinese query questions based on a skip-chain CRF model

As more and more commercial information can be obtained from the Internet, product named entity recognition plays an important role in market intelligence management. In this paper, a product named entity recognition method based on a skip-chain CRF model is proposed. This method considers not only the dependence between neighboring words but also the fact that product named entities are often connected by a connective. In this situation, the dependence between the words around the connective is more important than the dependence between neighboring words. This information improves the result of product named entity recognition as shown in the experiments. Experimental results on corpuses of mobile phone and digital camera demonstrate that the skip-chain CRF model works well and produces better results than the linear-chain CRF model.

[1]  Herbert E. Krugman Public Attitudes toward the Apollo Space Program, 1965-1975. , 1977 .

[2]  Shi Shui-cai,et al.  Chinese named entity identification using cascaded hidden Markov model , 2006 .

[3]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields for Relational Learning , 2007 .

[4]  Cheng Niu,et al.  A Bootstrapping Approach to Named Entity Classification Using Successive Learners , 2003, ACL.

[5]  Liu Fei-fan,et al.  Study on Product Named Entity Recognition for Business Information Extraction , 2006 .

[6]  Eckhard Bick A Named Entity Recognizer for Danish , 2004, LREC.

[7]  John M. Pierre Mining Knowledge from Text Collections Using Automatically Generated Metadata , 2002, PAKM.

[8]  Andrew McCallum,et al.  Conditional Models of Identity Uncertainty with Application to Noun Coreference , 2004, NIPS.

[9]  Han Xiao,et al.  Product Named Entity Recognition Using Conditional Random Fields , 2011, 2011 Fourth International Conference on Business Intelligence and Financial Engineering.

[10]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[11]  Feifan Liu,et al.  Product named entity recognition in Chinese text , 2008, Lang. Resour. Evaluation.

[12]  Ben Taskar,et al.  Introduction to statistical relational learning , 2007 .

[13]  Qun Liu,et al.  HHMM-based Chinese Lexical Analyzer ICTCLAS , 2003, SIGHAN.

[14]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.