A Hybrid Approach for Chinese Named Entity Recognition in Music Domain

The amount of music information available on the Web is rapidly increasing. There is a pressing need for music information extraction. To extract useful information from natural language text, we must recognize music named entities first. This paper introduces a hybrid method to identify the Chinese named entities in music domain. Recently, machine learning approaches are frequently used to solve Name Entity Recognition (NER). So our method uses a Hidden Markov Model (HMM) as the underlying method. Since HMM has innate weaknesses, we incorporate it with rule-based method for pre-processing and post-processing. The combination of machine learning method and rule-based method results in a high precision recognition. And we improve both training and recognizing process of HMM for Music Named Entity Recognition (MNER). In this paper, a novel and convenient Musical Name Entity (MNE) tagging method to generate training data is proposed, which makes HMM method practically usable. In addition, we present an effective method of unknown words tagging in recognition. The experimental results show that our framework brings significant improvements for solving MNER.

[1]  Ferran Plà,et al.  Improving part-of-speech tagging using lexicalized HMMs , 2004, Natural Language Engineering.

[2]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[3]  Ralph Grishman,et al.  The NYU System for MUC-6 or Where’s the Syntax? , 1995, MUC.

[4]  Hui Zhang,et al.  Chinese Named Entity Recognition with CRFs: Two Levels , 2008, 2008 International Conference on Computational Intelligence and Security.

[5]  Jian Sun,et al.  A Class-based Language Model Approach to Chinese Named Entity Identification , 2003, Int. J. Comput. Linguistics Chin. Lang. Process..

[6]  Richard M. Schwartz,et al.  An Algorithm that Learns What's in a Name , 1999, Machine Learning.

[7]  Tao Zhang,et al.  Research on Me-based Chinese NER model , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[8]  Ralph Grishman,et al.  A Maximum Entropy Approach to Named Entity Recognition , 1999 .

[9]  Hideki Isozaki,et al.  Efficient Support Vector Classifiers for Named Entity Recognition , 2002, COLING.

[10]  Wei Li,et al.  Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[11]  Guohong Fu,et al.  Chinese named entity recognition using lexicalized HMMs , 2005, SKDD.

[12]  A. Mamat,et al.  A New Fuzzy Support Vector Machine Method for Named Entity Recognition , 2008, 2008 International Conference on Computer Science and Information Technology.