A Chinese name identifying system based on inverse name frequency model and rules
Processing Chinese names is important for Chinese word segmentation and automatic abstracting. In this paper we put forward an inverse name frequency model. Based on this model, together with context patterns, adjacent chains, a special name table, and position-dependent information, we designed an effective system for automatically identifying Chinese names in text. This paper describes the system's algorithm, and the experimental results show that it achieves high recall and precision: the recall rate reaches 93.75% and the precision rate reaches 83.95%.