Chinese Time Expression Recognition Based on Automatically Generated Basic-Time-Unit Rules

This paper proposes a generic algorithm for Time Expression Recognition(TER) task based on regular expressions.The algorithm generates rules based on "Basic Time Unit",which improves the recall value.And it prunes the rule collection through error driven method and reduces the "noise" taken from training corpus,which leads to a high precision.The two features jointlyimprove the overall efficiency of our method compared to the baseline system: with a significant better performance of up to 89.9% F-score on ACE07 Chinese Corpus.In addition,the proposed algorithm has good adaptablility and scalability for a broader application.