Chinese Temporal Expression Recognition Combining Rules with a Statistical Model

Traditional rule-based methods for recognizing Chinese temporal expressions present a lower recall rate and they cannot recognize the event-type Chinese temporal expressions, thus, we propose a new Chinese temporal expression recognition method through combining rules with a statistical model. Firstly, we divide Chinese temporal expressions into seven categories and use basic time units as the smallest unit of recognition to simplify the complexity of rule-making. Then, we use regular rules to recognize Chinese temporal expressions and label the training data automatically. Meanwhile, we label the event-type temporal expressions that rule-based method cannot recognize. Lastly, we use the labeled training data to learn a Conditional Random Fields model for Chinese temporal expression recognition. Experimental results show that our proposed method significantly reduces the amount of annotation work and effectively improves the recognition performance. The F1 value reaches 88.73%, which is higher than the rule-based method by 6.13%.

[1]  Ming Zhou,et al.  Event Detection with Burst Information Networks , 2016, COLING.

[2]  吴琼,et al.  Temporal Information Extraction Based on CRF and Time Thesaurus , 2014 .

[3]  Zhu Fang,et al.  Chinese Temporal Phrase Recognition Based on Conditional Random Fields , 2011 .

[4]  Wanxiang Che,et al.  LTP: A Chinese Language Technology Platform , 2010, COLING.

[5]  Yuan Chunfa Automatic TIMEX2 tagging of Chinese temporal information , 2008 .

[6]  Luke S. Zettlemoyer,et al.  Deep Semantic Role Labeling: What Works and What’s Next , 2017, ACL.

[7]  Wu Li-de Chinese Time Expression Recognition Based on Automatically Generated Basic-Time-Unit Rules , 2010 .

[8]  John B. Moore,et al.  Hidden Markov Models: Estimation and Control , 1994 .

[9]  Jeffrey M. Zacks,et al.  Event structure in perception and conception. , 2001, Psychological bulletin.

[10]  Mao Xiao-li Chinese time expression recognition based on semantic role , 2011 .

[11]  James Pustejovsky,et al.  ISO-TimeML: An International Standard for Semantic Annotation , 2010, LREC.

[12]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[13]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[14]  Utkarsh Upadhyay,et al.  Recurrent Marked Temporal Point Processes: Embedding Event History to Vector , 2016, KDD.

[15]  Li Sheng Recognizing the Extent of Chinese Time Expressions Based on the Dependency Parsing and Error-Driven Learning , 2007 .

[16]  Inderjeet Mani,et al.  2003 Standard for the Annotation of Temporal Expressions , 2004 .