论文信息 - TimeBank-Driven TimeML Analysis

TimeBank-Driven TimeML Analysis

The design of TimeML as an expressive language for temporal information brings promises, and challenges; in particular, its representa- tional properties raise the bar for traditional information extraction meth- ods applied to the task of text-to-TimeML analysis. A reference corpus, such as TimeBank, is an invaluable asset in this situation; however, certain characteristics of TimeBank—size and consistency, primarily—present chal- lenges of their own. We discuss the design, implementation, and perfor- mance of an automatic TimeML-compliant annotator, trained on TimeBank, and deploying a hybrid analytical strategy of mixing aggressive finite- state processing over linguistic annotations with a state-of-the-art ma- chine learning technique capable of leveraging large amounts of unan- notated data. The results we report are encouraging in the light of a close analysis of TimeBank; at the same time they are indicative of the need for more infrastructure work, especially in the direction of creating a larger and more robust reference corpus. 1

Branimir Boguraev | Rie Kubota Ando | R. Ando | B. Boguraev

[1] James Pustejovsky,et al. Introduction to the special issue on temporal information processing , 2004, TALIP.

[2] James Pustejovsky,et al. Automating Temporal Annotation with TARSQI , 2005, ACL.

[3] Branimir Boguraev,et al. Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser , 1996, COLING.

[4] Alon Lavie,et al. A framework for resolution of time in natural language , 2004, TALIP.

[5] Tong Zhang,et al. Text Chunking based on a Generalization of Winnow , 2002, J. Mach. Learn. Res..

[6] Tong Zhang,et al. Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[7] Rie Kubota Ando,et al. Exploiting Unannotated Corpora for Tagging and Chunking , 2004, ACL.

[8] Xiaoqiang Luo,et al. A Statistical Model for Multilingual Entity Detection and Tracking , 2004, NAACL.

[9] Tong Zhang,et al. A Robust Risk Minimization based Named Entity Recognition System , 2003, CoNLL.

[10] James Pustejovsky,et al. TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[11] R. Fikes,et al. JTP : A System Architecture and Component Library for Hybrid Reasoning , 2003 .

[12] Jerry R. Hobbs,et al. An ontology of time for the semantic web , 2004, TALIP.

[13] James Pustejovsky,et al. Annotating and Reasoning about Time and Events , 2005, The Language of Time - A Reader.