From Italian Text to TimeML Document via Dependency Parsing

This paper describes the first prototype for building TimeML XML documents from raw Italian text. First, the text is parsed with the TULE parser, a dependency parser developed at the University of Turin. The parsed text is then passed to a rule-based TimeML module that we have implemented, henceforth called "the converter". So far, the converter identifies and classifies the events in a sentence. The results are rather satisfactory, which leads us to support the use of dependency syntactic relations in the development of higher-level semantic tools.
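The two-stage pipeline described above (dependency parse, then rule-based event identification and classification) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Token` fields, the tag and relation labels, the two lemma lists, and the three rules do not reproduce the TULE output format or the converter's actual rule set. Only the `EVENT` element with its `eid` and `class` attributes and the class names follow the TimeML specification.

```python
from dataclasses import dataclass
from xml.sax.saxutils import escape


@dataclass
class Token:
    """One node of a dependency parse (hypothetical format, not TULE's)."""
    idx: int      # 1-based position in the sentence
    form: str     # surface form
    lemma: str    # lemma
    pos: str      # coarse part-of-speech tag
    head: int     # index of the governing token, 0 for the root
    deprel: str   # label of the relation to the head


# Illustrative lemma lists; a real system would use a much larger lexicon.
REPORTING_LEMMAS = {"dire", "dichiarare", "annunciare"}
ASPECTUAL_LEMMAS = {"iniziare", "cominciare", "finire", "continuare"}


def classify(token):
    """Return a TimeML event class for the token, or None if it is not an event."""
    if token.pos != "VERB":
        return None          # assumed rule 1: only verbs are considered here
    if token.deprel == "AUX":
        return None          # assumed rule 2: auxiliaries are not events
    if token.lemma in REPORTING_LEMMAS:
        return "REPORTING"
    if token.lemma in ASPECTUAL_LEMMAS:
        return "ASPECTUAL"
    return "OCCURRENCE"      # assumed rule 3: default class for other verbs


def to_timeml(tokens):
    """Wrap every detected event in an EVENT tag and return a TimeML string."""
    parts = []
    event_count = 0
    for token in tokens:
        event_class = classify(token)
        if event_class is None:
            parts.append(escape(token.form))
        else:
            event_count += 1
            parts.append(
                f'<EVENT eid="e{event_count}" class="{event_class}">'
                f"{escape(token.form)}</EVENT>"
            )
    return "<TimeML>" + " ".join(parts) + "</TimeML>"


# "Il ministro ha annunciato che i lavori inizieranno"
# (hand-written parse, used only to exercise the rules above)
SENTENCE = [
    Token(1, "Il", "il", "DET", 2, "DET"),
    Token(2, "ministro", "ministro", "NOUN", 4, "SUBJ"),
    Token(3, "ha", "avere", "VERB", 4, "AUX"),
    Token(4, "annunciato", "annunciare", "VERB", 0, "ROOT"),
    Token(5, "che", "che", "CONJ", 8, "MARK"),
    Token(6, "i", "il", "DET", 7, "DET"),
    Token(7, "lavori", "lavoro", "NOUN", 8, "SUBJ"),
    Token(8, "inizieranno", "iniziare", "VERB", 4, "OBJ"),
]

if __name__ == "__main__":
    print(to_timeml(SENTENCE))
```

In this sketch the auxiliary "ha" is skipped because of its dependency label rather than its surface form, which is the kind of decision that a dependency parse makes straightforward to express as a rule.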
