Dependency-Based PropBanking of Clinical Finnish

In this paper, we present a PropBank of clinical Finnish, an annotated corpus of verbal propositions and arguments. The clinical PropBank is created on top of a previously existing dependency treebank annotated in the Stanford Dependency (SD) scheme and covers 90% of all verb occurrences in the treebank. We establish that the PropBank scheme is applicable to clinical Finnish as well as compatible with the SD scheme, with an overwhelming proportion of arguments being governed by the verb. This allows argument candidates to be restricted to direct verb dependents, substantially simplifying the PropBank construction. The clinical Finnish PropBank is freely available at the address http://bionlp.utu.fi.

[1]  Tapio Salakoski,et al.  Parsing Clinical Finnish: Experiments with Rule-Based and Statistical Dependency Parsers , 2009, NODALIDA.

[2]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[3]  Richard Johansson,et al.  The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple Languages , 2009, CoNLL Shared Task.

[4]  Michael Krauthammer,et al.  Shallow Semantic Parsing of Randomized Controlled Trial Reports , 2006, AMIA.

[5]  Xavier Carreras,et al.  Introduction to the CoNLL-2004 Shared Task: Semantic Role Labeling , 2004, CoNLL.

[6]  Tapio Salakoski,et al.  Towards automated processing of clinical Finnish: Sublanguage analysis and a rule-based parser , 2009, Int. J. Medical Informatics.

[7]  Ann Bies,et al.  A Pilot Arabic Propbank , 2008, LREC.

[8]  Richard Johansson,et al.  The CoNLL 2008 Shared Task on Joint Parsing of Syntactic and Semantic Dependencies , 2008, CoNLL.

[9]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[10]  Carol Friedman,et al.  Natural Language and Text Processing in Biomedicine , 2006 .

[11]  Nianwen Xue,et al.  Annotating the Propositions in the Penn Chinese Treebank , 2003, SIGHAN.

[12]  Christopher D. Manning,et al.  Stanford typed dependencies manual , 2010 .

[13]  Josef Ruppenhofer,et al.  FrameNet II: Extended theory and practice , 2006 .

[14]  Martha Palmer,et al.  Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[15]  Christopher D. Manning,et al.  The Stanford Typed Dependencies Representation , 2008, CF+CDPE@COLING.

[16]  Xavier Carreras,et al.  Introduction to the CoNLL-2005 Shared Task: Semantic Role Labeling , 2005, CoNLL.

[17]  Wayne H. Ward,et al.  Towards Temporal Relation Discovery from the Clinical Narrative , 2009, AMIA.