Annotation of clausal functional information for semantic retrieval

The study of language functions is closely tied to the semantic and pragmatic aspects of language. While data-driven approaches have been applied successfully to the retrieval of functional-semantic information at the discourse level, comparable work at the clause level is still largely absent. In this paper, we annotate an initial corpus using Systemic Functional Linguistics (SFL), a prominent framework for analyzing language functions at the sentence/clause level. The annotated corpus makes it possible to train a classifier that automatically identifies functional processes at the clause level. With this initial computational resource, functional information at the clause level and at the discourse level can be linked and made to interoperate, opening up a range of potential applications in functional/semantic retrieval.
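To make the clause-level classification task concrete, the following is a minimal sketch only. The abstract does not specify the features, the learning algorithm, or the label inventory used, so everything here is an assumption: the toy clauses in `TRAIN` are hand-made and not drawn from the annotated corpus, the labels are four of the process types commonly distinguished in SFL, and a bag-of-words Naive Bayes model stands in for whatever classifier the authors trained. The names `train` and `classify` are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hand-made toy clauses labeled with SFL process types.
# Illustrative assumption only: not taken from the paper's corpus.
TRAIN = [
    ("she kicked the ball", "material"),
    ("he built a house", "material"),
    ("they opened the door", "material"),
    ("she knows the answer", "mental"),
    ("he likes the music", "mental"),
    ("they believe the story", "mental"),
    ("the house is old", "relational"),
    ("she is a teacher", "relational"),
    ("the answer seems right", "relational"),
    ("he said the words", "verbal"),
    ("she told a story", "verbal"),
    ("they asked a question", "verbal"),
]


def train(examples):
    """Collect per-class clause counts, per-class token counts, and the vocabulary."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        class_counts[label] += 1
        for tok in text.lower().split():
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_counts, word_counts, vocab


def classify(model, clause):
    """Return the most probable process type under add-one-smoothed Naive Bayes."""
    class_counts, word_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        # Log prior for the class.
        score = math.log(class_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in clause.lower().split():
            if tok in vocab:  # tokens never seen in training are ignored
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    model = train(TRAIN)
    for clause in ["he kicked the door", "she is old", "she told a question"]:
        print(clause, "->", classify(model, clause))
```

A realistic system would presumably draw on syntactic features from a parser rather than surface tokens alone; the sketch only shows the shape of the task, mapping a clause to a process-type label.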
