LARGE SCALE EXPERIMENTS WITH FUNCTION TAGGING

We present large-scale experiments with two decision-tree-based approaches to the task of function tagging. Function tagging involves labeling certain nodes in an input parse tree with a set of functional marks such as logical subject, predicate, etc. In the first approach, we consider only nodes that are labeled with a functional tag. In the second approach, all nodes are considered, whether they are labeled with function tags or not; the non-labeled nodes are simply treated as being labeled with the generic tag NON-F. The results obtained on a standard data set significantly outperform a baseline approach in which the most frequent tag is assigned.
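The difference between the two approaches, and the most-frequent-tag baseline, can be illustrated with a minimal sketch. This is not the implementation used in the experiments: the toy node list, the helper names (`tagged_only`, `all_nodes`, `most_frequent_tag_baseline`), and the choice of example tags are assumptions made purely for illustration, and no decision tree is trained here.

```python
from collections import Counter

NON_F = "NON-F"  # generic tag for nodes that carry no function tag

# Hypothetical toy data: (syntactic category, function tag or None).
# The tags shown (SBJ, LOC, TMP) are only illustrative examples.
nodes = [
    ("NP", "SBJ"), ("VP", None), ("NP", None), ("PP", "LOC"),
    ("NP", "SBJ"), ("ADVP", "TMP"), ("VP", None), ("NP", None),
    ("S", None), ("PP", None),
]

def tagged_only(nodes):
    """First approach: keep only nodes that carry a function tag."""
    return [(cat, tag) for cat, tag in nodes if tag is not None]

def all_nodes(nodes):
    """Second approach: keep every node, labeling untagged ones NON-F."""
    return [(cat, tag if tag is not None else NON_F) for cat, tag in nodes]

def most_frequent_tag_baseline(examples):
    """Return the most frequent tag and the accuracy of always predicting it."""
    counts = Counter(tag for _, tag in examples)
    tag, freq = counts.most_common(1)[0]
    return tag, freq / len(examples)

first = tagged_only(nodes)
second = all_nodes(nodes)

print(most_frequent_tag_baseline(first))
print(most_frequent_tag_baseline(second))
```

On this toy data the baseline tag differs between the two settings, since NON-F dominates once untagged nodes are included. Any baseline comparison therefore has to be made within the same setting.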
