论文信息 - Domain Adaptation by Active Learning

Domain Adaptation by Active Learning

We tackled the Evalita 2011 Domain Adaptation task with a strategy of active learning. The DeSR parser can be configured to provide different measures of perplexity in its own ability to parse sentences correctly. After parsing sentences in the target domain, a small number of the sentences with the highest perplexity were selected, revised manually and added to the training corpus in order to build a new parser model incorporating some knowledge from the target domain. The process was repeated a few times for building a new training resource partially adapted to the target domain. Using the new resource we trained three stacked parsers, and their combination was used to produce the final results.

Maria Simi | Giuseppe Attardi | Andrea Zanelli

[1] Felice Dell'Orletta,et al. Domain Adaptation for Dependency Parsing at Evalita 2011 , 2011, EVALITA.

[2] Maria Simi,et al. Active Learning for Building a Corpus of Questions for Parsing , 2010, LREC.

[3] Felice Dell'Orletta,et al. Reverse Revision and Linear Tree Combination for Dependency Parsing , 2009, HLT-NAACL.

[4] Eugene Charniak,et al. Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[5] Stephen R. Clark,et al. CLSP WS-02 Final Report: Semi-Supervised Training for Statistical Parsing , 2003 .

[6] Giuseppe Attardi,et al. Experiments with a Multilanguage Non-Projective Dependency Parser , 2006, CoNLL.

[7] Joakim Nivre,et al. Deterministic Dependency Parsing of English Text , 2004, COLING.

[8] Yuji Matsumoto,et al. Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[9] Felice Dell'Orletta,et al. Accurate Dependency Parsing with a Stacked Multilayer Perceptron , 2009 .

[10] Maria Simi,et al. Tuning DeSR for Dependency Parsing of Italian , 2011, EVALITA.

[11] Eugene Charniak,et al. Automatic Domain Adaptation for Parsing , 2010, NAACL.