A Baseline Approach for Detecting Sentences Containing Uncertainty
暂无分享,去创建一个
We apply a baseline approach to the CoNLL-2010 shared task data sets on hedge detection. Weights have been assigned to cue words marked in the training data based on their occurrences in certain and uncertain sentences. New sentences received scores that correspond with those of their best scoring cue word, if present. The best acceptance scores for uncertain sentences were determined using 10-fold cross validation on the training data. This approach performed reasonably on the shared task's biological (F=82.0) and Wikipedia (F=62.8) data sets.
[1] Alexander S. Yeh,et al. More accurate tests for the statistical significance of result differences , 2000, COLING.
[2] Ben Medlock,et al. Exploring hedge identification in biomedical literature , 2008, J. Biomed. Informatics.
[3] János Csirik,et al. The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.