DCU at MediaEval 2011: Rich Speech Retrieval
We describe our runs and results for the Rich Speech Retrieval (RSR) Task at MediaEval 2011. Our runs examine three factors: alternative segmentation methods applied to the provided ASR transcripts to locate the beginning of a topic, on the assumption that this will coincide with, or lie close to, the starting point of the relevant segment; combinations of various query types and weighting of metadata to move the relevant segment higher in the ranked list; and different ASR transcripts, to compare the influence of transcript quality. Our results show that newer versions of the transcripts and the use of metadata produce better results on average. So far we have not used information about the illocutionary act type corresponding to each query, but analysis of the retrieval results shows differences in behaviour for queries associated with certain classes of act.
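The overall idea of segmenting transcripts, ranking the segments, and returning each segment's start time as a jump-in point can be illustrated with a minimal sketch. It is not the system used in the runs: the fixed-length sliding window stands in for the segmentation methods, raw term counts stand in for the retrieval model, and all names (`segment_fixed`, `score`, `rank_jump_in_points`, `metadata_weight`) and the toy data are hypothetical.

```python
from collections import Counter


def segment_fixed(words, size=4, step=2):
    """Split a timed transcript [(word, start_time), ...] into overlapping windows.

    Assumption: a fixed-length sliding window, used here only as a simple
    stand-in for a real topic segmentation method.
    """
    if not words:
        return []
    segments = []
    for i in range(0, max(len(words) - size, 0) + 1, step):
        window = words[i:i + size]
        segments.append({
            "start": window[0][1],  # start time of the first word in the window
            "tokens": [w.lower() for w, _ in window],
        })
    return segments


def score(query, segment_tokens, metadata_tokens, metadata_weight=0.5):
    """Count query-term matches in the segment plus weighted matches in the metadata."""
    seg = Counter(segment_tokens)
    meta = Counter(metadata_tokens)
    return sum(seg[t] + metadata_weight * meta[t] for t in query)


def rank_jump_in_points(query, videos, metadata_weight=0.5):
    """Return (video_id, start_time) pairs, best-scoring segment first."""
    scored = []
    for video_id, video in videos.items():
        for seg in segment_fixed(video["words"]):
            s = score(query, seg["tokens"], video["metadata"], metadata_weight)
            scored.append((s, video_id, seg["start"]))
    # Highest score first; ties broken by video id, then by earlier start time.
    scored.sort(key=lambda x: (-x[0], x[1], x[2]))
    return [(video_id, start) for _, video_id, start in scored]


# Toy data, invented purely for illustration.
videos = {
    "v1": {
        "words": [("welcome", 0.0), ("to", 0.5), ("the", 0.8), ("cooking", 1.0),
                  ("show", 1.5), ("today", 2.0), ("we", 2.4), ("bake", 2.8)],
        "metadata": ["cooking", "bread"],
    },
    "v2": {
        "words": [("this", 0.0), ("talk", 0.4), ("covers", 0.9), ("speech", 1.3),
                  ("retrieval", 1.8), ("and", 2.2), ("bread", 2.5), ("prices", 3.0)],
        "metadata": ["economics"],
    },
}

results = rank_jump_in_points(["bake", "bread"], videos)
```

In this toy setting, raising `metadata_weight` lifts every segment of a video whose metadata matches the query, which is one simple way to picture how metadata weighting can move a relevant segment up the ranked list.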