论文信息 - Creating a Data Collection for Evaluating Rich Speech Retrieval

Creating a Data Collection for Evaluating Rich Speech Retrieval

We describe the development of a test collection for the investigation of speech retrieval beyond identification of relevant content. This collection focuses on satisfying user information needs for queries associated with specific types of speech acts. The collection is based on an archive of the Internet video from Internet video sharing platform (blip.tv), and was provided by the MediaEval benchmarking initiative. A crowdsourcing approach was used to identify segments in the video data which contain speech acts, to create a description of the video containing the act and to generate search queries designed to refind this speech act. We describe and reflect on our experiences with crowdsourcing this test collection using the Amazon Mechanical Turk platform. We highlight the challenges of constructing this dataset, including the selection of the data source, design of the crowdsouring task and the specification of queries and relevant items.

Martha Larson | Maria Eskevich | Gareth J. F. Jones | Roeland Ordelman

[1] Ellen M. Voorhees,et al. The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[2] Matthew Lease,et al. Crowdsourcing Document Relevance Assessment with Mechanical Turk , 2010, Mturk@HLT-NAACL.

[3] Alexander I. Rudnicky,et al. Using the Amazon Mechanical Turk for transcription of spoken language , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4] Ian R. Lane,et al. Tools for Collecting Speech Corpora via Mechanical-Turk , 2010, Mturk@HLT-NAACL.

[5] Mohammad Soleymani,et al. Automatic tagging and geotagging in video collections and communities , 2011, ICMR.

[6] Brendan T. O'Connor,et al. Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[7] Omar Alonso,et al. Crowdsourcing for relevance evaluation , 2008, SIGF.

[8] Jean-Luc Gauvain,et al. Speech Processing for Audio Indexing , 2008, GoTAL.

[9] Chris Callison-Burch,et al. Creating Speech and Language Data With Amazon’s Mechanical Turk , 2010, Mturk@HLT-NAACL.

[10] Klaus Zechner,et al. Using Amazon Mechanical Turk for Transcription of Non-Native Speech , 2010, Mturk@HLT-NAACL.

[11] Martha Larson,et al. Overview of MediaEval 2011 Rich Speech Retrieval Task and Genre Tagging Task , 2011, MediaEval.

[12] Ryen W. White,et al. Overview of the CLEF-2006 Cross-Language Speech Retrieval Track , 2006, CLEF.