论文信息 - The Search and Hyperlinking Task at MediaEval 2013

The Search and Hyperlinking Task at MediaEval 2013

We describe the runs for our participation in the Search sub-task of the Search and Hyperlinking Task at MediaEval 2013. Our experiments investigate the aect of using information about speech segment boundaries and pauses on the effectiveness of retrieving jump-in points within the retrieved segments. We segment all three available types of transcripts (automatic ones provided by LIMSI/Vocapia and LIUM, and manual subtitles provided by BBC) into fixed-length time units, and present the resulting runs using the original segment starts and using the potential jump-in points. Our method for adjustment of the jump-in points achieves higher scores for all LIMSI/Vocapia, LIUM, and subtitles based runs.

Maria Eskevich | Gareth J. F. Jones | Roeland Ordelman | Robin Aly | Shu Chen

[1] Jean-Luc Gauvain. The Quaero program: multilingual and multimedia technologies , 2010, IWSLT.

[2] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[3] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[4] Cordelia Schmid,et al. The AXES research video search system , 2014, ICASSP 2014.

[5] Mark J. F. Gales,et al. Automatic Transcription of Multi-genre Media Archives , 2013, SLAM@INTERSPEECH.

[6] Ben He,et al. Terrier : A High Performance and Scalable Information Retrieval Platform , 2022 .

[7] Björn W. Schuller,et al. Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.

[8] Martha Larson,et al. Search and Hyperlinking Task at MediaEval 2012 , 2012, MediaEval.

[9] Jean-Luc Gauvain,et al. Speech Processing for Audio Indexing , 2008, GoTAL.

[10] Maria Eskevich,et al. New Metrics for Meaningful Evaluation of Informally Structured Speech Retrieval , 2012, ECIR.

[11] Rik Van de Walle,et al. Multimedia information seeking through search and hyperlinking , 2013, ICMR.

[12] Bertrand Chupeau,et al. A Video Fingerprint Based on Visual Digest and Local Fingerprints , 2006, 2006 International Conference on Image Processing.

[13] Martha Larson,et al. Overview of MediaEval 2011 Rich Speech Retrieval Task and Genre Tagging Task , 2011, MediaEval.

[14] Paul Deléglise,et al. Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks , 2014, LREC.

[15] Maria Eskevich,et al. Linking inside a video collection: what and how to measure? , 2013, WWW.

[16] Gareth J. F. Jones,et al. Overview of the CLEF-2005 Cross-Language Speech Retrieval Track , 2005, CLEF.

[17] Tinne Tuytelaars,et al. A Testbed for Cross-Dataset Analysis , 2014, ECCV Workshops.

[18] Cordelia Schmid,et al. Unsupervised metric learning for face identification in TV video , 2011, 2011 International Conference on Computer Vision.

[19] Maria Eskevich,et al. Adapting Binary Information Retrieval Evaluation Metrics for Segment-based Retrieval Tasks , 2013, ArXiv.

[20] Andrew Zisserman,et al. VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval , 2012, ACCV.

[21] Thomas Sikora,et al. Feature-based video key frame extraction for low quality video sequences , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[22] Paul Deléglise,et al. LIUM's systems for the IWSLT 2011 speech translation tasks , 2011, IWSLT.

[23] Andrew Zisserman,et al. The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.