The Workshop on Speech, Language and Audio in Multimedia (SLAM) positions itself at the crossroads of multiple scientific fields (music and audio processing, speech processing, natural language processing and multimedia) to discuss and stimulate research results, projects, datasets and benchmark initiatives in which audio, speech and language are applied to multimedia data. While the first two editions were co-located with major speech events, SLAM'15 is deeply rooted in the multimedia community, opening up to computer vision and multimodal fusion. To this end, the workshop emphasizes video hyperlinking as a showcase where computer vision meets speech and language. Such techniques provide a powerful illustration of how multimedia technologies incorporating speech, language and audio can make multimedia content collections more accessible, and thereby more useful, to users.