One of the next big challenges in Automatic Speech Recognition (ASR) is the transcription of speech in meetings. This task is particularly problematic for current recognition technologies because, in most realistic meeting scenarios, the vocabularies are unconstrained, the speech is spontaneous and often overlapping, and the microphones are inconspicuously placed. To support the development of meeting recognition technologies by both the speech recognition and video extraction research communities, NIST is providing a development and evaluation infrastructure including: a multi-media corpus of audio and video from meetings collected at NIST using a variety of microphones and video cameras, new evaluation protocols, metrics, software, rich transcription conventions, sponsoring evaluations and workshops, facilitating multi-site data pooling, and helping bring the community together to focus on the technical challenges. To date, NIST has collected a pilot corpus of 15 hours of meetings in its specially-instrumented Meeting Data Collection Laboratory. The corpus includes digital recordings from close-talking mics, lapel mics, distantly-placed mics, 5 digitally-recorded camera views, and full speaker/word-level transcripts. This data is being used in the development and evaluation of speech technologies and by the video extraction community under the auspices of the ARDA Video Analysis and Content Exploitation (VACE) program.
[1]
Andreas Stolcke,et al.
Meetings about meetings: research at ICSI on speech in multiparty conversations
,
2003,
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[2]
E.,et al.
GROUPS : INTERACTION AND PERFORMANCE
,
2001
.
[3]
Martial Michel,et al.
Network Transfer of Control Data: An Application of the NIST SMART DATA FLOW
,
2003
.
[4]
Andreas Stolcke,et al.
The ICSI Meeting Corpus
,
2003,
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[5]
Susanne Burger,et al.
The ISL meeting corpus: the impact of meeting type on speech style
,
2002,
INTERSPEECH.
[6]
Christopher Cieri,et al.
Research methodologies, observations and outcomes in (conversational) speech data collection
,
2002
.