Multilingual Speech Processing Activities in Quaero: Application to Multimedia Search in Unstructured Data

Spoken language processing technologies are principal components in most of the applications being developed as part of the Quaero program. Quaero is a large research and industrial innovation program focusing on the development of technologies for automatic analysis and classification of multimedia and multilingual documents. Concerning speech processing, research aims to substantially improve the state of the art in speech-to-text transcription, speaker diarization and recognition, language recognition, and speech translation.