An Engine for Online Video Search in Large Archives of the Holocaust Testimonies

In this paper we present an online system for cross-lingual lexical (full-text) searching in the large archive of the Holocaust testimonies. Video interviews recorded in two languages (English and Czech) were automatically transcribed and indexed in order to provide efficient access to the lexical content of the recordings. The engine takes advantage of the state-of-the-art speech recognition system and performs fast spoken term detection (STD), providing direct access to the segments of interviews containing queried words or short phrases.