Creation of speech corpora for the multilingual Bonn Open Synthesis System

In this paper we present the procedure for creating a new speech corpus for the Bonn Open Synthesis System (BOSS). BOSS has several advantages which make this procedure particularly straightforward and fast. BOSS is open source, allowing flexible use of components and corpora. It shows a clear separation between data and architecture, which means that a change in corpus does not require a change in the architecture. The data formats are strictly defined, making it a very transparent system. The implementation of a small Dutch corpus is used as a case study.