论文信息 - An implementation of decision tree-based context clustering on graphics processing units

An implementation of decision tree-based context clustering on graphics processing units

Decision tree-based context clustering is essential but timeconsuming while building HMM-based speech synthesis systems. Its widely used implementation is not designed to take advantage of highly parallel architectures, such as GPUs. This paper shows an implementation of tree-based clustering for these highly parallel architectures. Experimental results showed that the new implementation running on GPUs was significantly faster than the conventional one running on CPUs.

Heiga Zen | Nicholas Pilkington

[1] Jj Odell,et al. The Use of Context in Large Vocabulary Speech Recognition , 1995 .

[2] Keiichi Tokuda,et al. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis , 1999, EUROSPEECH.

[3] Heiga Zen,et al. Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 , 2007, IEICE Trans. Inf. Syst..

[4] Heiga Zen,et al. The HMM-based speech synthesis system (HTS) version 2.0 , 2007, SSW.

[5] Sadaoki Furui,et al. Harnessing graphics processors for the fast computation of acoustic likelihoods in speech recognition , 2009, Comput. Speech Lang..

[6] Frank K. Soong,et al. GPU-accelerated Gaussian clustering for fMPE discriminative training , 2008, INTERSPEECH.