A multi-band nonlinear oscillator model for speech
暂无分享,去创建一个
Nonlinear self-oscillating systems can model speech without an external excitation that drives a conventional filter model. However, they often do not give due consideration to perceptually important but weak signal components such as the higher formants of voiced speech. To overcome this problem, we propose two frequency-domain oscillator models: a bank of sub-band oscillators with individual oscillator states and a multi-band oscillator with a single joint state vector. Their state-transition map is approximated with compactly parameterized multivariate adaptive regression splines (MARS) and the systems are successfully tested in short-term prediction and synthesis experiments with sustained vowels.
[1] Katsuhiko Shirai,et al. Speech synthesis using superposition of sinusoidal waves generated by synchronized oscillators , 1990, ICSLP.
[2] Hans-Peter Bernhard,et al. A tight upper bound on the gain of linear and nonlinear predictors for stationary stochastic processes , 1998, IEEE Trans. Signal Process..
[3] J. Friedman. Multivariate adaptive regression splines , 1990 .
[4] Gernot Kubin. Signal Analysis and Modelling for Speech Processing , 1998 .