Phoneme recognition with elliptic discrimination neural units
暂无分享,去创建一个
Many researchers achieved high phoneme recognition rates by multilayered neural networks with linear discrimination neural (LDN) units. However, it is difficult to analyze which components of the input are important to each unit in those LDN networks.
This paper proposed a multilayer neural network with elliptic discrimination neural (EDN) units so that the functions of each unit in the network may be interpreted more definitely. The center of the elliptic discrimination boundary of a neural unit corresponds to a typical point in an input space. The radii of the ellipse express the extent of the corresponding components in the input space, hence it becomes clear which components of the input space are important to each unit in the EDN network.
To compare the performance of EDN and LDN networks, recognition experiments of phonemes /b, d, g/ in 5240 tokens of a Japanese speech database were carried out. In the experiments, recognition rates were obtained by EDN networks as high as the rate by an LDN network. Also, it was confirmed which components of the input space are important to each unit in the EDN network.
[1] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..
[2] Geoffrey E. Hinton,et al. Learning representations by back-propagating errors , 1986, Nature.
[3] Riichiro Mizoguchi,et al. A continuous speech recognition system with a powerful environment for the knowledge-base construction: SPREX II , 1989, Systems and Computers in Japan.