Neural net nonlinear prediction for speech data
暂无分享,去创建一个
A new, nonlinear, neural network based, predictor has been devised for the encoding of speech data. It may be used in the design of a differential pulse code modulation (DPCM) coder for speech. A hybrid neural network architecture has been employed which combines the perceptron and backpropagation paradigms, thus called the PB-hybrid (PBH). Only two neurons are needed in the backpropagation section, keeping the required overhead modest. This predictor is designed by supervised training, based on a typical sequence of digitised values of samples in a speech frame. Simulation experiments have been carried out using 15 ms frames of 16 kHz speech data. The results obtained for the prediction gain show a 3dB advantage of the PBH network over the linear predictor.
[1] Philip D. Wasserman,et al. Neural computing - theory and practice , 1989 .
[2] Thomas W. Parsons,et al. Voice and Speech Processing , 1986 .
[3] Yoh-Han Pao,et al. Adaptive pattern recognition and neural networks , 1989 .
[4] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .