Dynamic recognition of vowels by machine using trajectories in a two dimensional feature space
暂无分享,去创建一个
Two real values features derived from vowel formants in every 10-ms time frame, are plotted in the plane to form a trajectory. The trajectories are analyzed geometrically to extract stationary regions and turning points, and to fit straight lines to suitable parts. Relating these to "ideal" positions for six basic vowels, a new set of dynamic features are derived and used for classification of already segmented vowels. Using a k-nearest neighbour rule with 2300 training vowels and as many test vowels, taken from continuous speech samples of the same group of 33 male speakers, an average success rate of 72% has been achieved in six way classification. This may be compared to 75-86% claimed for human subjects in similar tests, but with little training and much less data.<<ETX>>
[1] Gerhardus Johannes Prinsloo. Dynamic vocal tract features for the recognition of coarticulated vowels , 1993 .
[2] V. R. Algazi,et al. Speech recognition using dynamic features of acoustic subword spectra , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[3] Frank Fallside,et al. Computer display of vowel quadrilateral with real-time representation of articulation , 1978 .