Speech Representation and Speech Understanding
暂无分享,去创建一个
The core objective of the research is to encode speech into segments which retain necessary information for accurate continuous speech recognition, but are more efficient to deal with than the usual encoding of short frames of speech. We use a multi-stage decision-tree encoder with linear combinations of features at the decision nodes; the result is segments which cover multiple frames and which are coded with the terminal node number of the final tree.