Automatic segmentation of speech

A method for the automatic segmentation of speech into phones is described. The incoming utterance is split into more or less stationary parts, and these parts are labelled as phones using the phonetic transcription of the utterance. An implicit segmentation algorithm splits the utterance into segments on the basis of the degree of similarity between the frequency spectra of neighboring frames. An explicit algorithm does the same, but on the basis of the degree of similarity between the frequency spectra of the frames in the utterance and reference spectra. A combination algorithm compares the two segmentation results and produces the final segmentation. The automatically determined phone boundaries are compared with manually determined ones, and the result of a perception test is described.
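
The difference between the implicit and the explicit approach can be illustrated with a minimal sketch. It is not the method of the paper: the cosine similarity measure, the threshold value, the per-frame nearest-reference labelling, and the function names `implicit_boundaries` and `explicit_labels` are all assumptions chosen for illustration, and the combination step is not modelled because the abstract does not specify it.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two magnitude spectra (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero (silent) frame as dissimilar to everything
    return dot / (norm_a * norm_b)


def implicit_boundaries(frames, threshold=0.9):
    """Implicit idea: compare each frame with its predecessor.

    Returns every index i at which frame i is less similar to frame i-1
    than `threshold` (a hypothetical value), i.e. a segment starts at i.
    """
    return [
        i
        for i in range(1, len(frames))
        if cosine_similarity(frames[i - 1], frames[i]) < threshold
    ]


def explicit_labels(frames, references):
    """Explicit idea: compare each frame with reference spectra.

    `references` maps a phone label to a reference spectrum. Each frame
    receives the label of its most similar reference. This simplification
    ignores the ordering constraint a phonetic transcription would impose.
    """
    return [
        max(references, key=lambda label: cosine_similarity(frame, references[label]))
        for frame in frames
    ]


# Toy data: four 3-bin "spectra", two per stationary part.
frames = [[1.0, 0.0, 0.0], [1.0, 0.1, 0.0], [0.0, 1.0, 0.0], [0.0, 1.0, 0.1]]
references = {"a": [1.0, 0.0, 0.0], "i": [0.0, 1.0, 0.0]}

boundaries = implicit_boundaries(frames)
labels = explicit_labels(frames, references)
```

On this toy input the implicit pass finds one spectral discontinuity between the second and third frame, and the explicit pass assigns the first two frames to one reference and the last two to the other, so both views agree on the boundary position.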