Spoken Letter Recognition

Automatic recognition of spoken letters is one of the most challenging tasks in computer speech recognition. The difficulty stems from the acoustic similarity of many of the letters. Accurate recognition requires the system to make fine phonetic distinctions, such as B vs. D, B vs. P, D vs. T, T vs. G, C vs. Z, V vs. Z, M vs. N, and J vs. K. The ability to make such distinctions, that is, to discriminate among the minimal sound units of the language, is a fundamental unsolved problem in computer speech recognition.
