Robust feature extraction for alphabet recognition

Spectral/temporal segment features are adapted for isolated word recognition and tested with the entire English alphabet set using Hidden Markov Models. The ISOLET database from OGI and the HTK toolkit from Cambridge university were used to test our feature extraction technique. With our feature set we were able to achieve 97.3% recognition accuracy on test data with one pass using a whole word based recognizer. Gaussian noise was also added to evaluate robustness of the feature set. We were able to obtain recognition accuracies of 49.6% and 84.3% at SNR of -10dB and 0dB, respectively. Linear discriminant analysis was also applied to the initial feature set for a number of feature configurations and noise levels but, generally, the performance was not improved. We conclude that the initial feature computations used are both very efficient (best results obtained with 50 total features) and robust in the presence of noise.

[1]  Stephen A. Zahorian,et al.  Phone classification with segmental features and a binary-pair partitioned neural network classifier , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  S. Zahorian,et al.  Dynamic spectral shape features as acoustic correlates for initial stop consonants , 1991 .

[3]  C. Lefebvre,et al.  A comparison of several acoustic representations for speech recognition with degraded and undegraded speech , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[4]  Andreas Spanias,et al.  High-performance alphabet recognition , 1996, IEEE Trans. Speech Audio Process..

[5]  Thomas W. Parsons,et al.  Voice and Speech Processing , 1986 .

[6]  Ron Cole,et al.  The ISOLET spoken letter database , 1990 .

[7]  Stephen A. Zahorian,et al.  PHONE CLASSIFICATION WITH SEGMENTAL FEATURES AND CLASSIFIER A BINARY-PAIR PARTITIONED NEURAL NETWORK , 1997 .

[8]  Ronald A. Cole,et al.  Spoken Letter Recognition , 1990, HLT.

[9]  Nikos Fakotakis,et al.  Fast endpoint detection algorithm for isolated word recognition in office environment , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.