Use of multiple classifiers for speech recognition in wireless CDMA network environments

In this paper, we address the problem and the use of multiple classi ers for robust recognition over the cellular network. The idea is to provide more variability to the system to be trained, and to support this variability with more number of model parameters. The main drawback is that the model size, and the computational complexity increases linearly related to di erent call environment. To alleviate this problem we rst introduce a new measure called the average-arc-count into the decoding process. The main advantage of this new measure is that many of the multiple classi ers can be shut down during the recognition stage if the average-arc-count of individual classi er exceeds a certain threshold limit for a given utterance. Secondly, we can also build individual classi ers with less number of parameters and without degrading the overall system performance. Experimental results on English connected digit recognition task show a string error rate reduction of as much as 40% by using the multiple classi ers when compared to individual CDMA systems.

[1]  John J. Godfrey,et al.  Transforming HMMs for speaker-independent hands-free speech recognition in the car , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[2]  Olivier Siohan,et al.  Multiple classifiers by constrained minimization , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[3]  E. J. Diethorn A subband noise-reduction method for enhancing speech in telephony and teleconferencing , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[4]  C.E. Mokbel,et al.  Automatic word recognition in cars , 1995, IEEE Trans. Speech Audio Process..

[5]  Qiru Zhou,et al.  A wave decoder for continuous speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  Régine André-Obrecht,et al.  Cellular phone speech recognition: noise compensation vs. robust architectures , 1997, EUROSPEECH.

[7]  Rathinavelu Chengalvarayan,et al.  A comparative study of hybrid modelling techniques for improved telephone speech recognition , 1998, ICSLP.

[8]  Daniele Falavigna,et al.  Use of simulated data for robust telephone speech recognition , 1999, EUROSPEECH.

[9]  Jean-Claude Junqua,et al.  Impact of the unknown communication channel on automatic speech recognition: a review KN-29 , 1997, EUROSPEECH.

[10]  M.G. Rahim,et al.  Signal conditioning techniques for robust speech recognition , 1996, IEEE Signal Processing Letters.