BIDIRECTIONAL NEURAL NETWORK FOR FEATURE COMPENSATION OF CLEAN AND TELEPHONE SPEECH SIGNALS
暂无分享,去创建一个
[1] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[2] M Bijankhan,et al. FARSDAT- THE SPEECH DATABASE OF FARSI SPOKEN LANGUAGE , 1994 .
[3] Hans-Günter Hirsch. HMM adaptation for applications in telecommunication , 2001, Speech Commun..
[4] Damjan Vlaj,et al. Efficient Noise Robust Feature Extraction Algorithms for Distributed Speech Recognition (DSR) Systems , 2003, Int. J. Speech Technol..
[5] Mahmood Bijankhan,et al. Tfarsdat - the telephone farsi speech database , 2003, INTERSPEECH.
[6] Richard M. Stern,et al. Reconstruction of missing features for robust speech recognition , 2004, Speech Commun..
[7] Sebastian Möller,et al. Quality of Telephone-Based Spoken Dialogue Systems , 2005 .
[8] John H. L. Hansen,et al. Statistical class-based MFCC enhancement of filtered and band-limited speech for robust ASR , 2005, INTERSPEECH.
[9] Steve Young,et al. The HTK book version 3.4 , 2006 .
[10] Richard M. Stern,et al. Band-Independent Mask Estimation for Missing-Feature Reconstruction in the Presence of Unknown Background Noise , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[11] Seyyed Ali Seyyed Salehi,et al. Robust speech recognition by modifying clean and telephone feature vectors using bidirectional neural network , 2006, INTERSPEECH.
[12] Alex Acero,et al. Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[13] John H. L. Hansen,et al. Time–Frequency Correlation-Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions , 2009, IEEE Transactions on Audio, Speech, and Language Processing.