Analyzing pitch robustness of PMVDR and MFCC features for children's speech recognition
暂无分享,去创建一个
[1] Harald Singer,et al. Pitch dependent phone modelling for HMM based speech recognition , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[2] Bhaskar D. Rao,et al. All-pole modeling of speech based on the minimum variance distortionless response spectrum , 2000, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).
[3] Shrikanth S. Narayanan,et al. Acoustics of children's speech: developmental changes of temporal and spectral parameters. , 1999, The Journal of the Acoustical Society of America.
[4] Shweta Ghai,et al. Exploring the role of spectral smoothing in context of children's speech recognition , 2009, INTERSPEECH.
[5] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[6] John H. L. Hansen,et al. A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition , 2008, Speech Commun..
[7] Xu Shao,et al. Pitch prediction from MFCC vectors for speech reconstruction , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[8] Joakim Gustafson,et al. Voice transformations for improving children²s speech recognition in a publicly available dialogue system , 2002, INTERSPEECH.
[9] Shweta Ghai,et al. On the use of pitch normalization for improving children's speech recognition , 2009, INTERSPEECH.
[10] Steve Young,et al. The HTK book version 3.4 , 2006 .
[11] Fabio Brugnara,et al. Acoustic variability and automatic recognition of children's speech , 2007, Speech Commun..
[12] Luís C. Oliveira,et al. Pitch-synchronous time-scaling for prosodic and voice quality transformations , 2005, INTERSPEECH.
[13] Li Lee,et al. A frequency warping approach to speaker normalization , 1998, IEEE Trans. Speech Audio Process..
[14] R. G. Leonard,et al. A database for speaker-independent digit recognition , 1984, ICASSP.