Improving Children's Speech Recognition Through Explicit Pitch Scaling Based on Iterative Spectrogram Inversion
暂无分享,去创建一个
Syed Shahnawazuddin | Gayadhar Pradhan | Arun B. Samaddar | Hemant Kumar Kathania | Waquar Ahmad | A. B. Samaddar | S. Shahnawazuddin | H. Kathania | Waquar Ahmad | Gayadhar Pradhan
[1] Syed Shahnawazuddin,et al. Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition , 2015, INTERSPEECH.
[2] Tara N. Sainath,et al. Large vocabulary automatic speech recognition for children , 2015, INTERSPEECH.
[3] Jan Cernocký,et al. Improved feature processing for deep neural networks , 2013, INTERSPEECH.
[4] Lonce Wyse,et al. AN EFFICIENT ALGORITHM FOR REAL-TIME SPECTROGRAM INVERSION , 2005 .
[5] Syed Shahnawazuddin,et al. Pitch-Adaptive Front-End Features for Robust Children's ASR , 2016, INTERSPEECH.
[6] Diego Giuliani,et al. Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[7] Daniel Elenius,et al. The PF_STAR children's speech corpus , 2005, INTERSPEECH.
[8] Shrikanth S. Narayanan,et al. Creating conversational interfaces for children , 2002, IEEE Trans. Speech Audio Process..
[9] Steve Renals,et al. WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[10] P. Yip,et al. Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .
[11] Meinard Müller,et al. Improving Time-Scale Modification of Music Signals Using Harmonic-Percussive Separation , 2014, IEEE Signal Processing Letters.
[12] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[13] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[14] Martin J. Russell,et al. Challenges for computer recognition of children2s speech , 2007, SLaTE.
[15] Jian Cheng,et al. Using deep neural networks to improve proficiency assessment for children English language learners , 2014, INTERSPEECH.
[16] Shrikanth S. Narayanan,et al. Robust recognition of children's speech , 2003, IEEE Trans. Speech Audio Process..
[17] Shweta Ghai,et al. On the use of pitch normalization for improving children's speech recognition , 2009, INTERSPEECH.
[18] Diego Giuliani,et al. Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children† , 2016, Natural Language Engineering.
[19] Y. Asnath Victy Phamila,et al. Discrete Cosine Transform based fusion of multi-focus images for visual sensor networks , 2014, Signal Process..
[20] Shrikanth S. Narayanan,et al. Improving speech recognition for children using acoustic adaptation and pronunciation modeling , 2014, WOCCI.
[21] Shrikanth S. Narayanan,et al. A review of ASR technologies for children's speech , 2009, WOCCI.
[22] Jean Laroche,et al. New phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).
[23] N. Ahmed,et al. Discrete Cosine Transform , 1996 .
[24] Meinard Müller,et al. TSM Toolbox: MATLAB Implementations of Time-Scale Modification Algorithms , 2014, DAFx.
[25] Shrikanth S. Narayanan,et al. Acoustics of children's speech: developmental changes of temporal and spectral parameters. , 1999, The Journal of the Acoustical Society of America.
[26] Syed Shahnawazuddin,et al. Enhancing noise and pitch robustness of children's ASR , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Lonce L. Wyse,et al. Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[28] Shweta Ghai,et al. Exploring the role of spectral smoothing in context of children's speech recognition , 2009, INTERSPEECH.