Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition
暂无分享,去创建一个
Ning Ma | Jon Barker | Ángel M. Gómez | Antonio M. Peinado | José A. González | Ning Ma | J. Barker | A. Gómez | A. Peinado
[1] Yoshihiko Nankaku,et al. GMM-Based Missing-Feature Reconstruction on Multi-Frame Windows , 2011, INTERSPEECH.
[2] Yifan Gong,et al. Robust automatic speech recognition : a bridge to practical application , 2015 .
[3] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[4] James R. Glass,et al. Updated Minds Report on Speech Recognition and Understanding, Part 2 Citation Baker, J. Et Al. " Updated Minds Report on Speech Recognition and Understanding, Part 2 [dsp Education]. " Signal Processing Accessed Terms of Use , 2022 .
[5] Daniel P. W. Ellis,et al. Decoding speech in the presence of other sources , 2005, Speech Commun..
[6] Pedro J. Moreno,et al. Speech recognition in noisy environments , 1996 .
[7] Ángel M. Gómez,et al. MMSE Feature Reconstruction Based on an Occlusion Model for Robust ASR , 2012, IberSPEECH.
[8] Ning Ma,et al. MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[9] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[10] A. Nadas,et al. Speech recognition using noise-adaptive prototypes , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.
[11] Ning Ma,et al. Exploiting correlogram structure for robust speech recognition with multiple speech sources , 2007, Speech Commun..
[12] Bhiksha Raj,et al. Soft mask estimation for single channel speaker separation , 2004, SAPA@INTERSPEECH.
[13] Antonio M. Peinado,et al. Model-based compensation of the additive noise for continuous speech recognition. experiments using the Aurora II database and tasks , 2001, INTERSPEECH.
[14] Bhiksha Raj,et al. Techniques for Noise Robustness in Automatic Speech Recognition , 2012, Techniques for Noise Robustness in Automatic Speech Recognition.
[15] Rainer Martin,et al. Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..
[16] Ning Ma,et al. Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition , 2012, INTERSPEECH.
[17] Sam T. Roweis,et al. Factorial models and refiltering for speech separation and denoising , 2003, INTERSPEECH.
[18] Friedrich Faubel,et al. BASED SOFT-MASK ESTIMATION FOR MISSING FEATURE RECONSTRUCTION , 2008 .
[19] Juan Manuel Górriz,et al. Voice Activity Detection. Fundamentals and Speech Recognition System Robustness , 2007 .
[20] James R. Glass,et al. Developments and directions in speech recognition and understanding, Part 1 [DSP Education] , 2009, IEEE Signal Processing Magazine.
[21] Li Deng,et al. HMM adaptation using vector taylor series for noisy speech recognition , 2000, INTERSPEECH.
[22] J M Górriz,et al. Statistical voice activity detection based on integrated bispectrum likelihood ratio tests for robust speech recognition. , 2007, The Journal of the Acoustical Society of America.
[23] Bhiksha Raj,et al. Soft Mask Methods for Single-Channel Speaker Separation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[24] Yifan Gong,et al. An Overview of Noise-Robust Automatic Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[25] Jon Barker,et al. Soft decisions in missing data techniques for robust automatic speech recognition , 2000, INTERSPEECH.
[26] B. Raj,et al. Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..
[27] Friedrich Faubel,et al. Bounded conditional mean imputation with Gaussian mixture models: A reconstruction approach to partly occluded features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[28] Masakiyo Fujimoto,et al. Speech recognition in the presence of highly non-stationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation , 2011 .
[29] R.M. Stern,et al. Missing-feature approaches in speech recognition , 2005, IEEE Signal Processing Magazine.
[30] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .
[31] Jesper Jensen,et al. MMSE based noise PSD tracking with low complexity , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[32] Roger K. Moore,et al. Hidden Markov model decomposition of speech and noise , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[33] A. Banihashemi,et al. A Non-Linear Minimum Mean Square Error Estimator for the Mixture-Maximization Approximation , 2006 .
[34] Ulpu Remes,et al. Techniques for Noise Robustness in Automatic Speech Recognition , 2012 .
[35] Phil D. Green,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001, Speech Commun..
[36] Javier Ramírez,et al. Efficient voice activity detection algorithms using long-term speech information , 2004, Speech Commun..
[37] Ali I. Genç. Moments of truncated normal/independent distributions , 2013 .
[38] Masakiyo Fujimoto,et al. LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[39] Jon Barker,et al. A pitch based noise estimation technique for robust speech recognition with Missing Data , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[40] Jean Paul Haton,et al. On noise masking for automatic missing data speech recognition: A survey and discussion , 2007, Comput. Speech Lang..
[41] Friedrich Faubel,et al. A phase-averaged model for the relationship between noisy speech, clean speech and noise in the log-mel domain , 2008, INTERSPEECH.
[42] James Glass,et al. Research Developments and Directions in Speech Recognition and Understanding, Part 1 , 2009 .
[43] Reinhold Häb-Umbach,et al. An analytic derivation of a phase-sensitive observation model for noise robust speech recognition , 2009, INTERSPEECH.
[44] John R. Hershey,et al. Single-Channel Multitalker Speech Recognition , 2010, IEEE Signal Processing Magazine.
[45] Hugo Van hamme,et al. Effect of phase-sensitive environment model and higher order VTS on noisy speech feature enhancement [speech recognition applications] , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[46] Li Deng,et al. Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise , 2004, IEEE Transactions on Speech and Audio Processing.
[47] Richard M. Stern,et al. Reconstruction of missing features for robust speech recognition , 2004, Speech Commun..
[48] Phil D. Green,et al. Missing data techniques for robust speech recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.