The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sense
Wei Jiang | Wei Xue | Wenju Liu | Shan Liang