Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
[1] Steven Greenberg,et al. Robust speech recognition using the modulation spectrogram , 1998, Speech Commun..
[2] B. Everitt,et al. Three-Mode Principal Component Analysis. , 1986 .
[3] Shihab Shamma,et al. Auditory Representations of Timbre and Pitch , 1996 .
[4] Kuo-Chang Huang,et al. A study of robust Mandarin speech recognition; Robust speech recognition in noisy environments , 2003 .
[5] Kuansan Wang,et al. Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.
[6] Joos Vandewalle,et al. A Multilinear Singular Value Decomposition , 2000, SIAM J. Matrix Anal. Appl..
[8] Demetri Terzopoulos,et al. Multilinear Analysis of Image Ensembles: TensorFaces , 2002, ECCV.
[9] Thorsten Joachims,et al. Making large scale SVM learning practical , 1998 .
[10] L. Tucker,et al. Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.
[11] Kuansan Wang,et al. Spectral shape analysis in the central auditory system , 1995, IEEE Trans. Speech Audio Process..
[12] Demetri Terzopoulos,et al. Multilinear subspace analysis of image ensembles , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..
[13] Lie Lu,et al. Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..
[14] B. Kollmeier,et al. Modeling auditory processing of amplitude modulation. II. Spectral and temporal integration. , 1997, The Journal of the Acoustical Society of America.
[15] Brian Kingsbury,et al. Robust speech recognition in Noisy Environments: The 2001 IBM spine evaluation system , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[16] Birger Kollmeier,et al. Combining speech enhancement and auditory feature extraction for robust speech recognition , 2000, Speech Commun..
[17] Nima Mesgarani,et al. Speech enhancement based on filtering the spectrotemporal modulations , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[18] Mounya Elhilali,et al. A spectro-temporal modulation index (STMI) for assessment of speech intelligibility , 2003, Speech Commun..
[19] S. Shamma,et al. An account of monaural phase sensitivity. , 2002, The Journal of the Acoustical Society of America.
[20] Joos Vandewalle,et al. On the Best Rank-1 and Rank-(R1 , R2, ... , RN) Approximation of Higher-Order Tensors , 2000, SIAM J. Matrix Anal. Appl..
[21] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[22] John Saunders,et al. Real-time discrimination of broadcast speech/music , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[23] Torsten Dau. Modeling auditory processing of amplitude modulation II. Spectral and temporal integration , 1997 .
[24] W. J. Nowack. Methods in Neuronal Modeling , 1991, Neurology.
[25] Wolfgang Effelsberg,et al. Automatic audio content analysis , 1997, MULTIMEDIA '96.
[26] S A Shamma,et al. Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex. , 2001, Journal of neurophysiology.
[27] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[28] B. De Moor,et al. Dimensionality reduction in higher-order-only ICA , 1997, Proceedings of the IEEE Signal Processing Workshop on Higher-Order Statistics.
[29] Stephanie Seneff,et al. Transcription and Alignment of the TIMIT Database , 1996 .
[30] Li Deng,et al. Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition , 2003, IEEE Trans. Speech Audio Process..
[31] S. Shamma,et al. Analysis of dynamic spectra in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary dynamic spectra. , 1996, Journal of neurophysiology.
[32] David Gelbart,et al. Improving word accuracy with Gabor feature extraction , 2002, INTERSPEECH.
[33] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[34] S. Shamma,et al. Analysis of dynamic spectra in ferret primary auditory cortex. I. Characteristics of single-unit responses to moving ripple spectra. , 1996, Journal of neurophysiology.
[35] Douglas Keislar,et al. Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..
[36] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[37] John C. Platt,et al. Distortion discriminant analysis for audio fingerprinting , 2003, IEEE Trans. Speech Audio Process..
[38] Masataka Goto,et al. RWC Music Database: Music genre database and musical instrument sound database , 2003, ISMIR.
[39] Malcolm Slaney,et al. Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[40] Jonathan Foote,et al. Content-based retrieval of music and audio , 1997, Other Conferences.
[41] Richard Bellman,et al. Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.
[42] Les E. Atlas,et al. Joint Acoustic and Modulation Frequency , 2003, EURASIP Journal on Applied Signal Processing 2003:7, 668–675.