Continuous robust sound event classification using time-frequency features and deep learning
暂无分享,去创建一个
[1] Satoshi Nakamura,et al. Data collection in real acoustical environments for sound scene understanding and hands-free speech recognition , 1999, EUROSPEECH.
[2] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[3] Gerald Penn,et al. Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Yan Song,et al. Robust Sound Event Classification Using Deep Neural Networks , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[5] Tara N. Sainath,et al. Deep convolutional neural networks for LVCSR , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[6] Huy Phan,et al. Acoustic event detection and localization with regression forests , 2014, INTERSPEECH.
[7] Rasmus Berg Palm,et al. Prediction as a candidate for learning deep hierarchical models of data , 2012 .
[8] Haizhou Li,et al. Spectrogram Image Feature for Sound Event Classification in Mismatched Conditions , 2011, IEEE Signal Processing Letters.
[9] Huy Phan,et al. Learning Representations for Nonspeech Audio Events Through Their Similarities to Speech Patterns , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[10] Thomas C. Walters. Auditory-based processing of communication sounds , 2011 .
[11] Trieu-Kien Truong,et al. Audio classification and categorization based on wavelets and support vector Machine , 2005, IEEE Transactions on Speech and Audio Processing.
[12] Yan Song,et al. Robust sound event recognition using convolutional neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Samy Bengio,et al. Large-scale content-based audio retrieval from text queries , 2008, MIR '08.
[14] I-Fan Chen,et al. Phonetic subspace mixture model for speaker diarization , 2010, INTERSPEECH.
[15] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .
[16] Chng Eng Siong,et al. Overlapping sound event recognition using local spectrogram features and the generalised hough transform , 2013, Pattern Recognit. Lett..
[17] Yan Song,et al. Improved i-Vector Representation for Speaker Diarization , 2016, Circuits Syst. Signal Process..
[18] Richard F. Lyon,et al. Machine Hearing: An Emerging Field [Exploratory DSP] , 2010, IEEE Signal Processing Magazine.
[19] Jake Bouvrie,et al. Notes on Convolutional Neural Networks , 2006 .
[20] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.
[21] Chng Eng Siong,et al. Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Guodong Guo,et al. Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.
[23] Annamaria Mesaros,et al. Sound Event Detection in Multisource Environments Using Source Separation , 2011 .
[24] Richard F. Lyon,et al. Machine Hearing: An Emerging Field , 2010 .
[25] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[26] Huy Phan,et al. Random Regression Forests for Acoustic Event Detection and Classification , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[27] Jonathan William Dennis,et al. Sound event recognition in unstructured environments using spectrogram image processing , 2014 .
[28] Li-Rong Dai,et al. Voice Conversion Using Deep Neural Networks With Layer-Wise Generative Training , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[29] Lie Lu,et al. A flexible framework for key audio effects detection and auditory context inference , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Jitendra Ajmera,et al. A robust speaker clustering algorithm , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).