Voice Activity Detection Using an Adaptive Context Attention Model
暂无分享,去创建一个
[1] Jaeseok Kim,et al. Vowel based Voice Activity Detection with LSTM Recurrent Neural Network , 2016, ICSPS 2016.
[2] Shrikanth S. Narayanan,et al. Robust Voice Activity Detection Using Long-Term Signal Variability , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Dongsuk Yook,et al. Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds , 2009 .
[4] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[5] Yoshua Bengio,et al. Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..
[6] E. Shlomot,et al. ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications , 1997, IEEE Commun. Mag..
[7] José Augusto Stuchi,et al. Exploring Convolutional Neural Networks for Voice Activity Detection , 2017 .
[8] Tara N. Sainath,et al. Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection , 2016, INTERSPEECH.
[9] Carla Teixeira Lopes,et al. TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .
[10] George Saon,et al. Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Bayya Yegnanarayana,et al. Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs , 2010, IEEE Signal Processing Letters.
[12] Koray Kavukcuoglu,et al. Visual Attention , 2020, Computational Models for Cognitive Vision.
[13] Bayya Yegnanarayana,et al. Single Frequency Filtering Approach for Discriminating Speech and Nonspeech , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[14] Sridha Sridharan,et al. Noise robust voice activity detection using features extracted from the time-domain autocorrelation function , 2010, INTERSPEECH.
[15] John H. L. Hansen,et al. Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux , 2013, IEEE Signal Processing Letters.
[16] Abeer Alwan,et al. Voice activity detection using harmonic frequency components in likelihood ratio test , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[17] Rich Caruana,et al. Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.
[18] Herman J. M. Steeneken,et al. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..
[19] Yusuke Kida,et al. Voice Activity Detection: Merging Source and Filter-based Information , 2016, IEEE Signal Processing Letters.
[20] Trausti T. Kristjansson,et al. DySANA: dynamic speech and noise adaptation for voice activity detection , 2008, INTERSPEECH.
[21] Damjan Vlaj,et al. A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems , 2005, EURASIP J. Adv. Signal Process..
[22] Walter Kellermann,et al. Artificial Neural Network-Based Feature Combination for Spatial Voice Activity Detection , 2016, INTERSPEECH.
[23] Alex Graves,et al. Recurrent Models of Visual Attention , 2014, NIPS.
[24] Thad Hughes,et al. Recurrent neural networks for voice activity detection , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[25] Lina J. Karam,et al. Understanding how image quality affects deep neural networks , 2016, 2016 Eighth International Conference on Quality of Multimedia Experience (QoMEX).
[26] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[27] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[28] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[29] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[30] Andrzej Drygajlo,et al. Entropy based voice activity detection in very noisy conditions , 2001, INTERSPEECH.
[31] Jianwu Dang,et al. Voice Activity Detection Based on an Unsupervised Learning Framework , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[32] DeLiang Wang,et al. Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[33] Hyeontaek Lim,et al. Formant-Based Robust Voice Activity Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[34] Mark Liberman,et al. Speech activity detection on youtube using deep neural networks , 2013, INTERSPEECH.
[35] Wonyong Sung,et al. A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.
[36] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[37] Masakiyo Fujimoto,et al. Noise robust voice activity detection based on periodic to aperiodic component ratio , 2010, Speech Commun..
[38] Pieter Abbeel,et al. Gradient Estimation Using Stochastic Computation Graphs , 2015, NIPS.
[39] Francesco Piazza,et al. Deep neural networks for Multi-Room Voice Activity Detection: Advancements and comparative evaluation , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).
[40] Björn W. Schuller,et al. Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[41] Masakiyo Fujimoto,et al. Noise Robust Voice Activity Detection Based on Switching Kalman Filter , 2008, IEICE Trans. Inf. Syst..
[42] Werner Verhelst,et al. On Noise Robust Voice Activity Detection , 2011, INTERSPEECH.