Time and frequency domain long short-term memory for noise robust pitch tracking
暂无分享,去创建一个
[1] D. Wang,et al. Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , 2008, IEEE Trans. Neural Networks.
[2] Yoshua Bengio,et al. Deep Sparse Rectifier Neural Networks , 2011, AISTATS.
[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[4] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[5] Hideki Kawahara,et al. YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.
[6] Geoffrey Zweig,et al. LSTM time and frequency recurrence for automatic speech recognition , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[7] Jr. G. Forney,et al. Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.
[8] Guy J. Brown,et al. A multi-pitch tracking algorithm for noisy speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[9] Michael Picheny,et al. New methods in continuous Mandarin speech recognition , 1997, EUROSPEECH.
[10] Daniel P. W. Ellis,et al. Noise Robust Pitch Tracking by Subband Autocorrelation Classification , 2012, INTERSPEECH.
[11] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.
[12] Paul Boersma,et al. Praat, a system for doing phonetics by computer , 2002 .
[13] DeLiang Wang,et al. Robust pitch tracking in noisy speech using speaker-dependent deep neural networks , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[15] Guoning Hu,et al. Monaural speech organization and segregation , 2006 .
[16] John H. L. Hansen,et al. F0 estimation for noisy speech by exploring temporal harmonic structures in local time frequency spectrum segment , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[18] Mike Brookes,et al. PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[19] References , 1971 .
[20] Shashidhar G. Koolagudi,et al. Emotion recognition from speech: a review , 2012, International Journal of Speech Technology.
[21] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[22] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[23] Alan A Wrench,et al. A MULTI-CHANNEL/MULTI-SPEAKER ARTICULATORY DATABASE FOR CONTINUOUS SPEECH RECOGNITION RESEARCH , 2000 .
[24] Herman J. M. Steeneken,et al. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..
[25] David Talkin,et al. A Robust Algorithm for Pitch Tracking ( RAPT ) , 2005 .
[26] DeLiang Wang,et al. Neural Network Based Pitch Tracking in Very Noisy Speech , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.