AM-FM DECOMPOSITION OF SPEECH SIGNAL: APPLICATIONS FOR SPEECH PRIVACY AND DIAGNOSIS
暂无分享,去创建一个
[1] Sanjeev Khudanpur,et al. Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Jan Cernocký,et al. Improved feature processing for deep neural networks , 2013, INTERSPEECH.
[3] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[4] Petr Motlícek,et al. Autoregressive Models of Amplitude Modulations in Audio Compression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[5] Daniel P. W. Ellis,et al. Frequency-domain linear prediction for temporal features , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[6] James David Johnston,et al. Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS) , 1996 .
[7] Douglas A. Reynolds,et al. Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..
[8] Paavo Alku,et al. Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering , 1991, Speech Commun..
[9] J. Makhoul,et al. Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.
[10] 千葉 勉,et al. The vowel : its nature and structure , 1941 .
[11] H. Dudley. The carrier nature of speech , 1940 .