暂无分享,去创建一个
Peter M C Harrison | Peter M. C. Harrison | Dominik Schiller | Nori Jacoby | Pol van Rijn | Pauline Larrouy-Maestri | Silvan Mertes | Elisabeth Andr'e | Dominik Schiller | Nori Jacoby | P. Larrouy-Maestri | Silvan Mertes | Elisabeth Andr'e | P. V. Rijn
[1] Navdeep Jaitly,et al. Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Aleix M. Martinez,et al. Emotional Expressions Reconsidered: Challenges to Inferring Emotion From Human Facial Movements , 2019, Psychological science in the public interest : a journal of the American Psychological Society.
[3] Paul Boersma,et al. Praat: doing phonetics by computer , 2003 .
[4] Peter M C Harrison,et al. Gibbs Sampling with People , 2020, NeurIPS.
[5] Björn W. Schuller,et al. The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing , 2016, IEEE Transactions on Affective Computing.
[6] Bryan Catanzaro,et al. Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis , 2021, ICLR.
[7] C. B. Colby. The weirdest people in the world , 1973 .
[8] Michael C. Mangini,et al. Making the ineffable explicit: estimating the information employed for face classifications , 2004, Cogn. Sci..
[9] Bart de Boer,et al. Introducing Parselmouth: A Python interface to Praat , 2018, J. Phonetics.
[10] Björn W. Schuller,et al. Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.
[11] Josh H McDermott,et al. Headphone screening to facilitate web-based auditory experiments , 2017, Attention, Perception, & Psychophysics.
[12] Yuxuan Wang,et al. Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron , 2018, ICML.
[13] Samy Bengio,et al. Tacotron: Towards End-to-End Speech Synthesis , 2017, INTERSPEECH.
[14] Thomas L. Griffiths,et al. Markov Chain Monte Carlo with People , 2007, NIPS.
[15] IEEE Recommended Practice for Speech Quality Measurements , 1969, IEEE Transactions on Audio and Electroacoustics.
[16] Ryan Prenger,et al. Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Andrey Anikin,et al. Perceptual and acoustic differences between authentic and acted nonverbal emotional vocalizations , 2017, Quarterly journal of experimental psychology.
[18] Yuxuan Wang,et al. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis , 2018, ICML.