Real-Time Object Detection and Multilingual Speech Synthesis
[1] Srikanth Ronanki, et al. Median-based generation of synthetic speech durations using a non-parametric approach, 2016, IEEE Spoken Language Technology Workshop (SLT).
[2] Rob Fergus, et al. Visualizing and Understanding Convolutional Networks, 2013, ECCV.
[3] Julia Hirschberg, et al. Progress in speech synthesis, 1997.
[4] Heiga Zen, et al. WaveNet: A Generative Model for Raw Audio, 2016, SSW.
[5] Marc'Aurelio Ranzato, et al. Large Scale Distributed Deep Networks, 2012, NIPS.
[6] Andrew Zisserman, et al. Two-Stream Convolutional Networks for Action Recognition in Videos, 2014, NIPS.
[7] Alex Krizhevsky, et al. One weird trick for parallelizing convolutional neural networks, 2014, ArXiv.
[8] Ruslan Salakhutdinov, et al. Multimodal Neural Language Models, 2014, ICML.
[9] Xiang Zhang, et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks, 2013, ICLR.
[10] Yoshua Bengio, et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation, 2014, EMNLP.