Mapping Acoustic Vector Space and Document Vector Space by RNN-LSTM
[1] Gregory H. Wakefield, et al. Audio thumbnailing of popular music using chroma-based representations, 2005, IEEE Transactions on Multimedia.
[2] Jeffrey Dean, et al. Distributed Representations of Words and Phrases and their Compositionality, 2013, NIPS.
[3] James R. Glass, et al. Look, listen, and decode: Multimodal speech recognition with images, 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[4] Guigang Zhang, et al. Deep Learning, 2016, Int. J. Semantic Comput.
[5] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.