Learning Distributional Token Representations from Visual Features
暂无分享,去创建一个
[1] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[2] Wei Chen,et al. Sogou Neural Machine Translation Systems for WMT17 , 2017, WMT.
[3] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[4] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.
[5] Alexander M. Rush,et al. Image-to-Markup Generation with Coarse-to-Fine Attention , 2016, ICML.
[6] Fei Xia,et al. The Penn Chinese TreeBank: Phrase structure annotation of a large corpus , 2005, Natural Language Engineering.
[7] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[8] Desmond Elliott,et al. Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description , 2017, WMT.
[9] Marta R. Costa-jussà,et al. Chinese–Spanish neural machine translation enhanced with character and word bitmap fonts , 2017, Machine Translation.
[10] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..
[11] Andrew McCallum,et al. Fast and Accurate Entity Recognition with Iterated Dilated Convolutions , 2017, EMNLP.
[12] Rico Sennrich,et al. Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.
[13] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[14] Jason Lee,et al. Fully Character-Level Neural Machine Translation without Explicit Segmentation , 2016, TACL.
[15] Hinrich Schütze,et al. Nonsymbolic Text Representation , 2016, EACL.
[16] Hung-yi Lee,et al. Learning Chinese Word Representations From Glyphs Of Characters , 2017, EMNLP.
[17] Qun Liu,et al. A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging , 2008, ACL.
[18] Frederick Liu,et al. Learning Character-level Compositionality with Visual Features , 2017, ACL.
[19] Jörg Tiedemann,et al. Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF , 2017, IJCNLP.
[20] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[22] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .