暂无分享,去创建一个
[1] Geoffrey E. Hinton,et al. Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer , 2017, ICLR.
[2] Andrew McCallum,et al. Structured Prediction Energy Networks , 2015, ICML.
[3] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[4] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[5] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..
[6] Christopher D. Manning,et al. Better Word Representations with Recursive Neural Networks for Morphology , 2013, CoNLL.
[7] Yee Whye Teh,et al. The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables , 2016, ICLR.
[8] Yann LeCun,et al. Energy-based Generative Adversarial Network , 2016, ICLR.
[9] Koray Kavukcuoglu,et al. Learning word embeddings efficiently with noise-contrastive estimation , 2013, NIPS.
[10] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.
[11] Lifu Tu,et al. Learning Approximate Inference Networks for Structured Prediction , 2018, ICLR.
[12] Aapo Hyvärinen,et al. Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..
[13] Guillaume Bouchard,et al. Complex Embeddings for Simple Link Prediction , 2016, ICML.
[14] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Jason Weston,et al. Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.
[16] Yee Whye Teh,et al. A fast and simple algorithm for training neural probabilistic language models , 2012, ICML.
[17] Rong Pan,et al. Incorporating GAN for Negative Sampling in Knowledge Representation Learning , 2018, AAAI.
[18] Abhinav Gupta,et al. Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[20] Zhiyuan Liu,et al. Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.
[21] Sanja Fidler,et al. Order-Embeddings of Images and Language , 2015, ICLR.
[22] Jianfeng Gao,et al. Embedding Entities and Relations for Learning and Inference in Knowledge Bases , 2014, ICLR.
[23] Andrew M. Dai,et al. MaskGAN: Better Text Generation via Filling in the ______ , 2018, ICLR.
[24] David M. Blei,et al. Augment and Reduce: Stochastic Inference for Large Categorical Distributions , 2018, ICML.
[25] Moustapha Cissé,et al. Efficient softmax approximation for GPUs , 2016, ICML.
[26] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.
[27] Pasquale Minervini,et al. Convolutional 2D Knowledge Graph Embeddings , 2017, AAAI.
[28] Noah A. Smith,et al. Contrastive Estimation: Training Log-Linear Models on Unlabeled Data , 2005, ACL.
[29] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[30] Ben Poole,et al. Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.
[31] Ashish Vaswani,et al. Decoding with Large-Scale Neural Language Models Improves Translation , 2013, EMNLP.
[32] Chris Dyer,et al. Notes on Noise Contrastive Estimation and Negative Sampling , 2014, ArXiv.
[33] Bo Dai,et al. Contrastive Learning for Image Captioning , 2017, NIPS.
[34] Jascha Sohl-Dickstein,et al. REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models , 2017, NIPS.
[35] David Duvenaud,et al. Backpropagation through the Void: Optimizing control variates for black-box gradient estimation , 2017, ICLR.
[36] Yann LeCun,et al. Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[37] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[38] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.
[39] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.
[40] Yanshuai Cao,et al. Improving GAN Training via Binarized Representation Entropy (BRE) Regularization , 2018, ICLR.
[41] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..
[42] Ian J. Goodfellow,et al. On distinguishability criteria for estimating generative models , 2014, ICLR.
[43] Vaibhava Goel,et al. Self-Critical Sequence Training for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[44] Jason Weston,et al. A semantic matching energy function for learning with multi-relational data , 2013, Machine Learning.
[45] Ehud Rivlin,et al. Placing search in context: the concept revisited , 2002, TOIS.
[46] Zhen Wang,et al. Knowledge Graph Embedding by Translating on Hyperplanes , 2014, AAAI.
[47] William Yang Wang,et al. KBGAN: Adversarial Learning for Knowledge Graph Embeddings , 2017, NAACL.
[48] Hao Liu,et al. Action-dependent Control Variates for Policy Optimization via Stein Identity , 2018, ICLR.
[49] Jun Zhao,et al. Knowledge Graph Embedding via Dynamic Mapping Matrix , 2015, ACL.
[50] Aapo Hyvärinen,et al. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models , 2010, AISTATS.