Neuron Campaign for Initialization Guided by Information Bottleneck Theory
暂无分享,去创建一个
Dongmei Zhang | Shi Han | Qiang Fu | Lun Du | Haitao Mao | Xu Chen | Dongmei Zhang | Qiang Fu | Lun Du | Shi Han | Xu Chen | Haitao Mao
[1] Yoshua Bengio,et al. Mutual Information Neural Estimation , 2018, ICML.
[2] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Sebastian Ruder,et al. An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.
[4] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[5] Jiri Matas,et al. All you need is a good init , 2015, ICLR.
[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[7] Wei Lin,et al. Tag2Vec: Learning Tag Representations in Tag Networks , 2019, WWW.
[8] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[9] Arnaud Doucet,et al. On the Impact of the Activation Function on Deep Neural Networks Training , 2019, ICML.
[10] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[12] Jure Leskovec,et al. Graph Information Bottleneck , 2020, NeurIPS.
[13] Sergey Levine,et al. Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow , 2018, ICLR.
[14] Paris Smaragdis,et al. Deep learning for monaural speech separation , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[16] Luciano Floridi,et al. GPT-3: Its Nature, Scope, Limits, and Consequences , 2020, Minds and Machines.
[17] Shuwen Yang,et al. Domain Adaptive Classification on Heterogeneous Information Networks , 2020, IJCAI.
[18] Yoshua Bengio,et al. Learning deep representations by mutual information estimation and maximization , 2018, ICLR.
[19] Dongmei Zhang,et al. TabularNet: A Neural Network Architecture for Understanding Semantic Structures of Tabular Data , 2021, KDD.
[20] Yun Wang,et al. Tag2Gauss: Learning Tag Representations via Gaussian Distribution in Tagged Networks , 2019, IJCAI.
[21] Surya Ganguli,et al. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks , 2013, ICLR.
[22] Yann LeCun,et al. The mnist database of handwritten digits , 2005 .
[23] Kaigui Bian,et al. TSSRGCN: Temporal Spectral Spatial Retrieval Graph Convolutional Network for Traffic Flow Forecasting , 2020, 2020 IEEE International Conference on Data Mining (ICDM).
[24] Naftali Tishby,et al. Opening the Black Box of Deep Neural Networks via Information , 2017, ArXiv.
[25] Alexander A. Alemi,et al. Deep Variational Information Bottleneck , 2017, ICLR.