Projection based weight normalization: Efficient method for optimization on oblique manifold in DNNs
暂无分享,去创建一个
Ling Shao | Fan Zhu | Li Liu | Xianglong Liu | Jie Qin | Lei Huang | Li Liu | Jie Qin | Lei Huang | Xianglong Liu | Fan Zhu | Ling Shao
[1] Nikos Komodakis,et al. Wide Residual Networks , 2016, BMVC.
[2] Lei Huang,et al. Centered Weight Normalization in Accelerating Training of Deep Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[3] Zhang Yi,et al. Connections Between Nuclear-Norm and Frobenius-Norm-Based Representations , 2015, IEEE Transactions on Neural Networks and Learning Systems.
[4] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[5] Sergey Ioffe,et al. Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models , 2017, NIPS.
[6] Yoshua Bengio,et al. Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.
[7] Lu Bai,et al. Spectral bounding: Strictly satisfying the 1-Lipschitz property for generative adversarial networks , 2020, Pattern Recognit..
[8] Robert E. Mahony,et al. Optimization Algorithms on Matrix Manifolds , 2007 .
[9] Adi Shraibman,et al. Rank, Trace-Norm and Max-Norm , 2005, COLT.
[10] Tapani Raiko,et al. Semi-supervised Learning with Ladder Networks , 2015, NIPS.
[11] Razvan Pascanu,et al. Natural Neural Networks , 2015, NIPS.
[12] Ruslan Salakhutdinov,et al. Path-SGD: Path-Normalized Optimization in Deep Neural Networks , 2015, NIPS.
[13] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.
[14] Heng Tao Shen,et al. Hierarchical LSTMs with Adaptive Attention for Visual Captioning , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[15] Anders Krogh,et al. A Simple Weight Decay Can Improve Generalization , 1991, NIPS.
[16] Yoshua Bengio,et al. Maxout Networks , 2013, ICML.
[17] Jun Zhu,et al. Triple Generative Adversarial Nets , 2017, NIPS.
[18] Minhyung Cho,et al. Riemannian approach to batch normalization , 2017, NIPS.
[19] Xianglong Liu,et al. Orthogonal Weight Normalization: Solution to Optimization over Multiple Dependent Stiefel Manifolds in Deep Neural Networks , 2017, AAAI.
[20] Ruslan Salakhutdinov,et al. Data-Dependent Path Normalization in Neural Networks , 2015, ICLR.
[21] Dacheng Tao,et al. Improving Training of Deep Neural Networks via Singular Value Bounding , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[23] Lei Huang,et al. Decorrelated Batch Normalization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[24] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Robert Hecht-Nielsen,et al. On the Geometry of Feedforward Neural Network Error Surfaces , 1993, Neural Computation.
[26] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[27] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[28] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Ruslan Salakhutdinov,et al. Scaling up Natural Gradient by Sparsely Factorizing the Inverse Fisher Matrix , 2015, ICML.
[30] Matus Telgarsky,et al. Spectrally-normalized margin bounds for neural networks , 2017, NIPS.
[31] Xianglong Liu,et al. Spatio-temporal deformable 3D ConvNets with attention for action recognition , 2020, Pattern Recognit..
[32] P. Absil,et al. Riemannian Geometry of Grassmann Manifolds with a View on Algorithmic Computation , 2004 .
[33] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[34] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.
[35] Shiliang Pu,et al. All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[36] Xuelong Li,et al. From Deterministic to Generative: Multimodal Stochastic RNNs for Video Captioning , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[37] Nicu Sebe,et al. Spatio-Temporal Attention Networks for Action Recognition and Detection , 2020, IEEE Transactions on Multimedia.
[38] Moustapha Cissé,et al. Parseval Networks: Improving Robustness to Adversarial Examples , 2017, ICML.
[39] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .
[40] Ping Luo,et al. Learning Deep Architectures via Generalized Whitened Neural Networks , 2017, ICML.
[41] Chao Li,et al. Active multi-kernel domain adaptation for hyperspectral image classification , 2017, Pattern Recognit..
[42] Renjie Liao,et al. Normalizing the Normalizers: Comparing and Extending Network Normalization Schemes , 2016, ICLR.
[43] Yoshua Bengio,et al. Deconstructing the Ladder Network Architecture , 2015, ICML.
[44] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.
[45] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[46] Jost Tobias Springenberg,et al. Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks , 2015, ICLR.
[47] Xianglong Liu,et al. Learning binary code for fast nearest subspace search , 2020, Pattern Recognit..
[48] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Zhenyu Huang,et al. Multiple Marginal Fisher Analysis , 2019, IEEE Transactions on Industrial Electronics.
[50] Zhuowen Tu,et al. Deeply-Supervised Nets , 2014, AISTATS.
[51] Ole Winther,et al. Auxiliary Deep Generative Models , 2016, ICML.
[52] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[53] Pierre-Antoine Absil,et al. Joint Diagonalization on the Oblique Manifold for Independent Component Analysis , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
[54] Kilian Q. Weinberger,et al. Deep Networks with Stochastic Depth , 2016, ECCV.