Nenghai Yu | Tie-Yan Liu | Qi Meng | Wei Chen | Shuxin Zheng | Huishuai Zhang
[1] Samy Bengio,et al. Understanding deep learning requires rethinking generalization, 2016, ICLR.
[2] Nathan Srebro,et al. Exploring Generalization in Deep Learning , 2017, NIPS.
[3] Heng-Tze Cheng,et al. Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.
[4] Tomaso A. Poggio,et al. Regularization Networks and Support Vector Machines , 2000, Adv. Comput. Math.
[5] Tat-Seng Chua,et al. Neural Collaborative Filtering , 2017, WWW.
[6] Peter L. Bartlett,et al. Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res.
[7] Ryota Tomioka,et al. In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning , 2014, ICLR.
[8] Gintare Karolina Dziugaite,et al. Computing Nonvacuous Generalization Bounds for Deep (Stochastic) Neural Networks with Many More Parameters than Training Data , 2017, UAI.
[9] Nenghai Yu,et al. G-SGD: Optimizing ReLU Neural Networks in its Positively Scale-Invariant Space , 2018, ICLR.
[10] Sebastian Ruder,et al. An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.
[11] Xiangnan He,et al. A Generic Coordinate Descent Framework for Learning from Implicit Feedback , 2016, WWW.
[12] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[13] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[14] Yoshua Bengio,et al. Deep Sparse Rectifier Neural Networks , 2011, AISTATS.
[15] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009.
[16] Mark Hoogendoorn,et al. Mathematical Foundations for Supervised Learning , 2018.
[17] Matus Telgarsky,et al. Spectrally-normalized margin bounds for neural networks , 2017, NIPS.
[18] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[20] Ryota Tomioka,et al. Norm-Based Capacity Control in Neural Networks , 2015, COLT.
[21] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[22] Leslie Pack Kaelbling,et al. Generalization in Deep Learning , 2017, ArXiv.
[23] Ruslan Salakhutdinov,et al. Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations , 2016, NIPS.
[24] Nitish Srivastava,et al. Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.
[25] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Geoffrey Zweig,et al. Achieving Human Parity in Conversational Speech Recognition , 2016, ArXiv.
[27] Yann Dauphin,et al. Convolutional Sequence to Sequence Learning , 2017, ICML.
[28] Ruslan Salakhutdinov,et al. Path-SGD: Path-Normalized Optimization in Deep Neural Networks , 2015, NIPS.
[29] Qi Meng,et al. Optimization of ReLU Neural Networks using Quotient Stochastic Gradient Descent , 2018.
[30] Tao Chen,et al. TriRank: Review-aware Explainable Recommendation by Modeling Aspects , 2015, CIKM.
[31] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).