Predicting What You Already Know Helps: Provable Self-Supervised Learning
暂无分享,去创建一个
[1] A. Buja. Remarks on Functional Canonical Variates, Alternating Least Squares Methods and Ace , 1990 .
[2] Peter L. Bartlett,et al. Rademacher and Gaussian Complexities: Risk Bounds and Structural Results , 2003, J. Mach. Learn. Res..
[3] Sanjeev Arora,et al. A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks , 2020, ICLR.
[4] S. Kakade,et al. Few-Shot Learning via Learning the Representation, Provably , 2020, ICLR.
[5] Andrew R. Barron,et al. Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.
[6] Xinlei Chen,et al. Exploring Simple Siamese Representation Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[7] James Cheng,et al. Self-Enhanced GNN: Improving Graph Neural Networks Using Model Outputs , 2020, 2021 International Joint Conference on Neural Networks (IJCNN).
[8] Sham M. Kakade,et al. Multi-view Regression Via Canonical Correlation Analysis , 2007, COLT.
[9] Akshay Krishnamurthy,et al. Contrastive learning, multi-view redundancy, and linear models , 2020, ALT.
[10] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.
[11] Tzee-Ming Huang. Testing conditional independence using maximal nonlinear conditional correlation , 2010, 1010.3843.
[12] Sanjeev Arora,et al. A Practical Algorithm for Topic Modeling with Provable Guarantees , 2012, ICML.
[13] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Paolo Favaro,et al. Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles , 2016, ECCV.
[15] Yoshua Bengio,et al. What regularized auto-encoders learn from the data-generating distribution , 2012, J. Mach. Learn. Res..
[16] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.
[17] Pierre H. Richemond,et al. Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.
[18] Bernhard Schölkopf,et al. Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.
[19] Geoffrey E. Hinton,et al. Big Self-Supervised Models are Strong Semi-Supervised Learners , 2020, NeurIPS.
[20] Mikhail Khodak,et al. A Theoretical Analysis of Contrastive Unsupervised Representation Learning , 2019, ICML.
[21] Michael Tschannen,et al. On Mutual Information Maximization for Representation Learning , 2019, ICLR.
[22] Akshay Krishnamurthy,et al. Contrastive estimation reveals topic posterior information to linear models , 2020, J. Mach. Learn. Res..
[23] Yingli Tian,et al. Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Yuesheng Xu,et al. Universal Kernels , 2006, J. Mach. Learn. Res..
[25] Sanjeev Arora,et al. Learning Topic Models -- Going beyond SVD , 2012, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.
[26] Santosh S. Vempala,et al. Latent semantic indexing: a probabilistic analysis , 1998, PODS '98.
[27] Charles Blundell,et al. Representation Learning via Invariant Causal Mechanisms , 2020, ICLR.
[28] Xinlei Chen,et al. Understanding Self-supervised Learning with Dual Deep Networks , 2020, ArXiv.
[29] Michael I. Jordan,et al. Kernel dimension reduction in regression , 2009, 0908.1854.
[30] Michael I. Jordan,et al. Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces , 2004, J. Mach. Learn. Res..
[31] C. Baker. Joint measures and cross-covariance operators , 1973 .
[32] Tong Zhang,et al. Two-view feature generation model for semi-supervised learning , 2007, ICML '07.
[33] Alexei A. Efros,et al. Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Christopher Potts,et al. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.
[35] Thomas Hofmann,et al. Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.
[36] Nitish Srivastava. Unsupervised Learning of Visual Representations using Videos , 2015 .
[37] Martial Hebert,et al. Shuffle and Learn: Unsupervised Learning Using Temporal Order Verification , 2016, ECCV.
[38] Efstratios Gavves,et al. Self-Supervised Video Representation Learning with Odd-One-Out Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] J. Friedman,et al. Estimating Optimal Transformations for Multiple Regression and Correlation. , 1985 .
[41] Xiaowei Zhou,et al. Path-Invariant Map Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Alexei A. Efros,et al. Colorful Image Colorization , 2016, ECCV.
[43] David Gross,et al. Recovering Low-Rank Matrices From Few Coefficients in Any Basis , 2009, IEEE Transactions on Information Theory.
[44] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.
[45] Pascal Vincent,et al. A Connection Between Score Matching and Denoising Autoencoders , 2011, Neural Computation.
[46] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[47] Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere , 2020, ICML.
[48] Thomas Brox,et al. Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[49] Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss , 2021, ArXiv.
[50] Yao-Hung Hubert Tsai,et al. Demystifying Self-Supervised Learning: An Information-Theoretical Framework , 2020, ArXiv.
[51] Yu Cheng,et al. Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[52] Phillip Isola,et al. Contrastive Multiview Coding , 2019, ECCV.
[53] Andrew Zisserman,et al. Learning and Using the Arrow of Time , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[54] John Shawe-Taylor,et al. Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.
[55] Alexander Kolesnikov,et al. Revisiting Self-Supervised Visual Representation Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Nikos Komodakis,et al. Unsupervised Representation Learning by Predicting Image Rotations , 2018, ICLR.
[57] Zhuang Ma,et al. Noise Contrastive Estimation and Negative Sampling for Conditional Models: Consistency and Statistical Efficiency , 2018, EMNLP.
[58] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.
[59] J. Leskovec,et al. Strategies for Pre-training Graph Neural Networks , 2019, ICLR.
[60] Ludwig Schmidt,et al. Unlabeled Data Improves Adversarial Robustness , 2019, NeurIPS.
[61] Honglak Lee,et al. An efficient framework for learning sentence representations , 2018, ICLR.
[62] Alec Radford,et al. Improving Language Understanding by Generative Pre-Training , 2018 .
[63] Shai Ben-David,et al. Understanding Machine Learning: From Theory to Algorithms , 2014 .
[64] Sham M. Kakade,et al. Random Design Analysis of Ridge Regression , 2012, COLT.
[65] M. Rudelson,et al. Hanson-Wright inequality and sub-gaussian concentration , 2013 .
[66] Dawn Song,et al. Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty , 2019, NeurIPS.
[67] Christopher D. Manning,et al. Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.
[68] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[69] Alexei A. Efros,et al. A Century of Portraits: A Visual Historical Record of American High School Yearbooks , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).
[70] Aapo Hyvärinen,et al. Noise-contrastive estimation: A new estimation principle for unnormalized statistical models , 2010, AISTATS.
[71] Sergey Levine,et al. Grasp2Vec: Learning Object Representations from Self-Supervised Grasping , 2018, CoRL.
[72] Yoshua Bengio,et al. Learning deep representations by mutual information estimation and maximization , 2018, ICLR.
[73] Shao-Lun Huang,et al. An efficient algorithm for information decomposition and extraction , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).
[74] Alexei A. Efros,et al. Unsupervised Visual Representation Learning by Context Prediction , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[75] Kristian Kirsch,et al. Methods Of Modern Mathematical Physics , 2016 .