Konrad P. Körding,et al. Extracting Slow Subspaces from Natural Videos Leads to Complex Cells , 2001, ICANN.
 Max Welling,et al. Transformation Properties of Learned Visual Representations , 2014, ICLR.
 Marc'Aurelio Ranzato,et al. Video (language) modeling: a baseline for generative models of natural videos , 2014, ArXiv.
 Terrence J. Sejnowski,et al. Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.
 Jonathan Tompson,et al. Unsupervised Learning of Spatiotemporally Coherent Metrics , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
 Ivan Laptev,et al. Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
 Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
 Pascal Vincent,et al. Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
 Bruno A. Olshausen,et al. Learning Intermediate-Level Representations of Form and Motion from Natural Movies , 2012, Neural Computation.
 Marc'Aurelio Ranzato,et al. Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
 Antonio Torralba,et al. Anticipating the future by watching unlabeled video , 2015, ArXiv.