An Empirical Evaluation of Current Convolutional Architectures’ Ability to Manage Nuisance Location and Scale Variability
Stefano Soatto | Nikolaos Karianakis | Jingming Dong
[1] Dumitru Erhan,et al. Scalable, High-Quality Object Detection , 2014, ArXiv.
[2] Thomas Brox,et al. Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT , 2014, ArXiv.
[3] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.
[4] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[5] Qiang Chen,et al. Network In Network , 2013, ICLR.
[6] Cristian Sminchisescu,et al. CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[7] Zhuowen Tu,et al. Deeply-Supervised Nets , 2014, AISTATS.
[8] Cordelia Schmid,et al. A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.
[9] Jonathan Balzer,et al. Multi-view feature engineering and learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[11] Andrew Zisserman,et al. Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.
[12] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[13] Geoffrey E. Hinton,et al. Modeling the joint density of two images under a variety of transformations , 2011, CVPR 2011.
[14] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.
[15] Marc'Aurelio Ranzato,et al. Learning Longer Memory in Recurrent Neural Networks , 2014, ICLR.
[16] David G. Lowe. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .
[17] Stefano Soatto,et al. Domain-size pooling in local descriptors: DSP-SIFT , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[19] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.
[20] T. Poggio. The computational magic of the ventral stream , 2012 .
[21] Quoc V. Le,et al. Measuring Invariances in Deep Networks , 2009, NIPS.
[22] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.
[23] Bernt Schiele,et al. What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Thomas Serre,et al. Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[25] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.
[26] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[27] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.
[28] Thomas Brox,et al. Unsupervised feature learning by augmenting single images , 2013, ICLR.
[29] Razvan Pascanu,et al. Advances in optimizing recurrent networks , 2012, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[30] James M. Rehg,et al. RIGOR: Reusing Inference in Graph Cuts for Generating Object Regions , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[31] Pedro M. Domingos,et al. Deep Symmetry Networks , 2014, NIPS.
[32] Santiago Manen,et al. Prime Object Proposals with Randomized Prim's Algorithm , 2013, 2013 IEEE International Conference on Computer Vision.
[33] Max Welling,et al. Learning the Irreducible Representations of Commutative Lie Groups , 2014, ICML.
[34] Jiri Matas,et al. Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput.
[35] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.
[36] Thomas Deselaers,et al. Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[37] Philip H. S. Torr,et al. BING: Binarized normed gradients for objectness estimation at 300fps , 2014, Computational Visual Media.
[38] Bernt Schiele,et al. How good are detection proposals, really? , 2014, BMVC.
[39] Lorenzo Rosasco,et al. On Invariance and Selectivity in Representation Learning , 2015, ArXiv.
[40] Andrea Vedaldi,et al. MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.
[41] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[42] Vladlen Koltun,et al. Geodesic Object Proposals , 2014, ECCV.
[43] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.
[44] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[45] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Jason Yosinski,et al. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Thomas M. Cover,et al. Elements of Information Theory , 2005 .
[50] Claude E. Shannon. A Mathematical Theory of Communication , 1948, Bell System Technical Journal.
[51] Stefano Soatto,et al. Visual Representations: Defining Properties and Deep Approximations , 2014, ICLR 2016.
[53] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[54] Dumitru Erhan,et al. Scalable Object Detection Using Deep Neural Networks , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[55] Matthew B. Blaschko,et al. Learning a category independent object detection cascade , 2011, 2011 International Conference on Computer Vision.
[56] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[57] Stéphane Mallat,et al. Invariant Scattering Convolution Networks , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[58] Ross B. Girshick. Fast R-CNN , 2015, arXiv:1504.08083.
[59] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[60] Ming Yang,et al. Regionlets for Generic Object Detection , 2013, ICCV.
[61] T. Tuytelaars,et al. Weakly Supervised Object Detection with Posterior Regularization , 2014 .
[62] Neil A. Dodgson,et al. Proceedings Ninth IEEE International Conference on Computer Vision , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.