论文信息 - Can Deep Learning Learn the Principle of Closed Contour Detection?

Can Deep Learning Learn the Principle of Closed Contour Detection?

Learning the principle of a task should always be the primary goal of a learning system. Otherwise it reduces to a memorizing system and there always exists edge cases. In spite of its recent success in visual recognition tasks, convolutional neural networks’ (CNNs) ability to learn principles is still questionable. While CNNs exhibit a certain degree of generalization, they eventually break when the variability exceeds their capacity, indicating a failure to learn the underlying principles. We use edge cases of a closed contour detection task to support our arguments. We argue that lateral interactions, which are not a part of pure feed-forward CNNs but common in biological vision, are essential to this task.

Xinhua Zhang | Garrett T. Kenyon | Yijing Watkins | Xinhua Zhang | Y. Watkins

[1] Yoshua Bengio,et al. Measuring the tendency of CNNs to Learn Surface Statistical Regularities , 2017, ArXiv.

[2] James H Elder,et al. Cue dynamics underlying rapid detection of animals in natural scenes. , 2009, Journal of vision.

[3] Ilya Nemenman,et al. Model Cortical Association Fields Account for the Time Course and Dependence on Target Complexity of Human Contour Perception , 2011, PLoS Comput. Biol..

[4] Benjamin Recht,et al. Do CIFAR-10 Classifiers Generalize to CIFAR-10? , 2018, ArXiv.

[5] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Lance R. Williams,et al. Stochastic Completion Fields: A Neural Model of Illusory Contour Shape and Salience , 1997, Neural Computation.

[7] Honglak Lee,et al. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Lance R. Williams,et al. A Comparison of Measures for Detecting Natural Shapes in Cluttered Backgrounds , 1998, International Journal of Computer Vision.

[11] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[12] Seunghoon Hong,et al. Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[13] David J. Field,et al. Contour integration by the human visual system: Evidence for a local “association field” , 1993, Vision Research.

[14] Thomas Serre,et al. Not-So-CLEVR: Visual Relations Strain Feedforward Neural Networks , 2018, ICLR 2018.