Paper information - Symmetry constrained machine learning

Symmetry constrained machine learning

Symmetry, a central concept in understanding the laws of nature, has been used for centuries in physics, mathematics, and chemistry to help make mathematical models tractable. Yet, despite its power, symmetry was not used extensively in machine learning until rather recently. In this article we show a general way to incorporate symmetries into machine learning models. We demonstrate it with a detailed analysis of a rather simple real-world machine learning system: a neural network for classifying handwritten digits in which no neuron has a bias term. We show that ignoring symmetries can lead to severe over-fitting, and that incorporating symmetry into the model reduces over-fitting while also reducing model complexity, so that the model ultimately requires less training data and takes less time and fewer resources to train.

Doron L. Bergman
