ManifoldNet: A Deep Neural Network for Manifold-Valued Data With Applications

Geometric deep learning is a relatively nascent field that has attracted significant attention in the recent past. This is partly due to the ready availability of manifold-valued data. In this paper we present a novel theoretical framework for developing deep neural networks to cope with manifold-valued fields as inputs. We also present a novel architecture to realize this theory and call it a ManifoldNet. Analogous to convolutions in vector spaces which are equivalent to computing weighted sums, manifold-valued data 'convolutions' can be defined using the weighted Frechet Mean (wFM), an intrinsic operation. The hidden layers of ManifoldNet compute wFM of their inputs, where the weights are to be learnt. Since wFM is an intrinsic operation, the processed data remain on the manifold as they propagate through the hidden layers. To reduce the computational burden, we present a provably convergent recursive algorithm for wFM computation. Additionaly, for non-constant curvature manifolds, we prove that each wFM layer is non-collapsible and a contraction mapping. We also prove that the wFM is equivariant to the symmetry group action admitted by the manifold on which the data reside. To showcase the performance of ManifoldNet, we present several experiments from vision and medical imaging.

[1]  Daniel C Alexander,et al.  Multiple‐Fiber Reconstruction Algorithms for Diffusion MRI , 2005, Annals of the New York Academy of Sciences.

[2]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[3]  Koray Kavukcuoglu,et al.  Pixel Recurrent Neural Networks , 2016, ICML.

[4]  Prafulla Dhariwal,et al.  Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.

[5]  Alex Graves,et al.  Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.

[6]  I. Chavel Riemannian Geometry: Subject Index , 2006 .

[7]  Mark W. Woolrich,et al.  Advances in functional and structural MR image analysis and implementation as FSL , 2004, NeuroImage.

[8]  Rudrasis Chakraborty,et al.  H-CNNs: Convolutional Neural Networks for Riemannian Homogeneous Spaces , 2018, ArXiv.

[9]  M. Fréchet Les éléments aléatoires de nature quelconque dans un espace distancié , 1948 .

[10]  V. Wedeen,et al.  Diffusion MRI of Complex Neural Architecture , 2003, Neuron.

[11]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[12]  Stephan J. Garbin,et al.  Harmonic Networks: Deep Translation and Rotation Equivariance , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Vikas Singh,et al.  A Statistical Recurrent Model on the Manifold of Symmetric Positive Definite Matrices , 2018, NeurIPS.

[14]  Volker Tresp,et al.  Tensor-Train Recurrent Neural Networks for Video Classification , 2017, ICML.

[15]  Robert A. Russell Non-Euclidean Triangle Centers , 2016 .

[16]  Søren Hauberg,et al.  Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Anuj Srivastava,et al.  Riemannian Analysis of Probability Density Functions with Applications in Vision , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Yee Whye Teh,et al.  Learning to Parse Images , 1999, NIPS.

[19]  Nitish Srivastava,et al.  Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.

[20]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[21]  S. Helgason Differential Geometry, Lie Groups, and Symmetric Spaces , 1978 .

[22]  Pierre Vandergheynst,et al.  Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[23]  Rudrasis Chakraborty,et al.  A Nonlinear Regression Technique for Manifold Valued Data with Applications to Medical Image Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Barnabás Póczos,et al.  Equivariance Through Parameter-Sharing , 2017, ICML.

[25]  Karl-Theodor Sturm,et al.  Probability Measures on Metric Spaces of Nonpositive Curvature , 2003 .

[26]  Stéphane Mallat,et al.  Invariant Scattering Convolution Networks , 2012, IEEE transactions on pattern analysis and machine intelligence.

[27]  Maxime Descoteaux,et al.  Dipy, a library for the analysis of diffusion MRI data , 2014, Front. Neuroinform..

[28]  Geoffrey E. Hinton,et al.  Transforming Autoencoders , 2011 .

[29]  Søren Hauberg,et al.  Scalable Robust Principal Component Analysis Using Grassmann Averages , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  A. Stiggelbout,et al.  Systematic evaluation of rating scales for impairment and disability in Parkinson's disease , 2002, Movement disorders : official journal of the Movement Disorder Society.

[31]  W. Kendall Probability, Convexity, and Harmonic Maps with Small Image I: Uniqueness and Fine Existence , 1990 .

[32]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[33]  Derek B Archer,et al.  A Template and Probabilistic Atlas of the Human Sensorimotor Tracts using Diffusion MRI , 2018, Cerebral cortex.

[34]  Maher Moakher,et al.  A Differential Geometric Approach to the Geometric Mean of Symmetric Positive-Definite Matrices , 2005, SIAM J. Matrix Anal. Appl..

[35]  David Groisser Newton's method, zeroes of vector fields, and the Riemannian center of mass , 2004, Adv. Appl. Math..

[36]  Max Welling,et al.  Group Equivariant Convolutional Networks , 2016, ICML.

[37]  Stéphane Mallat,et al.  Understanding deep convolutional networks , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[38]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[39]  Matthieu Cord,et al.  Riemannian batch normalization for SPD neural networks , 2019, NeurIPS.

[40]  Preface A Panoramic View of Riemannian Geometry , 2003 .

[41]  Baba C. Vemuri,et al.  Non-rigid Registration of High Angular Resolution Diffusion Images Represented by Gaussian Mixture Fields , 2009, MICCAI.

[42]  Tong Zhang,et al.  Deep Manifold-to-Manifold Transforming Network , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[43]  Risi Kondor,et al.  On the Generalization of Equivariance and Convolution in Neural Networks to the Action of Compact Groups , 2018, ICML.

[44]  Mathieu Salzmann,et al.  Second-order Convolutional Neural Networks , 2017, ArXiv.

[45]  Rudrasis Chakraborty,et al.  A Deep Neural Network for Manifold-Valued Data with Applications to Neuroimaging , 2019, IPMI.

[46]  Yongdo Lim,et al.  Weighted inductive means , 2014 .

[47]  Rudrasis Chakraborty,et al.  Statistics on the Stiefel manifold: Theory and applications , 2019, The Annals of Statistics.

[48]  Pierre Vandergheynst,et al.  Geodesic Convolutional Neural Networks on Riemannian Manifolds , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[49]  Jonathan H. Manton,et al.  A globally convergent numerical algorithm for computing the centre of mass on compact Lie groups , 2004, ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004..

[50]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  B. Afsari Riemannian Lp center of mass: existence, uniqueness, and convexity , 2011 .

[52]  Christophe Lenglet,et al.  A nonparametric Riemannian framework for processing high angular resolution diffusion images and its applications to ODF-based morphometry , 2011, NeuroImage.

[53]  Rudrasis Chakraborty,et al.  An efficient recursive estimator of the Fréchet mean on a hypersphere with applications to Medical Image Analysis , 2015 .

[54]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[55]  Luc Van Gool,et al.  Building Deep Networks on Grassmann Manifolds , 2016, AAAI.

[56]  Raquel Urtasun,et al.  Understanding the Effective Receptive Field in Deep Convolutional Neural Networks , 2016, NIPS.

[57]  H. O. Foulkes Abstract Algebra , 1967, Nature.

[58]  Stéphane Mallat,et al.  Deep roto-translation scattering for object classification , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Barnabás Póczos,et al.  The Statistical Recurrent Unit , 2017, ICML.

[60]  Luc Van Gool,et al.  A Riemannian Network for SPD Matrix Learning , 2016, AAAI.

[61]  Rudrasis Chakraborty,et al.  Statistical Recurrent Models on Manifold valued Data , 2018, NIPS 2018.

[62]  Sander Dieleman,et al.  Rotation-invariant convolutional neural networks for galaxy morphology prediction , 2015, ArXiv.

[63]  Joan Bruna,et al.  Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[64]  Silvere Bonnabel,et al.  Stochastic Gradient Descent on Riemannian Manifolds , 2011, IEEE Transactions on Automatic Control.

[65]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[66]  P. Basser,et al.  MR diffusion tensor spectroscopy and imaging. , 1994, Biophysical journal.

[67]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[68]  Pedro M. Domingos,et al.  Deep Symmetry Networks , 2014, NIPS.