Fast predictive multimodal image registration

We introduce a deep encoder-decoder architecture for image deformation prediction from multimodal images. Specifically, we design an image-patch-based deep network that jointly (i) learns an image similarity measure and (ii) the relationship between image patches and deformation parameters. While our method can be applied to general image registration formulations, we focus on the Large Deformation Diffeomorphic Metric Mapping (LDDMM) registration model. By predicting the initial momentum of the shooting formulation of LDDMM, we preserve its mathematical properties and drastically reduce the computation time, compared to optimization-based approaches. Furthermore, we create a Bayesian probabilistic version of the network that allows evaluation of registration uncertainty via sampling of the network at test time. We evaluate our method on a 3D brain MRI dataset using both T1- and T2-weighted images. Our experiments show that our method generates accurate predictions and that learning the similarity measure leads to more consistent registrations than relying on generic multimodal image similarity measures, such as mutual information. Our approach is an order of magnitude faster than optimization-based LDDMM.

[1]  Bernhard Schölkopf,et al.  Learning similarity measure for multi-modal 3D image registration , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Shaohua Kevin Zhou,et al.  Cross-Domain Synthesis of Medical Images Using Efficient Location-Sensitive Deep Network , 2015, MICCAI.

[3]  Jan Modersitzki,et al.  Numerical Methods for Image Registration , 2004 .

[4]  Nikos Komodakis,et al.  A Deep Metric for Multimodal Registration , 2016, MICCAI.

[5]  Chuck Meyer,et al.  Evaluation of Control Point Selection in Automatic, Mutual Information Driven, 3D Warping , 1998, MICCAI.

[6]  Olivier D. Faugeras,et al.  Variational Methods for Multimodal Image Matching , 2002, International Journal of Computer Vision.

[7]  Frank Sauer,et al.  Learning Based Non-rigid Multi-modal Image Registration Using Kullback-Leibler Divergence , 2005, MICCAI.

[8]  Guido Gerig,et al.  Altered corpus callosum morphology associated with autism over the first 2 years of life. , 2015, Brain : a journal of neurology.

[9]  Arno Klein,et al.  A reproducible evaluation of ANTs similarity metric performance in brain image registration , 2011, NeuroImage.

[10]  Li Zhang,et al.  Deep similarity learning for multimodal medical images , 2018, Comput. methods Biomech. Biomed. Eng. Imaging Vis..

[11]  Cordelia Schmid,et al.  DeepFlow: Large Displacement Optical Flow with Deep Matching , 2013, 2013 IEEE International Conference on Computer Vision.

[12]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1997, International Journal of Computer Vision.

[13]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Dinggang Shen,et al.  Predict brain MR image registration via sparse learning of appearance and transformation , 2015, Medical Image Anal..

[15]  Zoubin Ghahramani,et al.  Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference , 2015, ArXiv.

[16]  Mark Holden,et al.  A Review of Geometric Transformations for Nonrigid Body Registration , 2008, IEEE Transactions on Medical Imaging.

[17]  Marc Niethammer,et al.  Semi-coupled dictionary learning for deformation prediction , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[18]  References , 1971 .

[19]  Snehashis Roy,et al.  Magnetic resonance image synthesis through patch regression , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[20]  Peter Lorenzen,et al.  Multi-modal image set registration and atlas formation , 2006, Medical Image Anal..

[21]  Daniel Rueckert,et al.  Diffeomorphic 3D Image Registration via Geodesic Shooting Using an Efficient Adjoint Calculation , 2011, International Journal of Computer Vision.

[22]  D. Louis Collins,et al.  Unbiased average age-appropriate atlases for pediatric studies , 2011, NeuroImage.

[23]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Stephen M. Pizer,et al.  2D/3D image registration using regression learning , 2013, Comput. Vis. Image Underst..

[25]  Snehashis Roy,et al.  Magnetic Resonance Image Example-Based Contrast Synthesis , 2013, IEEE Transactions on Medical Imaging.

[26]  Thomas Brox,et al.  Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[27]  Nassir Navab,et al.  Learning Optimization Updates for Multimodal Registration , 2016, MICCAI.

[28]  Xiao Yang,et al.  Fast Predictive Image Registration , 2016, LABELS/DLMIA@MICCAI.

[29]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[30]  P. Thomas Fletcher,et al.  A vector momenta formulation of diffeomorphisms for improved geodesic regression and atlas construction , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[31]  Alain Trouvé,et al.  Computing Large Deformation Metric Mappings via Geodesic Flows of Diffeomorphisms , 2005, International Journal of Computer Vision.

[32]  Sébastien Ourselin,et al.  Fast free-form deformation using graphics processing units , 2010, Comput. Methods Programs Biomed..

[33]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Nikos Paragios,et al.  Boosted metric learning for 3D multi-modal deformable registration , 2011, 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.