Paper information - Single‐image Tomography: 3D Volumes from 2D Cranial X‐Rays

Single‐image Tomography: 3D Volumes from 2D Cranial X‐Rays

Because many different 3D volumes can produce the same 2D x‐ray image, inverting this process is challenging. We show that recent deep learning‐based convolutional neural networks can solve this task. As the main challenge in learning is the sheer amount of data created when extending the 2D image into a 3D volume, we suggest first learning a coarse, fixed‐resolution volume, which is then fused in a second step with the input x‐ray into a high‐resolution volume. To train and validate our approach, we introduce a new dataset that comprises close to half a million computer‐simulated 2D x‐ray images of 3D volumes scanned from 175 mammalian species. Future applications of our approach include stereoscopic rendering of legacy x‐ray images and re‐rendering of x‐rays with changes of illumination, view pose or geometry. Our evaluation includes a comparison to previous tomography work and to previous learning methods using our data, a user study, and an application to a set of real x‐rays.
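The ambiguity named in the first sentence of the abstract can be made concrete with a toy example. The sketch below assumes a deliberately simplified imaging model, an orthographic projection that sums absorption values along the depth axis; this is an illustration only and not the simulator used to build the paper's dataset. The function name `project` and the tiny 2x2x2 volumes are hypothetical.

```python
# Toy model (an assumption, not the paper's simulator): an orthographic
# x-ray is the sum of absorption values along the depth axis.

def project(volume):
    """Collapse a volume indexed [z][y][x] into a 2D image indexed [y][x]."""
    depth = len(volume)
    height = len(volume[0])
    width = len(volume[0][0])
    return [
        [sum(volume[z][y][x] for z in range(depth)) for x in range(width)]
        for y in range(height)
    ]

# Two 2x2x2 volumes that differ only in the order of their depth slices.
front = [[1, 0], [0, 2]]
back = [[0, 3], [4, 0]]
volume_a = [front, back]
volume_b = [back, front]

image_a = project(volume_a)
image_b = project(volume_b)

print(volume_a != volume_b)  # the two volumes are different
print(image_a == image_b)    # yet their projections are identical
```

Under this simplified model, any reordering of depth slices leaves the image unchanged, so a single image cannot determine the volume without prior knowledge, which is the gap a learned model has to fill.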
