pix2xray: converting RGB images into X-rays using generative adversarial networks

Purpose: We propose a novel method for generating synthetic X-rays from 2D RGB images. The method creates accurate simulations for non-diagnostic visualization tasks in which the only input comes from a generic camera. Traditional methods are restricted to running simulation algorithms on 3D computer models. To remove this restriction, we generate synthetic X-rays using conditional generative adversarial networks (CGANs).

Methods: We build a custom synthetic X-ray dataset generator that produces image triplets, each consisting of an X-ray image, a pose image, and an RGB image, for natural hand poses sampled from the NYU hand pose dataset. This dataset is used to train two general-purpose CGANs, pix2pix and CycleGAN, as well as our novel architecture, pix2xray, which extends the pix2pix architecture by including the hand pose in the network.

Results: Our pix2xray architecture outperforms both pix2pix and CycleGAN, producing higher-quality X-ray images. Our approach scores highest on the similarity metrics, with pix2pix second and CycleGAN last. Our network also performs better in the difficult cases involving heavy occlusion, whether from the hand pose or from large rotations.

Conclusion: Our work establishes a baseline showing that synthetic X-rays can be simulated from 2D RGB input. We show that additional data such as the hand pose is needed to produce clearer results, and that future research should focus on more specialized architectures to improve overall image clarity and structure.
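To make the idea of an image similarity metric concrete, the sketch below scores a generated X-ray against a reference image. The abstract does not name the metrics used, so the two shown here, PSNR and a simplified single-window SSIM, are assumptions chosen for illustration. The function names `psnr` and `global_ssim` are hypothetical, and the global SSIM is a simplification of the usual sliding-window formulation, not the authors' evaluation code.

```python
import numpy as np


def psnr(reference, generated, data_range=1.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    ref = reference.astype(np.float64)
    gen = generated.astype(np.float64)
    mse = np.mean((ref - gen) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(data_range ** 2 / mse)


def global_ssim(reference, generated, data_range=1.0):
    """Simplified SSIM computed over the whole image as a single window.

    The standard metric averages this quantity over local windows;
    the global form is used here only to keep the sketch short.
    """
    x = reference.astype(np.float64)
    y = generated.astype(np.float64)
    # Stabilizing constants with the conventional K1 = 0.01, K2 = 0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = np.mean((x - mu_x) * (y - mu_y))
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    reference = rng.random((64, 64))  # stand-in for a ground-truth X-ray
    generated = np.clip(reference + rng.normal(0, 0.1, reference.shape), 0, 1)
    print("PSNR (dB):", psnr(reference, generated))
    print("global SSIM:", global_ssim(reference, generated))
```

Higher values of either score mean the generated image is closer to the reference. Comparing several generators in this way would rank them by their average score over a held-out test set.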
