Opportunities and Challenges of Synthetic Data Generation in Oncology.

Widespread interest in artificial intelligence (AI) in health care has focused mainly on deductive systems that analyze available real-world data to discover patterns not otherwise visible. Generative adversarial network, a new type of inductive AI, has recently evolved to generate high-fidelity virtual synthetic data (SD) trained on relatively limited real-world information. The AI system is fed with a collection of real data, and it learns to generate new augmented data while maintaining the general characteristics of the original data set. The use of SD to enhance clinical research and protect patient privacy has drawn a lot of interest in medicine and in the complex field of oncology. This article summarizes the main characteristics of this innovative technology and critically discusses how it can be used to accelerate data access for secondary purposes, providing an overview of the opportunities and challenges of SD generation for clinical cancer research and health care.

[1]  Scott R. Smith,et al.  Synthetic data in health care: A narrative review , 2023, PLOS digital health.

[2]  H. Gietema,et al.  Generation of synthetic ground glass nodules using generative adversarial networks (GANs) , 2022, European Radiology Experimental.

[3]  A. Krogh,et al.  Synthetic Data Generation By Artificial Intelligence to Accelerate Translational Research and Precision Medicine in Hematological Malignancies , 2022, Blood.

[4]  C. Angulo,et al.  Statistical Validation of Synthetic Data for Lung Cancer Patients Generated by Using Generative Adversarial Networks , 2022, Electronics.

[5]  Ming Y. Lu,et al.  Artificial intelligence for multimodal data integration in oncology. , 2022, Cancer cell.

[6]  K. El Emam,et al.  Synthetic data as an enabler for machine learning applications in medicine , 2022, iScience.

[7]  K. Harron,et al.  Synthetic data in medical research , 2022, BMJ medicine.

[8]  G. Callicó,et al.  Deep Convolutional Generative Adversarial Networks to Enhance Artificial Intelligence in Healthcare: A Skin Cancer Application , 2022, Sensors.

[9]  L. Saba,et al.  Generative Adversarial Networks in Brain Imaging: A Narrative Review , 2022, J. Imaging.

[10]  H. Kong,et al.  Colonoscopic image synthesis with generative adversarial network for enhanced detection of sessile serrated lesions using convolutional neural network , 2022, Scientific Reports.

[11]  Ben Glocker,et al.  Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging , 2021, Medical Image Anal..

[12]  J. Yong,et al.  Multi-omics Data Integration by Generative Adversarial Network , 2021, bioRxiv.

[13]  Ming Y. Lu,et al.  Synthetic data in machine learning for medicine and healthcare , 2021, Nature Biomedical Engineering.

[14]  Da Hyun Lee,et al.  Generative adversarial network for glioblastoma ensures morphologic variations and improves diagnostic model for isocitrate dehydrogenase mutant type , 2021, Scientific Reports.

[15]  K. El Emam,et al.  Can synthetic data be a proxy for real clinical trial data? A validation study , 2021, BMJ Open.

[16]  Ben Glocker,et al.  Perceived Realism of High-Resolution Generative Adversarial Network-derived Synthetic Mammograms. , 2021, Radiology: Artificial Intelligence.

[17]  João Paulo Papa,et al.  Assisting Barrett's esophagus identification using endoscopic data augmentation based on Generative Adversarial Networks , 2020, Comput. Biol. Medicine.

[18]  Tianfu Wang,et al.  GP-GAN: Brain tumor growth prediction using stacked 3D generative adversarial networks from longitudinal MR Images , 2020, Neural Networks.

[19]  Yiping Wang,et al.  Synthesis of diagnostic quality cancer pathology images by generative adversarial networks , 2020, The Journal of pathology.

[20]  Linda Coyle,et al.  Generation and evaluation of synthetic patient data , 2020, BMC Medical Research Methodology.

[21]  Keiichi I. Nakayama,et al.  Artificial intelligence in oncology , 2020, Cancer science.

[22]  Eli Konen,et al.  Creating Artificial Images for Radiology Applications Using Generative Adversarial Networks (GANs) - A Systematic Review. , 2020, Academic radiology.

[23]  J. Baumbach,et al.  The Economic Impact of Artificial Intelligence in Health Care: Systematic Review , 2020, Journal of medical Internet research.

[24]  Ender Konukoglu,et al.  Injecting and removing suspicious features in breast imaging with CycleGAN: A pilot study of automated adversarial attacks using neural networks on small images. , 2019, European journal of radiology.

[25]  E. Choi,et al.  Prediction of Hepatic Parenchymal Change in Gd-EOB-DTPA MR Images after Stereotactic Body Radiation Therapy by Cycle GAN Deep Neural Network , 2019, International Journal of Radiation Oncology, Biology, Physics.

[26]  Andre Esteva,et al.  A guide to deep learning in healthcare , 2019, Nature Medicine.

[27]  Fujio Toriumi,et al.  Generative Adversarial Networks for the Creation of Realistic Artificial Brain Magnetic Resonance Images , 2018, Tomography.

[28]  Isaac S Kohane,et al.  Artificial Intelligence in Healthcare , 2019, Artificial Intelligence and Machine Learning for Business for Non-Engineers.

[29]  Minseon Kim,et al.  An Improved Method for Prediction of Cancer Prognosis by Network Learning , 2018, Genes.

[30]  Sung Tae Kim,et al.  Diffusion radiomics as a diagnostic model for atypical manifestation of primary central nervous system lymphoma: development and multicenter external validation , 2018, Neuro-oncology.

[31]  S. Bini Artificial Intelligence, Machine Learning, Deep Learning, and Cognitive Computing: What Do These Terms Mean and How Will They Impact Health Care? , 2018, The Journal of arthroplasty.

[32]  Ahmed Hosny,et al.  Artificial intelligence in radiology , 2018, Nature Reviews Cancer.

[33]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[34]  S. Sunarti,et al.  Artificial intelligence in healthcare: opportunities and risk for future. , 2021, Gaceta sanitaria.