A Multi-Pronged Evaluation For Image Normalization Techniques

While quantitative image features (radiomic) can be employed as informative indicators of disease progression, they are sensitive to variations in acquisition and reconstruction. Prior studies have demonstrated the ability to normalize heterogeneous scans using per-pixel metrics (e.g., mean squared error) and qualitative reader studies. However, the generalizability of these techniques and the impact of normalization on downstream tasks (e.g., classification) have been understudied. We present a multi-pronged evaluation by assessing image normalization techniques using 1) per-pixel image quality and perceptual metrics, 2) variability in radiomic features, and 3) task performance differences using a machine learning (ML) model. We evaluated a previously reported 3D generative adversarial network-based (GAN) approach, investigating its performance on low-dose computed tomography (CT) scans acquired at a different institution with varying dose levels and reconstruction kernels. While the 3D GAN achieved superior metric results, its impact on quantitative image features and downstream task performance did not result in universal improvement. These results suggest a more complicated relationship between CT acquisition and reconstruction parameters and their effect on radiomic features and ML model performance, which are not fully captured using per-pixel metrics alone. Our approach provides a more comprehensive picture of the effect of normalization.

[1]  Fei Gao,et al.  DeepSim: Deep similarity for image quality assessment , 2017, Neurocomputing.

[2]  Paul Babyn,et al.  Sharpness-Aware Low-Dose CT Denoising Using Conditional Generative Adversarial Network , 2017, Journal of Digital Imaging.

[3]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  J. E. van Timmeren,et al.  Influence of gray level discretization on radiomic feature stability for different CT scanners, tube currents and slice thicknesses: a comprehensive phantom study , 2017, Acta oncologica.

[5]  Zhengrong Liang,et al.  Sinogram noise reduction for low-dose CT by statistics-based nonlinear filters , 2005, SPIE Medical Imaging.

[6]  Georg Langs,et al.  Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem , 2020, European Radiology Experimental.

[7]  John Hoffman,et al.  Technical Note: FreeCT_wFBP: A robust, efficient, open-source implementation of weighted filtered backprojection for helical, fan-beam CT. , 2016, Medical physics.

[8]  Alvin C. Silva,et al.  Iterative Reconstruction Technique for Reducing Body Radiation Dose at Ct: Feasibility Study Hara Et Al. Ct Iterative Reconstruction Technique Gastrointestinal Imaging Original Research , 2022 .

[9]  Daniel C. Castro,et al.  Machine Learning with Multi-Site Imaging Data: An Empirical Study on the Impact of Scanner Effects , 2019, ArXiv.

[10]  Patrick Granton,et al.  Radiomics: extracting more information from medical images using advanced feature analysis. , 2012, European journal of cancer.

[11]  Jin Chen,et al.  STAN-CT: Standardizing CT Image using Generative Adversarial Network , 2020, ArXiv.

[12]  William Hsu,et al.  Using a Generative Adversarial Network for CT Normalization and Its Impact on Radiomic Features , 2020, 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI).

[13]  Guang Li,et al.  CT Super-Resolution GAN Constrained by the Identical, Residual, and Cycle Learning Ensemble (GAN-CIRCLE) , 2018, IEEE Transactions on Medical Imaging.

[14]  Xuanqin Mou,et al.  Low-Dose CT Image Denoising Using a Generative Adversarial Network With Wasserstein Distance and Perceptual Loss , 2017, IEEE Transactions on Medical Imaging.

[15]  Hong Cai,et al.  PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Massimo Bellomi,et al.  Radiomics: the facts and the challenges of image analysis , 2018, European Radiology Experimental.

[17]  P. Lambin,et al.  Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach , 2014, Nature Communications.

[18]  L. Lin,et al.  A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.

[19]  Andriy Fedorov,et al.  Computational Radiomics System to Decode the Radiographic Phenotype. , 2017, Cancer research.

[20]  Jin Chen,et al.  GANai: Standardizing CT Images using Generative Adversarial Network with Alternative Improvement , 2018, bioRxiv.

[21]  F Noo,et al.  TH-AB-207A-05: A Fully-Automated Pipeline for Generating CT Images Across a Range of Doses and Reconstruction Methods. , 2016, Medical physics.