Responsible Radiomics Research for Faster Clinical Translation

It is now recognized that intratumoral heterogeneity is associated with more aggressive tumor phenotypes leading to poor patient outcomes (1). Medical imaging plays a central role in related investigations, because radiologic images are routinely acquired during cancer management. Imaging modalities such as 18F-FDG PET, CT, and MRI are minimally invasive and would constitute an immense source of potential data for decoding tumor phenotypes (2). Computer-aided diagnosis methods and systems exploiting medical images have been developed for decades, but their wide clinical implementation has been hampered by false-positive rates (3). As a consequence, routine clinical exploitation of images still consists mostly of visual or manual assessments. Today, the development of machine-learning techniques and the rise of computational power allow for the exploitation of a large number of quantitative features (4). This ability has led to a new incarnation of computer-aided diagnosis, “radiomics,” which refers to the characterization of tumor phenotypes via the extraction of highdimensional mineable data—for example, morphologic, intensitybased, fractal-based, and textural features—from medical images and whose subsequent analysis aims at supporting clinical decision making. A first proof-of-concept study dedicated to the prediction of tumor outcomes using PET radiomics-based multivariable models built via machine learning was published in 2009 (5). The term radiomics was then first used in 2010 to describe how imaging features can reflect gene expression (6). Other early radiomics studies followed (7,8), including some highlighting early on that the reliability of existing features is affected by acquisition protocol, reconstruction, test–retest consistency, preprocessing, and segmentation (9–13). The overall framework of radiomics was then explicitly described in 2012 (14), and in the years that followed, this emerging field experienced exponential growth (15). In the context of precision oncology, the radiomics workflow for the construction of predictive or prognostic models consists of 3 major steps (Fig. 1A): medical image acquisition, computation of radiomics features, and statistical analysis and machine learning. To apply the models to new patients for treatment personalization, a prospective model evaluation (preferably in a multicenter setup) is necessary. Radiomics research has already shown great promise for supporting clinical decision making. However, the fact that radiomicsbased strategies have not yet been translated to routine practice can be partly attributed to the low reproducibility of most current studies. The workflow for computing features is complex and involves many steps (Fig. 1B), often leading to incomplete reporting of methodologic information (e.g., texture matrix design choices and gray-level discretization methods). As a consequence, few radiomics studies in the current literature can be reproduced from start to end. Other major issues include the limited number of patients available for radiomics research, the high false-positive rates (similar to those of analogous computer-aided diagnosis methods), and the reporting of overly optimistic results, all of which affect the generalizability of the conclusions reached in current studies. Medical imaging journals are currently overwhelmed by a large volume of radiomics-related articles of variable quality and associated clinical value. The aim of this editorial is to present guidelines that we think can improve the reporting quality and therefore the reproducibility of radiomics studies, as well as the statistical quality of radiomics analyses. These guidelines can serve not only the authors of such studies but also the reviewers who assess their appropriateness for publication.

[1]  Robert J. Gillies,et al.  The effect of SUV discretization in quantitative FDG-PET Radiomics: the need for standardized methodology in tumor texture analysis , 2015, Scientific Reports.

[2]  G. Collins,et al.  Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): The TRIPOD statement , 2015, British Journal of Cancer.

[3]  P. Nowell Tumor progression: a brief historical perspective. , 2002, Seminars in cancer biology.

[4]  M. Hatt,et al.  Intratumor Heterogeneity Characterized by Textural Features on Baseline 18F-FDG PET Images Predicts Response to Concomitant Radiochemotherapy in Esophageal Cancer , 2011, The Journal of Nuclear Medicine.

[5]  Steffen Löck,et al.  Image biomarker standardisation initiative , 2016 .

[6]  M. Hatt,et al.  Reproducibility of Tumor Uptake Heterogeneity Characterization Through Textural Feature Analysis in 18F-FDG PET , 2012, The Journal of Nuclear Medicine.

[7]  L Cozzi,et al.  PET Radiomics in NSCLC: state of the art and a proposal for harmonization of methodology , 2017, Scientific Reports.

[8]  Issam El-Naqa,et al.  Exploring feature-based approaches in PET images for predicting cancer treatment outcomes , 2009, Pattern Recognit..

[9]  Dimitris Visvikis,et al.  Characterization of PET/CT images using texture analysis: the past, the present… any future? , 2016, European Journal of Nuclear Medicine and Molecular Imaging.

[10]  A. Zwanenburg,et al.  EP-1677: Multicentre initiative for standardisation of image biomarkers , 2017 .

[11]  Z. Obermeyer,et al.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. , 2016, The New England journal of medicine.

[12]  Patrick Granton,et al.  Radiomics: extracting more information from medical images using advanced feature analysis. , 2012, European journal of cancer.

[13]  R. Jeraj,et al.  Variability of textural features in FDG PET images due to different acquisition modes and reconstruction parameters , 2010, Acta oncologica.

[14]  M. Giger,et al.  Anniversary paper: History and status of CAD and quantitative image analysis: the role of Medical Physics and AAPM. , 2008, Medical physics.

[15]  M. Hatt,et al.  Robustness of intratumour 18F-FDG PET uptake heterogeneity quantification for therapy response prediction in oesophageal carcinoma , 2013, European Journal of Nuclear Medicine and Molecular Imaging.

[16]  R. Gillies,et al.  The biology underlying molecular imaging in oncology: from genome to anatome and back again. , 2010, Clinical radiology.

[17]  Mireille Garreau,et al.  Reproducibility of CT-based bone texture parameters of cancellous calf bone samples: influence of slice thickness. , 2008, European journal of radiology.

[18]  Paul Kinahan,et al.  Radiomics: Images Are More than Pictures, They Are Data , 2015, Radiology.

[19]  H. Aerts,et al.  Applications and limitations of radiomics , 2016, Physics in medicine and biology.

[20]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[21]  Andrzej Materka,et al.  Effects of MRI acquisition parameter variations and protocol heterogeneity on the results of texture analysis and pattern discrimination: an application-oriented study. , 2009, Medical physics.

[22]  Fei Yang,et al.  Quantitative radiomics: impact of stochastic effects on textural feature analysis implies the need for standards , 2015, Journal of medical imaging.

[23]  J. Bradley,et al.  Combined PET/CT image characteristics for radiotherapy tumor response in lung cancer. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.