RADIOHEAD: Radiogenomic analysis incorporating tumor heterogeneity in imaging through densities

Recent technological advancements have enabled detailed investigation of associations between the molecular architecture and tumor heterogeneity, through multi-source integration of radiological imaging and genomic (radiogenomic) data. In this paper, we integrate and harness radiogenomic data in patients with lower grade gliomas (LGG), a type of brain cancer, in order to develop a regression framework called RADIOHEAD (RADIOgenomic analysis incorporating tumor HEterogeneity in imAging through Densities) to identify radiogenomic associations. Imaging data is represented through voxel intensity probability density functions of tumor subregions obtained from multimodal magnetic resonance imaging, and genomic data through molecular signatures in the form of pathway enrichment scores corresponding to their gene expression profiles. Employing a Riemannian-geometric framework for principal component analysis on the set of probability densities functions, we map each probability density to a vector of principal component scores, which are then included as predictors in a Bayesian regression model with the pathway enrichment scores as the response. Variable selection compatible with the grouping structure amongst the predictors induced through the tumor sub-regions is carried out under a group spike-and-slab prior. A Bayesian false discovery rate mechanism is then used to infer significant associations based on the posterior distribution of the regression coefficients. Our analyses reveal several pathways relevant to LGG etiology (such as synaptic transmission, nerve impulse and neurotransmitter pathways), to have significant associations with the corresponding imaging-based predictors.

[1]  Anuj Srivastava,et al.  Statistical Shape Analysis , 2014, Computer Vision, A Reference Guide.

[2]  N. Stella,et al.  G protein-coupled receptors as oncogenic signals in glioma: Emerging therapeutic avenues , 2014, Neuroscience.

[3]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[4]  Bilwaj Gaonkar,et al.  GLISTRboost: Combining Multimodal MRI Segmentation, Registration, and Biophysical Tumor Growth Modeling with Gradient Boosting Machines for Glioma Segmentation , 2015, Brainles@MICCAI.

[5]  Antonio L Amelio,et al.  Comprehensive Molecular Characterization of Pheochromocytoma and Paraganglioma. , 2017, Cancer cell.

[6]  R. Wilson,et al.  Identification of a CpG island methylator phenotype that defines a distinct subgroup of glioma. , 2010, Cancer cell.

[7]  Jeffrey S. Morris,et al.  Quantile Function on Scalar Regression Analysis for Distributional Data , 2020, Journal of the American Statistical Association.

[8]  H. Karcher Riemannian center of mass and mollifier smoothing , 1977 .

[9]  Geon-Ho Jahng,et al.  True Progression versus Pseudoprogression in the Treatment of Glioblastomas: A Comparison Study of Normalized Cerebral Blood Volume and Apparent Diffusion Coefficient by Histogram Analysis , 2013, Korean journal of radiology.

[10]  Justin Guinney,et al.  GSVA: gene set variation analysis for microarray and RNA-Seq data , 2013, BMC Bioinformatics.

[11]  J. Huse,et al.  The Evolving Molecular Genetics of Low-grade Glioma , 2015, Advances in anatomic pathology.

[12]  C. R. Rao,et al.  Information and the Accuracy Attainable in the Estimation of Statistical Parameters , 1992 .

[13]  G. Schackert,et al.  Glutamate receptors in pediatric tumors of the central nervous system , 2010, Cancer biology & therapy.

[14]  Parag Mallick,et al.  Neuronal Activity Promotes Glioma Growth through Neuroligin-3 Secretion , 2015, Cell.

[15]  Anuj Srivastava,et al.  Riemannian Analysis of Probability Density Functions with Applications in Vision , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  S. Gabriel,et al.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. , 2010, Cancer cell.

[17]  Naveen N. Narisetty,et al.  Consistent Group Selection with Bayesian High Dimensional Modeling , 2020, Bayesian Analysis.

[18]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[19]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[20]  Nader Sanai,et al.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma. , 2010, World neurosurgery.

[21]  Jeffrey S. Morris,et al.  Wavelet‐based functional mixed models , 2006, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[22]  Christos Davatzikos,et al.  Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features , 2017, Scientific Data.

[23]  Anuj Srivastava,et al.  Robust Comparison of Kernel Densities on Spherical Domains , 2018, Sankhya A.

[24]  N. Just,et al.  Improving tumour heterogeneity MRI assessment with histograms , 2014, British Journal of Cancer.

[25]  S. Kurtek,et al.  DEMARCATE: Density-based magnetic resonance image clustering for assessing tumor heterogeneity in cancer , 2016, NeuroImage: Clinical.

[26]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[27]  Jeffrey S. Morris,et al.  Bayesian Analysis of Mass Spectrometry Proteomic Data Using Wavelet‐Based Functional Mixed Models , 2008, Biometrics.

[28]  D. Brat,et al.  Abstract 936: Comprehensive and integrative genomic characterization of diffuse lower grade gliomas , 2014 .

[29]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[30]  Martin A. Lindquist,et al.  Handbook of Neuroimaging Data Analysis , 2016 .

[31]  Helga Thorvaldsdóttir,et al.  Molecular signatures database (MSigDB) 3.0 , 2011, Bioinform..

[32]  Sebastian Kurtek,et al.  Bayesian sensitivity analysis with the Fisher–Rao metric , 2015 .

[33]  Jing Wang,et al.  LinkedOmics: analyzing multi-omics data within and across 32 cancer types , 2017, Nucleic Acids Res..

[34]  E. George,et al.  APPROACHES FOR BAYESIAN VARIABLE SELECTION , 1997 .

[35]  Steven J. M. Jones,et al.  Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma , 2016, Cell.

[36]  N. Just Histogram analysis of the microvasculature of intracerebral human and murine glioma xenografts , 2011, Magnetic resonance in medicine.

[37]  C. Robert,et al.  Optimal Sample Size for Multiple Testing : the Case of Gene Expression Mi roarraysPeter , 2004 .

[38]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[39]  J. S. Rao,et al.  Spike and slab variable selection: Frequentist and Bayesian strategies , 2005, math/0505633.

[40]  C. Crainiceanu,et al.  Statistical normalization techniques for magnetic resonance imaging , 2014, NeuroImage: Clinical.

[41]  Xiaofang Xu,et al.  Bayesian Variable Selection and Estimation for Group Lasso , 2015, 1512.01013.

[42]  K. Polyak,et al.  Intra-tumour heterogeneity: a looking glass for cancer? , 2012, Nature Reviews Cancer.

[43]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[44]  Adrian E. Raftery,et al.  Bayesian Model Averaging: A Tutorial , 2016 .

[45]  Lin Zhang,et al.  Bayesian hierarchical structured variable selection methods with application to molecular inversion probe studies in breast cancer , 2014 .

[46]  N. Čencov Statistical Decision Rules and Optimal Inference , 2000 .

[47]  Ole Winther,et al.  Bayesian Inference for Structured Spike and Slab Priors , 2014, NIPS.

[48]  Tong Zhou,et al.  Ion channel gene expression predicts survival in glioma patients , 2015, Scientific Reports.

[49]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[50]  Yu Liu,et al.  Inhibition of Glioma Cell Lysosome Exocytosis Inhibits Glioma Invasion , 2012, PloS one.

[51]  A. Bhattacharyya On a measure of divergence between two statistical populations defined by their probability distributions , 1943 .

[52]  Namkug Kim,et al.  Percent change of perfusion skewness and kurtosis: a potential imaging biomarker for early treatment response in patients with newly diagnosed glioblastomas. , 2012, Radiology.

[53]  L. Fahrmeir,et al.  Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models , 2011, 1105.5250.

[54]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .