Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging

Abstract

Motivation: Tumour heterogeneity is being increasingly recognized as an important characteristic of cancer and as a determinant of prognosis and treatment outcome. Emerging spatial transcriptomics data hold the potential to further our understanding of tumour heterogeneity and its implications. However, existing statistical tools are not sufficiently powerful to capture heterogeneity in the complex setting of spatial molecular biology.

Results: We provide a statistical solution, the HeTerogeneity Average index (HTA), specifically designed to handle the multivariate nature of spatial transcriptomics. We prove that HTA has an approximately normal distribution, therefore lending itself to efficient statistical assessment and inference. We first demonstrate that HTA accurately reflects the level of heterogeneity in simulated data. We then use HTA to analyze heterogeneity in two cancer spatial transcriptomics datasets: spatial RNA sequencing by 10x Genomics and spatial transcriptomics inferred from H&E. Finally, we demonstrate that HTA also applies to 3D spatial data using brain MRI. In spatial RNA sequencing, we use a known combination of molecular traits to assert that HTA aligns with the expected outcome for this combination. We also show that HTA captures immune-cell infiltration at multiple resolutions. In digital pathology, we show how HTA can be used in survival analysis and demonstrate that high levels of heterogeneity may be linked to poor survival. In brain MRI, we show that HTA differentiates between normal ageing, Alzheimer’s disease and two tumours. HTA also extends beyond molecular biology and medical imaging, and can be applied to many domains, including GIS.

Availability and implementation: Python package and source code are available at: https://github.com/alonalj/hta.

Supplementary information: Supplementary data are available at Bioinformatics online.
