GEM-DeCan: Improving tumor immune microenvironment profiling by the integration of novel gene expression and DNA methylation deconvolution signatures

Quantifying the proportion of the different cell types present in tumor biopsies remains a priority in cancer research. So far, a number of deconvolution methods have emerged for estimating cell composition using reference signatures either based on gene expression or on DNA methylation from purified cells. These two deconvolution approaches could be complementary to each other, leading to even more performant signatures, in cases where both data types are available. However, the potential relationship between signatures based on gene expression and those based on DNA methylation remains underexplored. Here we present five new deconvolution signature matrices, based on DNA methylation or RNAseq data, which can estimate the proportion of immune cells and cancer cells in a tumour sample. We test these signature matrices on available datasets for in-silico and in-vitro mixtures, peripheral blood, cancer samples from TCGA, bone marrow from multiple myeloma patients and a single-cell melanoma dataset. Cell proportions estimates based on deconvolution performed using our signature matrices, implemented within the EpiDISH framework, show comparable or better correlation with FACS measurements of immune cell-type abundance and with various estimates of cancer sample purity and composition than existing methods. Finally, using publicly available data of 3D chromatin structure in haematopoietic cells, we expanded the list of genes to be included in the RNAseq signature matrices by considering the presence of methylated CpGs in gene promoters or in genomic regions which are in 3D contact with these promoters. Our expanded signature matrices have improved performance compared to our initial RNAseq signature matrix. Finally, we show the value of our signature matrices in predicting patient response to immune checkpoint inhibitors in three melanoma and one bladder cancer cohort, based on bulk tumour sample gene expression data. We also provide GEM-DeCan: a snakemake pipeline, able to run an analysis from raw sequencing data to deconvolution based on various gene expression signature matrices, both for bulk RNASeq and DNA methylation data. The code for producing the signature matrices and reproducing all the figures of this paper is available on GitHub: https://github.com/VeraPancaldiLab/GEMDeCan.

[1]  P. Lacal,et al.  Targeting Tumor-Associated Macrophages to Increase the Efficacy of Immune Checkpoint Inhibitors: A Glimpse into Novel Therapeutic Approaches for Metastatic Melanoma , 2020, Cancers.

[2]  I. El Naqa,et al.  Characterization of the Tumor Immune Microenvironment Identifies M0 Macrophage-Enriched Cluster as a Poor Prognostic Factor in Hepatocellular Carcinoma. , 2020, JCO clinical cancer informatics.

[3]  Y. Liao,et al.  Macrophage correlates with immunophenotype and predicts anti-PD-L1 response of urothelial cancer , 2020, Theranostics.

[4]  M. Carleton,et al.  CXCR1 and CXCR2 Chemokine Receptor Agonists Produced by Tumors Induce Neutrophil Extracellular Traps that Interfere with Immune Cytotoxicity. , 2020, Immunity.

[5]  E. Raineri,et al.  Using GARDEN-NET and ChAseR to explore human haematopoietic 3D chromatin interaction networks , 2020, Nucleic acids research.

[6]  R. Deberardinis Tumor Microenvironment, Metabolism, and Immunotherapy. , 2020, The New England journal of medicine.

[7]  J. Wargo,et al.  B cells are associated with survival and immunotherapy response in sarcoma , 2020, Nature.

[8]  Jeffrey E. Lee,et al.  B cells and tertiary lymphoid structures promote immunotherapy response , 2020, Nature.

[9]  D. Schadendorf,et al.  Tertiary lymphoid structures improve immunotherapy and survival in melanoma , 2020, Nature.

[10]  M. Shaul,et al.  Tumour-associated neutrophils in patients with cancer , 2019, Nature Reviews Clinical Oncology.

[11]  Z. Trajanoski,et al.  Next-generation computational tools for interrogating cancer immunity , 2019, Nature Reviews Genetics.

[12]  E. Heard,et al.  Advances in epigenetics link genetics to the environment and disease , 2019, Nature.

[13]  A. Huttenlocher,et al.  Neutrophil plasticity in the tumor microenvironment. , 2019, Blood.

[14]  Joshua M. Korn,et al.  Next-generation characterization of the Cancer Cell Line Encyclopedia , 2019, Nature.

[15]  M. Ringnér,et al.  Partially methylated domains are hypervariable in breast cancer and fuel widespread CpG island hypermethylation , 2018, Nature Communications.

[16]  J. Varner,et al.  Targeting Tumor-Associated Macrophages in Cancer. , 2019, Trends in immunology.

[17]  M. Ceccarelli,et al.  RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types , 2019, Cell reports.

[18]  J. Madore,et al.  Distinct Immune Cell Populations Define Response to Anti-PD-1 Monotherapy and Anti-PD-1/Anti-CTLA-4 Combined Therapy. , 2019, Cancer cell.

[19]  M. Smyth,et al.  Cancer immunoediting and resistance to T cell-based immunotherapy , 2018, Nature Reviews Clinical Oncology.

[20]  E. King,et al.  Author Correction: Pan-cancer deconvolution of tumour composition using DNA methylation , 2018, Nature Communications.

[21]  J. Allison,et al.  Fundamental Mechanisms of Immune Checkpoint Blockade Therapy. , 2018, Cancer discovery.

[22]  Marieke E. Ijsselsteijn,et al.  Molecular and pharmacological modulators of the tumor immune contexture revealed by deconvolution of RNA-seq data , 2018, bioRxiv.

[23]  E. King,et al.  Pan-cancer deconvolution of tumour composition using DNA methylation , 2018, Nature Communications.

[24]  Liguo Zhang,et al.  Unifying cancer and normal RNA sequencing data from different sources , 2018, Scientific Data.

[25]  P. L. Bergsagel,et al.  Dysregulated IL-18 Is a Key Driver of Immunosuppression and a Possible Therapeutic Target in the Multiple Myeloma Microenvironment. , 2018, Cancer cell.

[26]  Rajarsi R. Gupta,et al.  Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images. , 2018, Cell reports.

[27]  R. Weinberg,et al.  Understanding the tumor immune microenvironment (TIME) for effective therapy , 2018, Nature Medicine.

[28]  Sven Rahmann,et al.  Genome analysis , 2022 .

[29]  T. Chan,et al.  Tumor and Microenvironment Evolution during Immunotherapy with Nivolumab , 2017, Cell.

[30]  B. Christensen,et al.  Cell-type deconvolution from DNA methylation: a review of recent applications , 2017, Human molecular genetics.

[31]  M. Vignali,et al.  Contribution of systemic and somatic factors to clinical response and resistance to PD-L1 blockade in urothelial cancer: An exploratory multi-omic analysis , 2017, PLoS medicine.

[32]  D. Speiser,et al.  Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data , 2017, bioRxiv.

[33]  P. Greenberg,et al.  Obstacles Posed by the Tumor Microenvironment to T cell Activity: A Case for Synergistic Therapies. , 2017, Cancer cell.

[34]  Andrew E. Teschendorff,et al.  A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies , 2017, BMC Bioinformatics.

[35]  J. Wargo,et al.  Primary, Adaptive, and Acquired Resistance to Cancer Immunotherapy , 2017, Cell.

[36]  Alberto Mantovani,et al.  Tumour-associated macrophages as treatment targets in oncology , 2017, Nature Reviews Clinical Oncology.

[37]  P. Murray Macrophage Polarization. , 2017, Annual review of physiology.

[38]  Thomas Lengauer,et al.  DNA Methylation Dynamics of Human Hematopoietic Stem Cell Differentiation , 2016, Cell stem cell.

[39]  Steven J. M. Jones,et al.  The International Human Epigenome Consortium: A Blueprint for Scientific Collaboration and Discovery , 2016, Cell.

[40]  Jonathan M. Cairns,et al.  Lineage-Specific Genome Architecture Links Enhancers and Non-coding Disease Variants to Target Gene Promoters , 2016, Cell.

[41]  Alfonso Valencia,et al.  Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types , 2016, bioRxiv.

[42]  P. Laurent-Puig,et al.  Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression , 2016, Genome Biology.

[43]  M. Pittet,et al.  The role of myeloid cells in cancer therapies , 2016, Nature Reviews Cancer.

[44]  Hui Zhang,et al.  Nanoparticle Targeting of Neutrophils for Improved Cancer Immunotherapy , 2016, Advanced healthcare materials.

[45]  Charles H. Yoon,et al.  Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq , 2016, Science.

[46]  J. Sosman,et al.  Genomic and Transcriptomic Features of Response to Anti-PD-1 Therapy in Metastatic Melanoma , 2016, Cell.

[47]  Devin C. Koestler,et al.  Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL) , 2016, BMC Bioinformatics.

[48]  Jonathan M. Cairns,et al.  CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data , 2015, Genome Biology.

[49]  A. Butte,et al.  Systematic pan-cancer analysis of tumour purity , 2015, Nature Communications.

[50]  Cameron S. Osborne,et al.  The pluripotent regulatory circuitry connecting promoters to their long-range interacting elements , 2015, Genome research.

[51]  L. Levy,et al.  Phenotypic diversity and plasticity in circulating neutrophil subpopulations in cancer. , 2015, Cell reports.

[52]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[53]  Martin J. Aryee,et al.  Coverage recommendations for methylation analysis by whole genome bisulfite sequencing , 2014, Nature Methods.

[54]  Jonathan M. Cairns,et al.  Robust Detection of DNA Looping Interactions in Capture HiC data , 2015 .

[55]  M. Samur RTCGAToolbox: A New Tool for Exporting TCGA Firehose Data , 2014, PloS one.

[56]  Winston Timp,et al.  Large hypomethylated blocks as a universal defining epigenetic alteration in human solid tumors , 2014, Genome Medicine.

[57]  Martin J. Aryee,et al.  Epigenome-wide association studies without the need for cell-type composition , 2014, Nature Methods.

[58]  E. Andres Houseman,et al.  Reference-free cell mixture adjustments in analysis of DNA methylation data , 2014, Bioinform..

[59]  G. Getz,et al.  Inferring tumour purity and stromal and immune cell admixture from expression data , 2013, Nature Communications.

[60]  S. Shen-Orr,et al.  Computational deconvolution: extracting cell type-specific information from heterogeneous samples. , 2013, Current opinion in immunology.

[61]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[62]  Ting Gong,et al.  DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data , 2013, Bioinform..

[63]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[64]  Devin C. Koestler,et al.  DNA methylation arrays as surrogate measures of cell mixture distribution , 2012, BMC Bioinformatics.

[65]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[66]  G. Natoli,et al.  Transcriptional regulation of macrophage polarization: enabling diversity with identity , 2011, Nature Reviews Immunology.

[67]  Jane E. Visvader,et al.  Cells of origin in cancer , 2011, Nature.

[68]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.