Comprehensive Benchmarking and Integration of Tumour Microenvironment Cell Estimation Methods

The tumour microenvironment comprises complex cellular compositions and interactions between cancer, immune, and stromal components which all play crucial roles in cancer. Various computational approaches have been developed during the last decade that estimate the relative abundance of different cell types in an unbiased manner using bulk tumour RNA data. However, a comparison that objectively evaluates the performance of these approaches against one another has not been conducted. Here we benchmarked six widely used tools and gene sets: Bindea et al. gene sets, Davoli et al. gene sets, CIBERSORT, MCP-counter, TIMER, and xCell. We also introduce ConsensusTME, a consensus approach that uses the union of genes that the six tools used for cell estimation, and corrects for tumour-type specificity. We benchmarked the seven tools using TCGA DNA-derived purity scores (33 tumour-types), methylation-derived leukocyte scores (30 tumour-types), and H&E deep learning derived lymphocyte counts (13 tumour types), and individual benchmark data sets (PBMCs and 2 tumour types). Although none of the seven tools outperformed others in every benchmark, ConsensusTME ranked consistently well in all cancer-related benchmarks making it the top performing method overall. Computational methods that provide robust and accurate estimates of non-cancerous cell populations in the tumour microenvironment from tumour bulk expression data are important tools that can advance our understanding of tumour, immune, and stroma interactions, with potential clinical application if high accuracy estimates are achieved.

[1]  S. Elledge,et al.  Tumor aneuploidy correlates with markers of immune evasion and with reduced response to immunotherapy , 2017, Science.

[2]  A. Butte,et al.  xCell: digitally portraying the tissue cellular heterogeneity landscape , 2017, Genome Biology.

[3]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[4]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[5]  Z. Trajanoski,et al.  Quantifying tumor-infiltrating immune cells from transcriptomics data , 2018, Cancer Immunology, Immunotherapy.

[6]  Bernardo J. Clavijo,et al.  Rapid transcriptional plasticity of duplicated gene clusters enables a clonally reproducing aphid to colonise diverse plant species , 2017, Genome Biology.

[7]  Steven J. M. Jones,et al.  The Immune Landscape of Cancer , 2018, Immunity.

[8]  Rajarsi R. Gupta,et al.  Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images. , 2018, Cell reports.

[9]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[10]  Jeffrey A. Wiser,et al.  ImmPort: disseminating data to the public for the future of immunology , 2014, Immunologic Research.

[11]  Siyuan Zheng,et al.  Benchmarking: contexts and details matter , 2017, Genome Biology.

[12]  Ben S. Wittner,et al.  Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1 , 2009, Nature.

[13]  X. Shirley Liu,et al.  Revisit linear regression-based deconvolution methods for tumor gene expression data , 2017, Genome Biology.

[14]  Z. Trajanoski,et al.  Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer. , 2013, Immunity.

[15]  P. Laurent-Puig,et al.  Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression , 2016, Genome Biology.

[16]  Ash A. Alizadeh,et al.  Data normalization considerations for digital tumor dissection , 2017, Genome Biology.

[17]  Justin Guinney,et al.  GSVA: gene set variation analysis for microarray and RNA-Seq data , 2013, BMC Bioinformatics.

[18]  C. Tyler-Smith,et al.  Ancient DNA and the rewriting of human history: be sparing with Occam’s razor , 2016, Genome Biology.

[19]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[20]  G. Getz,et al.  Inferring tumour purity and stromal and immune cell admixture from expression data , 2013, Nature Communications.

[21]  Peter W. Laird,et al.  Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer , 2018, Cell.

[22]  Jun S. Liu,et al.  Comprehensive analyses of tumor immunity: implications for cancer immunotherapy , 2016, Genome Biology.

[23]  J. Mesirov,et al.  The Molecular Signatures Database Hallmark Gene Set Collection , 2015 .

[24]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.