Fast and robust deconvolution of tumor infiltrating lymphocyte from expression profiles using least trimmed squares

Gene-expression deconvolution is used to quantify different types of cells in a mixed population. It provides a highly promising solution to rapidly characterize the tumor-infiltrating immune landscape and identify cold cancers. However, a major challenge is that gene-expression data are frequently contaminated by many outliers that decrease the estimation accuracy. Thus, it is imperative to develop a robust deconvolution method that automatically decontaminates data by reliably detecting and removing outliers. We developed a new machine learning tool, Fast And Robust DEconvolution of Expression Profiles (FARDEEP), to enumerate immune cell subsets from whole tumor tissue samples. To reduce noise in the tumor gene expression datasets, FARDEEP utilizes an adaptive least trimmed square to automatically detect and remove outliers before estimating the cell compositions. We show that FARDEEP is less susceptible to outliers and returns a better estimation of coefficients than the existing methods with both numerical simulations and real datasets. FARDEEP provides the absolute quantitation of each immune cell subset in addition to relative percentages. Hence, FARDEEP represents a novel robust algorithm to complement the existing toolkit for the characterization of tissue-infiltrating immune cell landscape. The source code for FARDEEP as implemented in R is available for download at https://goo.gl/SqGKuo.

[1]  Shuguang Huang,et al.  Comparative analysis and integrative classification of NCI60 cell lines and primary tumors using gene expression profiling data , 2006, BMC Genomics.

[2]  Mark M. Davis,et al.  Cell type–specific gene expression differences in complex tissues , 2010, Nature Methods.

[3]  J. Szustakowski,et al.  Optimal Deconvolution of Transcriptional Profiling Data Using Quadratic Programming with Application to Complex Clinical Blood Samples , 2011, PloS one.

[4]  R. Weinberg,et al.  Understanding the tumor immune microenvironment (TIME) for effective therapy , 2018, Nature Medicine.

[5]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[6]  Kun Huang,et al.  MMAD: microarray microdissection with analysis of differences is a computational tool for deconvoluting cell type-specific contributions from tissue samples , 2014, Bioinform..

[7]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[8]  A. Butte,et al.  Systematic pan-cancer analysis of tumour purity , 2015, Nature Communications.

[9]  Mechthild Krause,et al.  CD8+ tumour‐infiltrating lymphocytes in relation to HPV status and clinical outcome in patients with head and neck cancer after postoperative chemoradiotherapy: A multicentre study of the German cancer consortium radiation oncology group (DKTK‐ROG) , 2016, International journal of cancer.

[10]  Jeremy MG Taylor,et al.  Tumor infiltrating lymphocytes and survival in patients with head and neck squamous cell carcinoma , 2016, Head & neck.

[11]  PETER J. ROUSSEEUW,et al.  Computing LTS Regression for Large Data Sets , 2005, Data Mining and Knowledge Discovery.

[12]  Yan Guo,et al.  A Cell-Based Systems Biology Assessment of Human Blood to Monitor Immune Responses after Influenza Vaccination , 2015, PloS one.

[13]  G. Wolf,et al.  Telltale tumor infiltrating lymphocytes (TIL) in oral, head & neck cancer. , 2016, Oral oncology.

[14]  D. Speiser,et al.  Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data , 2017, bioRxiv.

[15]  Robert J. Lonigro,et al.  Integrative Clinical Genomics of Metastatic Cancer , 2017, Nature.

[16]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[17]  yang-xin fu,et al.  DNA sensing and immune responses in cancer therapy. , 2017, Current opinion in immunology.

[18]  I. Amit,et al.  Digital cell quantification identifies global immune cell dynamics during influenza infection , 2014, Molecular systems biology.

[19]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[20]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[21]  Ting Gong,et al.  DeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data , 2013, Bioinform..

[22]  Z. Trajanoski,et al.  Integrative Analyses of Colorectal Cancer Show Immunoscore Is a Stronger Predictor of Patient Survival Than Microsatellite Instability. , 2016, Immunity.

[23]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[24]  B. Ycart,et al.  Large-scale microarray profiling reveals four stages of immune escape in non-Hodgkin lymphomas , 2016, Oncoimmunology.

[25]  Christophe Croux,et al.  Sparse least trimmed squares regression for analyzing high-dimensional large data sets , 2013, 1304.4773.

[26]  S. Knuutila,et al.  Follicular Lymphoma Cell Lines, an In Vitro Model for Antigenic Selection and Cytokine‐Mediated Growth Regulation of Germinal Centre B Cells , 2003, Scandinavian journal of immunology.

[27]  Z. Modrušan,et al.  Deconvolution of Blood Microarray Data Identifies Cellular Activation Patterns in Systemic Lupus Erythematosus , 2009, PloS one.

[28]  Francesco Vallania,et al.  Leveraging heterogeneity across multiple datasets increases cell-mixture deconvolution accuracy and reduces biological and technical biases , 2018, Nature Communications.

[29]  Yiyuan She,et al.  Outlier Detection Using Nonconvex Penalized Regression , 2010, ArXiv.

[30]  Z. Trajanoski,et al.  Type, Density, and Location of Immune Cells Within Human Colorectal Tumors Predict Clinical Outcome , 2006, Science.

[31]  R. Weichselbaum,et al.  STING-Dependent Cytosolic DNA Sensing Promotes Radiation-Induced Type I Interferon-Dependent Antitumor Immunity in Immunogenic Tumors. , 2014, Immunity.

[32]  Gerhard Laschober,et al.  quanTIseq: quantifying immune contexture of human tumors , 2017 .

[33]  G. Getz,et al.  Inferring tumour purity and stromal and immune cell admixture from expression data , 2013, Nature Communications.

[34]  F. Dieli,et al.  Assessment of tumor-infiltrating TCRVγ9Vδ2 γδ lymphocyte abundance by deconvolution of human cancers microarrays , 2017, Oncoimmunology.

[35]  Qingming Huang,et al.  Exploring Outliers in Crowdsourced Ranking for QoE , 2017, ACM Multimedia.

[36]  Zlatko Trajanoski,et al.  In situ cytotoxic and memory T cells predict outcome in patients with early-stage colorectal cancer. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[37]  Weiping Zou,et al.  Chemokines in the cancer microenvironment and their relevance in cancer immunotherapy , 2017, Nature Reviews Immunology.

[38]  T. Gajewski,et al.  The host STING pathway at the interface of cancer and immunity. , 2016, The Journal of clinical investigation.

[39]  G. Wolf,et al.  Tumor infiltrating lymphocytes (TIL) and prognosis in oral cavity squamous carcinoma: a preliminary study. , 2015, Oral oncology.

[40]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[41]  J. Beal Biochemical complexity drives log-normal variation in genetic expression , 2017 .

[42]  Jun S. Liu,et al.  Comprehensive analyses of tumor immunity: implications for cancer immunotherapy , 2016, Genome Biology.

[43]  Mei Yu,et al.  PERT: A Method for Expression Deconvolution of Human Blood Samples from Varied Microenvironmental and Developmental Conditions , 2012, PLoS Comput. Biol..

[44]  A. Butte,et al.  xCell: digitally portraying the tissue cellular heterogeneity landscape , 2017, Genome Biology.

[45]  C. Rödel,et al.  Tumour-infiltrating lymphocytes predict response to definitive chemoradiotherapy in head and neck cancer , 2013, British Journal of Cancer.

[46]  S. Wright,et al.  CHEMTAX - a program for estimating class abundances from chemical markers: application to HPLC measurements of phytoplankton , 1996 .