The Prognostic Role of Genes with Skewed Expression Distribution in Lung Adenocarcinoma

Many studies assumed gene expression to be normally distributed. However, some were found to have left-skewed distribution, while others have right-skewed distribution. Here, we investigated the gene expression distribution of five lung adenocarcinoma data sets. We assumed that samples in the tail and non-tail of a skewed distribution were drawn from different populations with different survival outcomes. To investigate this hypothesis, skewed genes were detected to build a tail indicator matrix comprising of binary values. Survival analysis revealed that patients with more skewed genes in their tails had worse survival. Hierarchical clustering of the tail indicator matrices discovered a gene set with similar tail configurations for either left or right skewed genes. The two gene sets divided patients into three groups with different survivals. In conclusion, there is a direct association between genes with skewed distribution and the prognosis of lung adenocarcinoma patients.

[1]  Satoru Miyano,et al.  Identification of genes upregulated in ALK-positive and EGFR/KRAS/ALK-negative lung adenocarcinomas. , 2012, Cancer research.

[2]  Jiang Li,et al.  Large Scale Comparison of Gene Expression Levels by Microarrays and RNAseq Using TCGA Data , 2013, PloS one.

[3]  Robert J. Weil,et al.  Non-Gaussian Distributions Affect Identification of Expression Patterns, Functional Annotation, and Prospective Classification in Human Cancer Genomes , 2012, PloS one.

[4]  A. Bittner,et al.  Comparison of RNA-Seq and Microarray in Transcriptome Profiling of Activated T Cells , 2014, PloS one.

[5]  Igor Jurisica,et al.  Validation of a Histology-Independent Prognostic Gene Signature for Early-Stage, Non–Small-Cell Lung Cancer Including Stage IA Patients , 2014, Journal of thoracic oncology : official publication of the International Association for the Study of Lung Cancer.

[6]  Jennifer A. Mitchell,et al.  Concordance between RNA-sequencing data and DNA microarray data in transcriptome analysis of proliferative and quiescent fibroblasts , 2015, Royal Society Open Science.

[7]  Igor Jurisica,et al.  Gene expression–based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study , 2008, Nature Medicine.

[8]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[9]  Yu Wang,et al.  The effects and mechanisms of SLC34A2 in tumorigenesis and progression of human non-small cell lung cancer , 2015, Journal of Biomedical Science.

[10]  Steven Eschrich,et al.  Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma , 2015, Oncogene.

[11]  Derek Y. Chiang,et al.  Suppression of Lung Adenocarcinoma Progression by Nkx2-1 , 2011, Nature.

[12]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[13]  William C Hahn,et al.  Integrated cistromic and expression analysis of amplified NKX2-1 in lung adenocarcinoma identifies LMO3 as a functional transcriptional target. , 2013, Genes & development.

[14]  Ian W McKeague,et al.  Circulating Pro-Surfactant Protein B as a Risk Biomarker for Lung Cancer , 2013, Cancer Epidemiology, Biomarkers & Prevention.

[15]  Mary Goldman,et al.  The UCSC Cancer Genomics Browser: update 2015 , 2014, Nucleic Acids Res..

[16]  Nancy Stewart,et al.  EYFS Best Practice: All about … children's storymaking , 2014 .

[17]  Arjun Bhutkar,et al.  Foxa2 and Cdx2 cooperate with Nkx2-1 to inhibit lung adenocarcinoma metastasis , 2015, Genes & development.

[18]  Sanjay Mehrotra,et al.  Validation and characterization of DNA microarray gene expression data distribution and associated moments , 2010, BMC Bioinformatics.

[19]  Henrik J Ditzel,et al.  Analysis of GAGE, NY-ESO-1 and SP17 cancer/testis antigen expression in early stage non-small cell lung carcinoma , 2013, BMC Cancer.