Identifying reproducible cancer-associated highly expressed genes with important functional significances using multiple datasets

Identifying differentially expressed (DE) genes between cancer and normal tissues is of basic importance for studying cancer mechanisms. However, current methods, such as the commonly used Significance Analysis of Microarrays (SAM), are biased to genes with low expression levels. Recently, we proposed an algorithm, named the pairwise difference (PD) algorithm, to identify highly expressed DE genes based on reproducibility evaluation of top-ranked expression differences between paired technical replicates of cells under two experimental conditions. In this study, we extended the application of the algorithm to the identification of DE genes between two types of tissue samples (biological replicates) based on several independent datasets or sub-datasets of a dataset, by constructing multiple paired average gene expression profiles for the two types of samples. Using multiple datasets for lung and esophageal cancers, we demonstrated that PD could identify many DE genes highly expressed in both cancer and normal tissues that tended to be missed by the commonly used SAM. These highly expressed DE genes, including many housekeeping genes, were significantly enriched in many conservative pathways, such as ribosome, proteasome, phagosome and TNF signaling pathways with important functional significances in oncogenesis.

[1]  Gordon K. Smyth,et al.  Testing significance relative to a fold-change threshold is a TREAT , 2009, Bioinform..

[2]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[3]  Guy N. Brock,et al.  Empirical evaluation of consistency and accuracy of methods to detect differentially expressed genes based on microarray data , 2014, Comput. Biol. Medicine.

[4]  G. Otterson,et al.  Fanconi Anemia Repair Pathway Dysfunction, a Potential Therapeutic Target in Lung Cancer , 2014, Front. Oncol..

[5]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[6]  Zheng Guo,et al.  Identification of reproducible drug-resistance-related dysregulated genes in small-scale cancer cell line experiments , 2015, Scientific Reports.

[7]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[8]  Edwin Wang,et al.  Cancer systems biology in the genome sequencing era: part 1, dissecting and modeling of tumor clones and their networks. , 2013, Seminars in cancer biology.

[9]  Stefan F. T. Weiss,et al.  Downregulation of the Non-Integrin Laminin Receptor Reduces Cellular Viability by Inducing Apoptosis in Lung and Cervical Cancer Cells , 2013, PloS one.

[10]  B. Tilton,et al.  Housekeeping gene variability in normal and cancerous colorectal, pancreatic, esophageal, gastric and hepatic tissues. , 2005, Molecular and cellular probes.

[11]  M. Tschan,et al.  miR-29b Mediates NF-κB Signaling in KRAS-Induced Non-Small Cell Lung Cancers. , 2016, Cancer research.

[12]  Q. Tong,et al.  Inhibitory effect of ubiquitin-proteasome pathway on proliferation of esophageal carcinoma cells. , 2004, World journal of gastroenterology.

[13]  Genetic polymorphism rs3760396 of the chemokine (C-C motif) ligand 2 gene (CCL2) associated with the susceptibility of lung cancer in a pathological subtype-specific manner in Han-ancestry Chinese: a case control study , 2016, BMC Cancer.

[14]  V. Pankratz,et al.  Genetic variation in glutathione metabolism and DNA repair genes predicts survival of small-cell lung cancer patients. , 2010, Annals of oncology : official journal of the European Society for Medical Oncology.

[15]  N. Sunaga,et al.  Inhibition of L-type amino acid transporter 1 has antitumor activity in non-small cell lung cancer. , 2010, Anticancer research.

[16]  M. Hsiao,et al.  Endoplasmic reticulum ribosome-binding protein 1 (RRBP1) overexpression is frequently found in lung cancer patients and alleviates intracellular stress-induced apoptosis through the enhancement of GRP78 , 2013, Oncogene.

[17]  Chien-Jen Chen,et al.  Fanconi anemia genes in lung adenocarcinoma- a pathway-wide study on cancer susceptibility , 2016, Journal of Biomedical Science.

[18]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Jun Yu,et al.  How many human genes can be defined as housekeeping with current expression data? , 2008, BMC Genomics.

[20]  Tom C. T. Yin,et al.  A Flexible Renewal Process Simulator for Neural Spike Trains , 1976, IEEE Transactions on Biomedical Engineering.

[21]  Martin T. Suchorolski,et al.  Warburg and Crabtree Effects in Premalignant Barrett's Esophagus Cell Lines with Active Mitochondria , 2013, PloS one.

[22]  Jing Zhu,et al.  Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories , 2008, Bioinform..

[23]  Chang-Young Jang,et al.  RpS3, a DNA repair endonuclease and ribosomal protein, is involved in apoptosis , 2004, FEBS letters.

[24]  J. Vilček,et al.  Interleukin-6 induction by tumor necrosis factor and interleukin-1 in human fibroblasts involves activation of a nuclear factor binding to a kappa B-like sequence , 1990, Molecular and cellular biology.

[25]  W. V. van IJcken,et al.  Gene Expression-Based Classification of Non-Small Cell Lung Carcinomas and Survival Prediction , 2010, PloS one.

[26]  M. Burt,et al.  Glutathione metabolism in patients with non-small cell lung cancers. , 1997, Cancer research.

[27]  M. Stoppelli,et al.  Reversal of Warburg Effect and Reactivation of Oxidative Phosphorylation by Differential Inhibition of EGFR Signaling Pathways in Non–Small Cell Lung Cancer , 2015, Clinical Cancer Research.

[28]  G. Gibson,et al.  Skeletal muscle mRNA levels for cathepsin B, but not components of the ubiquitin-proteasome pathway, are increased in patients with lung cancer referred for thoracotomy. , 2002, Clinical science.

[29]  P. Wang,et al.  O-GlcNAcylation of G6PD promotes the pentose phosphate pathway and tumor growth , 2015, Nature Communications.

[30]  Chuhsing Kate Hsiao,et al.  Identification of a Novel Biomarker, SEMA5A, for Non–Small Cell Lung Carcinoma in Nonsmoking Women , 2010, Cancer Epidemiology, Biomarkers & Prevention.

[31]  M. Huang,et al.  Oxidative and endoplasmic reticulum stress signaling are involved in dehydrocostuslactone-mediated apoptosis in human non-small cell lung cancer cells. , 2010, Lung cancer.

[32]  H. Uramoto,et al.  Identification of ribosomal protein L19 as a novel tumor antigen recognized by autologous cytotoxic T lymphocytes in lung adenocarcinoma , 2010, Cancer science.

[33]  K. Kang,et al.  Fisetin induces apoptosis and endoplasmic reticulum stress in human non-small cell lung cancer through inhibition of the MAPK signaling pathway , 2016, Tumor Biology.

[34]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[35]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[36]  Nan Hu,et al.  Genome wide analysis of DNA copy number neutral loss of heterozygosity (CNNLOH) and its relation to gene expression in esophageal squamous cell carcinoma , 2010, BMC Genomics.

[37]  Lang He,et al.  Revealing weak differential gene expressions and their reproducible functions associated with breast cancer metastasis , 2012, Comput. Biol. Chem..

[38]  Edwin Wang,et al.  Cancer systems biology in the genome sequencing era: part 2, evolutionary dynamics of tumor clonal networks and drug resistance. , 2013, Seminars in cancer biology.

[39]  Qiuling Shi,et al.  Polymorphisms of methionine synthase and methionine synthase reductase and risk of lung cancer: a case–control analysis , 2005, Pharmacogenetics and genomics.

[40]  W. Chu Tumor necrosis factor. , 2013, Cancer letters.

[41]  G. Scagliotti Proteasome inhibitors in lung cancer. , 2006, Critical reviews in oncology/hematology.

[42]  A K Bahn,et al.  Application of binomial distribution to medicine: comparison of one sample proportion to an expected proportion (for small samples). Evaluation of a new treatment. Evaluation of a risk factor. , 1969, Journal of the American Medical Women's Association.

[43]  Liqing Zhou,et al.  Down‐regulation of 5S rRNA by miR‐150 and miR‐383 enhances c‐Myc–rpL11 interaction and inhibits proliferation of esophageal squamous carcinoma cells , 2015, FEBS letters.

[44]  S. Pyo,et al.  Prognostic significance and function of phosphorylated ribosomal protein S6 in esophageal squamous cell carcinoma , 2013, Modern Pathology.

[45]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[46]  Trygve Almøy,et al.  A Discussion Concerning the Inclusion of Variety Effect when Analysis of Variance is Used to Detect Differentially Expressed Genes , 2007, Gene regulation and systems biology.

[47]  E. Wang,et al.  Predictive genomics: a cancer hallmark network framework for predicting tumor clinical phenotypes using genome sequencing data. , 2014, Seminars in cancer biology.

[48]  R. Weinshilboum,et al.  Role of the glutathione metabolic pathway in lung cancer treatment and prognosis: a review. , 2006, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[49]  Jun Yu,et al.  Identification of Human HK Genes and Gene Expression Regulation Study in Cancer from Transcriptomics Data Analysis , 2013, PloS one.

[50]  C. Angeletti,et al.  67-Kilodalton laminin receptor expression correlates with worse prognostic indicators in non-small cell lung carcinomas. , 1997, Clinical cancer research : an official journal of the American Association for Cancer Research.

[51]  Jing Zhu,et al.  Viewing cancer genes from co-evolving gene modules , 2010, Bioinform..

[52]  Nan Hu,et al.  Identification of unique expression signatures and therapeutic targets in esophageal squamous cell carcinoma , 2012, BMC Research Notes.

[53]  J. Castle,et al.  Definition, conservation and epigenetics of housekeeping and tissue-enriched genes , 2009, BMC Genomics.

[54]  Stan Pounds,et al.  Assumption adequacy averaging as a concept for developing more robust methods for differential gene expression analysis , 2009, Comput. Stat. Data Anal..

[55]  Sachio Ito,et al.  Uncovering Direct Targets of MiR-19a Involved in Lung Cancer Progression , 2015, PloS one.

[56]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[57]  Lei Yang,et al.  Comparative analysis of housekeeping and tissue-selective genes in human based on network topologies and biological properties , 2016, Molecular Genetics and Genomics.

[58]  Yao Shen,et al.  Cancer Stem Cells in Small Cell Lung Cancer Cell Line H446: Higher Dependency on Oxidative Phosphorylation and Mitochondrial Substrate-Level Phosphorylation than Non-Stem Cancer Cells , 2016, PloS one.

[59]  Rainer Breitling,et al.  Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments , 2004, FEBS letters.

[60]  Li-Jen Su,et al.  Methylosome protein 50 promotes androgen- and estrogen-independent tumorigenesis. , 2014, Cellular signalling.

[61]  Edward S. C. Shih,et al.  Preservation of Ranking Order in the Expression of Human Housekeeping Genes , 2011, PloS one.

[62]  Thanos D Halazonetis,et al.  DNA replication stress as a hallmark of cancer. , 2015, Annual review of pathology.