Robust multi-tissue gene panel for cancer detection

BackgroundWe have identified a set of genes whose relative mRNA expression levels in various solid tumors can be used to robustly distinguish cancer from matching normal tissue. Our current feature set consists of 113 gene probes for 104 unique genes, originally identified as differentially expressed in solid primary tumors in microarray data on Affymetrix HG-U133A platform in five tissue types: breast, colon, lung, prostate and ovary. For each dataset, we first identified a set of genes significantly differentially expressed in tumor vs. normal tissue at p-value = 0.05 using an experimentally derived error model. Our common cancer gene panel is the intersection of these sets of significantly dysregulated genes and can distinguish tumors from normal tissue on all these five tissue types.MethodsFrozen tumor specimens were obtained from two commercial vendors Clinomics (Pittsfield, MA) and Asterand (Detroit, MI). Biotinylated targets were prepared using published methods (Affymetrix, CA) and hybridized to Affymetrix U133A GeneChips (Affymetrix, CA). Expression values for each gene were calculated using Affymetrix GeneChip analysis software MAS 5.0. We then used a software package called Genes@Work for differential expression discovery, and SVM light linear kernel for building classification models.ResultsWe validate the predictability of this gene list on several publicly available data sets generated on the same platform. Of note, when analysing the lung cancer data set of Spira et al, using an SVM linear kernel classifier, our gene panel had 94.7% leave-one-out accuracy compared to 87.8% using the gene panel in the original paper. In addition, we performed high-throughput validation on the Dana Farber Cancer Institute GCOD database and several GEO datasets.ConclusionsOur result showed the potential for this panel as a robust classification tool for multiple tumor types on the Affymetrix platform, as well as other whole genome arrays. Apart from possible use in diagnosis of early tumorigenesis, some other potential uses of our methodology and gene panel would be in assisting pathologists in diagnosis of pre-cancerous lesions, determining tumor boundaries, assessing levels of contamination in cell populations in vitro and identifying transformations in cell cultures after multiple passages. Moreover, based on the robustness of this gene panel in identifying normal vs. tumor, mislabelled or misinterpreted samples can be pinpointed with high confidence.

[1]  P. Sebastiani,et al.  Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer , 2007, Nature Medicine.

[2]  Gyan Bhanot,et al.  Abstract 1996: Molecular stratification of clear cell renal cell carcinoma by consensus clustering reveals distinct subtypes and survival patterns , 2010 .

[3]  Stefan Roepcke,et al.  Transcriptional census of 36 microdissected colorectal cancers yields a gene signature to distinguish UICC II and III , 2006, International journal of cancer.

[4]  J. Foekens,et al.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer , 2005, The Lancet.

[5]  Gustavo Stolovitzky,et al.  Genes@Work: an efficient algorithm for pattern discovery and multivariate feature selection in gene expression data , 2004, Bioinform..

[6]  Kathleen R. Cho,et al.  Fibroblast growth factor 9 has oncogenic activity and is a downstream target of Wnt signaling in ovarian endometrioid adenocarcinomas. , 2006, Cancer research.

[7]  J. Ross,et al.  MammaPrint™ 70-gene signature: another milestone in personalized medical care for breast cancer patients , 2009, Expert review of molecular diagnostics.

[8]  N. Gerry,et al.  Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data , 2003, BMC Cancer.

[9]  Q. Tan,et al.  Gene expression in epithelial ovarian cancer: a study of intratumor heterogeneity , 2006, International Journal of Gynecologic Cancer.

[10]  I. Gorlov,et al.  Housekeeping genes in prostate tumorigenesis , 2009, International journal of cancer.

[11]  L. Aaltonen,et al.  Serrated carcinomas form a subclass of colorectal cancer with distinct molecular basis , 2007, Oncogene.

[12]  W. Liang,et al.  9) TM4 Microarray Software Suite , 2006 .

[13]  W. Liang,et al.  TM4 microarray software suite. , 2006, Methods in enzymology.

[14]  Mogens Kruhøffer,et al.  Gene Expression in the Urinary Bladder , 2004, Cancer Research.

[15]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[16]  Donald Geman,et al.  Large-scale integration of cancer microarray data identifies a robust common cancer signature , 2007, BMC Bioinformatics.

[17]  Yue Xiong,et al.  p15PAF, a novel PCNA associated factor with increased expression in tumor tissues , 2001, Oncogene.

[18]  Yixin Wang,et al.  Novel Genes Associated with Malignant Melanoma but not Benign Melanocytic Lesions , 2005, Clinical Cancer Research.

[19]  Huiqing Liu,et al.  Identifying mRNA targets of microRNA dysregulated in cancer: with application to clear cell Renal Cell Carcinoma , 2010, BMC Systems Biology.

[20]  Yingdong Zhao,et al.  Common cancer biomarkers. , 2006, Cancer research.

[21]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.