Large-scale integration of cancer microarray data identifies a robust common cancer signature

BackgroundThere is a continuing need to develop molecular diagnostic tools which complement histopathologic examination to increase the accuracy of cancer diagnosis. DNA microarrays provide a means for measuring gene expression signatures which can then be used as components of genomic-based diagnostic tests to determine the presence of cancer.ResultsIn this study, we collect and integrate ~ 1500 microarray gene expression profiles from 26 published cancer data sets across 21 major human cancer types. We then apply a statistical method, referred to as the T op-S coring P air of G roups (TSPG) classifier, and a repeated random sampling strategy to the integrated training data sets and identify a common cancer signature consisting of 46 genes. These 46 genes are naturally divided into two distinct groups; those in one group are typically expressed less than those in the other group for cancer tissues. Given a new expression profile, the classifier discriminates cancer from normal tissues by ranking the expression values of the 46 genes in the cancer signature and comparing the average ranks of the two groups. This signature is then validated by applying this decision rule to independent test data.ConclusionBy combining the TSPG method and repeated random sampling, a robust common cancer signature has been identified from large-scale microarray data integration. Upon further validation, this signature may be useful as a robust and objective diagnostic test for cancer.

[1]  Rainer Spang,et al.  Detecting Common Gene Expression Patterns in Multiple Cancer Outcome Entities , 2005, Biomedical microdevices.

[2]  William Stafford Noble,et al.  Matrix2png: a utility for visualizing matrix data , 2003, Bioinform..

[3]  Daniel Q. Naiman,et al.  Classifying Gene Expression Profiles from Pairwise mRNA Comparisons , 2004, Statistical applications in genetics and molecular biology.

[4]  C Eng,et al.  Gene expression in papillary thyroid carcinoma reveals highly consistent profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Andrew I Su,et al.  Large scale molecular analysis identifies genes with altered expression in salivary adenoid cystic carcinoma. , 2002, The American journal of pathology.

[6]  Andrew I. Brooks,et al.  Computational method for reducing variance with Affymetrix microarrays , 2002, BMC Bioinformatics.

[7]  D. Koller,et al.  A module map showing conditional activity of expression modules in cancer , 2004, Nature Genetics.

[8]  Arie Perry,et al.  Molecular characterization of human meningiomas by gene expression profiling using high-density oligonucleotide microarrays. , 2002, The American journal of pathology.

[9]  L. Hood,et al.  Highly accurate two-gene classifier for differentiating gastrointestinal stromal tumors and leiomyosarcomas , 2007, Proceedings of the National Academy of Sciences.

[10]  A. Orth,et al.  Large-scale analysis of the human and mouse transcriptomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[11]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[12]  Doron Lancet,et al.  Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification , 2005, Bioinform..

[13]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[14]  Ignacio Wistuba,et al.  Malignant Pleural Mesothelioma , 2017 .

[15]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[16]  Paul S Mischel,et al.  Gene expression profiling identifies molecular subtypes of gliomas , 2003, Oncogene.

[17]  Paul G Gauger,et al.  Distinct transcriptional profiles of adrenocortical tumors uncovered by DNA microarray analysis. , 2003, The American journal of pathology.

[18]  N. Gerry,et al.  Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data , 2003, BMC Cancer.

[19]  Rork Kuick,et al.  Molecular profiling of pancreatic adenocarcinoma and chronic pancreatitis identifies multiple genes differentially regulated in pancreatic cancer. , 2003, Cancer research.

[20]  Daniel Q. Naiman,et al.  Simple decision rules for classifying human cancers from gene expression profiles , 2005, Bioinform..

[21]  E. Lander,et al.  Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Christine A Iacobuzio-Donahue,et al.  Exploration of global gene expression patterns in pancreatic adenocarcinoma using cDNA microarrays. , 2003, The American journal of pathology.

[23]  M. Bittner,et al.  Chromosomal aberrations and gene expression profiles in non-small cell lung cancer. , 2007, Lung cancer.

[24]  Yingdong Zhao,et al.  Common cancer biomarkers. , 2006, Cancer research.

[25]  Arie Perry,et al.  Comparative gene expression profile analysis of neurofibromatosis 1-associated and sporadic pilocytic astrocytomas. , 2002, Cancer research.

[26]  S. Dhanasekaran,et al.  Delineation of prognostic biomarkers in prostate cancer , 2001, Nature.

[27]  D. Lockhart,et al.  Analysis of gene expression profiles in normal and neoplastic ovarian tissue samples identifies candidate molecular markers of epithelial ovarian cancer. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[28]  J. Welsh,et al.  Molecular classification of human carcinomas by use of gene expression signatures. , 2001, Cancer research.

[29]  David E. Misek,et al.  Distinctive molecular profiles of high-grade and low-grade gliomas based on oligonucleotide microarray analysis. , 2001, Cancer research.

[30]  Ramon Silva Martins,et al.  Malignant pleural mesothelioma. , 2012, Journal of the National Comprehensive Cancer Network : JNCCN.

[31]  M. Bittner,et al.  Human prostate cancer and benign prostatic hyperplasia: molecular dissection by gene expression profiling. , 2001, Cancer research.

[32]  M. Becich,et al.  Gene expression alterations in prostate cancer predicting tumor aggression and preceding development of malignancy. , 2004, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[33]  Daniel Q. Naiman,et al.  Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data , 2005, Bioinform..

[34]  Robert C. Bast,et al.  Translational Crossroads for Biomarkers , 2005, Clinical Cancer Research.

[35]  Stefan Michiels,et al.  Prediction of cancer outcome with microarrays: a multiple random validation strategy , 2005, The Lancet.

[36]  Misao Ohki,et al.  Identification of a gene expression signature associated with pediatric AML prognosis. , 2003, Blood.

[37]  Raphael A Nemenoff,et al.  Tumorigenesis and Neoplastic Progression Analysis of Orthologous Gene Expression between Human Pulmonary Adenocarcinoma and a Carcinogen-Induced Murine Model , 2010 .

[38]  Emanuel Petricoin,et al.  Molecular profiling of human cancer , 2000, Nature Reviews Genetics.

[39]  Shuichi Tsutsumi,et al.  Global gene expression analysis of gastric cancer by oligonucleotide microarrays. , 2002, Cancer research.

[40]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[41]  T. Barrette,et al.  ONCOMINE: a cancer microarray database and integrated data-mining platform. , 2004, Neoplasia.

[42]  G. Stephanopoulos,et al.  A compendium of gene expression in normal human tissues. , 2001, Physiological genomics.

[43]  Olivier Poch,et al.  Identification of genes associated with tumorigenesis and metastatic potential of hypopharyngeal cancer by microarray analysis , 2004, Oncogene.

[44]  Torben F. Ørntoft,et al.  Identifying distinct classes of bladder carcinoma using microarrays , 2003, Nature Genetics.

[45]  Jeffrey R Marks,et al.  Gene Expression Patterns That Characterize Advanced Stage Serous Ovarian Cancers , 2004, The Journal of the Society for Gynecologic Investigation: JSGI.

[46]  John Crowley,et al.  Global gene expression profiling of multiple myeloma, monoclonal gammopathy of undetermined significance, and normal bone marrow plasma cells. , 2002, Blood.

[47]  Yixin Wang,et al.  Novel Genes Associated with Malignant Melanoma but not Benign Melanocytic Lesions , 2005, Clinical Cancer Research.

[48]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[49]  M. Terris,et al.  Gene expression patterns in renal cell carcinoma assessed by complementary DNA microarray. , 2003, The American journal of pathology.

[50]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..

[51]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[52]  D. Botstein,et al.  Gene expression patterns in human liver cancers. , 2002, Molecular biology of the cell.