Molecular classification of human carcinomas by use of gene expression signatures.

Classification of human tumors according to their primary anatomical site of origin is fundamental for the optimal treatment of patients with cancer. Here we describe the use of large-scale RNA profiling and supervised machine learning algorithms to construct a first-generation molecular classification scheme for carcinomas of the prostate, breast, lung, ovary, colorectum, kidney, liver, pancreas, bladder/ureter, and gastroesophagus, which collectively account for approximately 70% of all cancer-related deaths in the United States. The classification scheme was based on identifying gene subsets whose expression typifies each cancer class, and we quantified the extent to which these genes are characteristic of a specific tumor type by accurately and confidently predicting the anatomical site of tumor origin for 90% of 175 carcinomas, including 9 of 12 metastatic lesions. The predictor gene subsets include those whose expression is typical of specific types of normal epithelial differentiation, as well as other genes whose expression is elevated in cancer. This study demonstrates the feasibility of predicting the tissue origin of a carcinoma in the context of multiple cancer classes.
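The idea of per-class predictor gene subsets can be illustrated with a small sketch. The abstract does not name the scoring rule or the learning algorithm, so everything here is an assumption made for illustration: a one-vs-rest signal-to-noise score for picking marker genes, a nearest-centroid rule as a stand-in classifier, the function names `select_marker_genes`, `fit_centroids`, and `predict`, and the invented toy expression values.

```python
from statistics import mean, pstdev

def select_marker_genes(samples, labels, k):
    """Pick, per class, the k genes with the highest one-vs-rest score.

    The score (difference of means over summed spreads) is an assumed
    stand-in, not the scoring rule used in the study.
    """
    markers = {}
    for c in sorted(set(labels)):
        scores = []
        for g in range(len(samples[0])):
            inside = [s[g] for s, y in zip(samples, labels) if y == c]
            outside = [s[g] for s, y in zip(samples, labels) if y != c]
            spread = pstdev(inside) + pstdev(outside) + 1e-9  # avoid divide-by-zero
            scores.append(((mean(inside) - mean(outside)) / spread, g))
        markers[c] = [g for _, g in sorted(scores, reverse=True)[:k]]
    return markers

def fit_centroids(samples, labels, genes):
    """Mean expression of the selected genes within each class."""
    centroids = {}
    for c in sorted(set(labels)):
        rows = [s for s, y in zip(samples, labels) if y == c]
        centroids[c] = [mean(r[g] for r in rows) for g in genes]
    return centroids

def predict(sample, centroids, genes):
    """Assign the class whose centroid is closest (squared Euclidean distance)."""
    def dist(c):
        return sum((sample[g] - m) ** 2 for g, m in zip(genes, centroids[c]))
    return min(centroids, key=dist)

# Invented toy data: 4 genes per sample; gene 3 is uninformative by design.
samples = [
    [9, 1, 1, 5], [8, 2, 1, 5],   # "prostate"
    [1, 9, 2, 5], [2, 8, 1, 5],   # "breast"
    [1, 2, 9, 5], [2, 1, 8, 5],   # "lung"
]
labels = ["prostate", "prostate", "breast", "breast", "lung", "lung"]

markers = select_marker_genes(samples, labels, k=1)
genes = sorted({g for gs in markers.values() for g in gs})
centroids = fit_centroids(samples, labels, genes)
prediction = predict([7, 2, 2, 5], centroids, genes)
print(markers, prediction)
```

In a real setting the held-out accuracy would be estimated with cross-validation rather than by scoring a single hand-made sample, and the classifier would be whichever supervised method the study actually used.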
