Outcome signature genes in breast cancer: is there a unique set?

MOTIVATION Predicting the metastatic potential of primary malignant tissues has direct bearing on the choice of therapy. Several microarray studies yielded gene sets whose expression profiles successfully predicted survival. Nevertheless, the overlap between these gene sets is almost zero. Such small overlaps were observed also in other complex diseases, and the variables that could account for the differences had evoked a wide interest. One of the main open questions in this context is whether the disparity can be attributed only to trivial reasons such as different technologies, different patients and different types of analyses. RESULTS To answer this question, we concentrated on a single breast cancer dataset, and analyzed it by a single method, the one which was used by van't Veer et al. to produce a set of outcome-predictive genes. We showed that, in fact, the resulting set of genes is not unique; it is strongly influenced by the subset of patients used for gene selection. Many equally predictive lists could have been produced from the same analysis. Three main properties of the data explain this sensitivity: (1) many genes are correlated with survival; (2) the differences between these correlations are small; (3) the correlations fluctuate strongly when measured over different subsets of patients. A possible biological explanation for these properties is discussed. CONTACT eytan.domany@weizmann.ac.il SUPPLEMENTARY INFORMATION http://www.weizmann.ac.il/physics/complex/compphys/downloads/liate/

[1]  David E. Misek,et al.  Gene-expression profiles predict survival of patients with lung adenocarcinoma , 2002, Nature Medicine.

[2]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[4]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[5]  L. Staudt,et al.  The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. , 2002, The New England journal of medicine.

[6]  L Namba,et al.  The molecular biology of mammary intraepithelial neoplasia outgrowths , 2003, Breast Cancer Research.

[7]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[8]  Richard P. Ciavarra,et al.  Impact of the tumor microenvironment on host infiltrating cells and the efficacy of flt3-ligand combination immunotherapy evaluated in a treatment model of mouse prostate cancer , 2003, Cancer Immunology, Immunotherapy.

[9]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[10]  R. Spang,et al.  Predicting the clinical status of human breast cancer by using gene expression profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[11]  I. Hoffmann,et al.  Cell cycle regulation by the Cdc25 phosphatase family. , 2000, Progress in cell cycle research.

[12]  Robert Wechsler-Reya,et al.  BIN1 is a novel MYC–interacting protein with features of a tumour suppressor , 1996, Nature Genetics.

[13]  Carsten Peterson,et al.  Expression profiling to predict outcome in breast cancer: the influence of sample selection , 2002, Breast Cancer Research.

[14]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[15]  H. Nakshatri,et al.  Cloning of BRAK, a novel divergent CXC chemokine preferentially expressed in normal versus malignant cells. , 1999, Biochemical and biophysical research communications.

[16]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[17]  A. Harris,et al.  Subcellular localisation of cyclin B, Cdc2 and p21(WAF1/CIP1) in breast cancer. association with prognosis. , 2001, European journal of cancer.

[18]  Yudong D. He,et al.  Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer , 2001, Nature Biotechnology.

[19]  Ash A. Alizadeh,et al.  Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes. , 2004, The New England journal of medicine.

[20]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[22]  F. Bertucci,et al.  Gene expression profiling of primary breast carcinomas using arrays of candidate genes. , 2000, Human molecular genetics.

[23]  E. Lander,et al.  A molecular signature of metastasis in primary solid tumors , 2003, Nature Genetics.

[24]  N Andrieu,et al.  Strong association between c-myb and oestrogen-receptor expression in human breast cancer. , 1990, Oncogene.

[25]  M. Hung,et al.  Enhancement of Bik antitumor effect by Bik mutants. , 2003, Cancer research.

[26]  Danh V. Nguyen,et al.  Partial least squares proportional hazard regression for application to DNA microarray survival data , 2002, Bioinform..

[27]  Sara A Byron,et al.  Potential therapeutic strategies to interrupt insulin-like growth factor signaling in breast cancer. , 2003, Seminars in oncology.

[28]  Robert L Strausberg,et al.  Comparison of medulloblastoma and normal neural transcriptomes identifies a restricted set of activated genes , 2003, Oncogene.

[29]  Ryszard Maleszka,et al.  Microarray reality checks in the context of a complex disease , 2004, Nature Biotechnology.

[30]  M. Ffrench,et al.  ckshs expression is linked to cell proliferation in normal and malignant human lymphoid cells , 1999, International journal of cancer.

[31]  Sunil Singhal,et al.  Alterations in Cell Cycle Genes in Early Stage Lung Adenocarcinoma Identified by Expression Profiling , 2003, Cancer biology & therapy.

[32]  A. Hoeflich,et al.  IGF-binding protein-4: biochemical characteristics and functional consequences. , 2003, The Journal of endocrinology.

[33]  S. Dhanasekaran,et al.  The polycomb group protein EZH2 is involved in progression of prostate cancer , 2002, Nature.

[34]  E. Domany,et al.  Inhibition of p53-induced apoptosis without affecting expression of p53-regulated genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.