Identification of aberrant chromosomal regions from gene expression microarray studies applied to human breast cancer

MOTIVATION In cancer, chromosomal imbalances like amplifications and deletions, or changes in epigenetic mechanisms like DNA methylation influence the transcriptional activity. These alterations are often not limited to a single gene but affect several genes of the genomic region and may be relevant for the disease status. For example, the ERBB2 amplicon (17q21) in breast cancer is associated with poor patient prognosis. We present a general, unsupervised method for genome-wide gene expression data to systematically detect tumor patients with chromosomal regions of distinct transcriptional activity. The method aims to find expression patterns of adjacent genes with a consistently decreased or increased level of gene expression in tumor samples. Such patterns have been found to be associated with chromosomal aberrations and clinical parameters like tumor grading and thus can be useful for risk stratification or therapy. RESULTS Our approach was applied to 12 independent human breast cancer microarray studies comprising 1422 tumor samples. We prioritized chromosomal regions and genes predominantly found across all studies. The result highlighted not only regions which are well known to be amplified like 17q21 and 11q13, but also others like 8q24 (distal to MYC) and 17q24-q25 which may harbor novel putative oncogenes. Since our approach can be applied to any microarray study it may become a valuable tool for the exploration of transcriptional changes in diverse disease types. AVAILABILITY The R source codes which implement the method and an exemplary analysis are available at http://www.dkfz.de/mga2/people/buness/CTP/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  P. Tan,et al.  Clinicopathological significance of metallothioneins in breast cancer , 2008, Pathology & Oncology Research.

[2]  R. Tibshirani,et al.  Outlier sums for differential gene expression analysis. , 2007, Biostatistics.

[3]  Ajay N. Jain,et al.  Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. , 2006, Cancer cell.

[4]  A. Callegaro,et al.  A locally adaptive statistical procedure (LAP) to identify differentially expressed chromosomal regions , 2006, Bioinform..

[5]  C. Pilarsky,et al.  A genome-wide map of aberrantly expressed chromosomal islands in colorectal cancer , 2006, Molecular Cancer.

[6]  David Levens,et al.  The FUSE/FBP/FIR/TFIIH system is a molecular machine programming a pulse of c‐myc expression , 2006, The EMBO journal.

[7]  Lothar Thiele,et al.  A systematic comparison and evaluation of biclustering methods for gene expression data , 2006, Bioinform..

[8]  Jun Yao,et al.  Combined cDNA array comparative genomic hybridization and serial analysis of gene expression analysis of breast tumor progression. , 2006, Cancer research.

[9]  E. Check Big money for cancer genomics , 2005, Nature.

[10]  I. Ellis,et al.  Total loss of MHC class I is an independent indicator of good prognosis in breast cancer , 2005, International journal of cancer.

[11]  P. Hall,et al.  An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12]  David Cameron,et al.  Identification of molecular apocrine breast tumours by microarray analysis , 2005, Oncogene.

[13]  Debashis Ghosh,et al.  A model-based scan statistic for identifying extreme chromosomal regions of gene expression in human tumors , 2005, Bioinform..

[14]  T. Barrette,et al.  Mining for regulatory programs in the cancer transcriptome , 2005, Nature Genetics.

[15]  Thomas Lengauer,et al.  Estimating cancer survival and clinical outcome based on genetic tumor progression scores , 2005, Bioinform..

[16]  Benjamin Georgi,et al.  BIOINFORMATICS APPLICATIONS NOTE Gene expression MACAT—microarray chromosome analysis tool , 2022 .

[17]  R. Kiyama,et al.  Expression profiling of the estrogen responsive genes in response to phytoestrogens using a customized DNA microarray , 2005, FEBS letters.

[18]  Howard Y. Chang,et al.  Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Mattias Höglund,et al.  Microarray analyses reveal strong influence of DNA copy number alterations on the transcriptional patterns in pancreatic cancer: implications for the interpretation of genomic amplifications , 2005, Oncogene.

[20]  J. Foekens,et al.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer , 2005, The Lancet.

[21]  Carlos Caldas,et al.  Identification and validation of prognostic markers in breast cancer with the complementary use of array‐CGH and tissue microarrays , 2005, The Journal of pathology.

[22]  P. Lichter,et al.  Candidate genes in breast cancer revealed by microarray-based comparative genomic hybridization of archived tissue. , 2005, Cancer research.

[23]  P. Hall,et al.  An expression signature for p 53 status in human breast cancer predicts mutation status , transcriptional effects , and patient survival , 2005 .

[24]  Sun-Yuan Kung,et al.  Accurate detection of aneuploidies in array CGH and gene expression microarray data , 2004, Bioinform..

[25]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[26]  Wei Wang,et al.  A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen. , 2004, Cancer cell.

[27]  David Botstein,et al.  Different gene expression patterns in invasive lobular and ductal carcinomas of the breast. , 2004, Molecular biology of the cell.

[28]  Jorma Isola,et al.  Patterns of chromosomal imbalances defines subgroups of breast cancer with distinct clinical features and prognosis. A study of 305 tumors by comparative genomic hybridization. , 2003, Cancer research.

[29]  Päivikki Kauraniemi,et al.  Amplification of a 280-kilobase core region at the ERBB2 locus leads to activation of two hypothetical proteins in breast cancer. , 2003, The American journal of pathology.

[30]  Thomas D. Wu,et al.  Genome-wide identification of chromosomal regions of increased tumor expression by transcriptome analysis. , 2003, Cancer research.

[31]  Philip M. Long,et al.  Breast cancer classification and prognosis based on gene expression profiles from a population-based study , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[32]  S. Lahiri Resampling Methods for Dependent Data , 2003 .

[33]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M. Maio,et al.  Methylation‐regulated expression of HLA class I antigens in melanoma , 2003, International journal of cancer.

[35]  M. West,et al.  Gene expression predictors of breast cancer outcomes , 2003, The Lancet.

[36]  M. West,et al.  GENE EXPRESSION PREDICTORS OF BREAST CANCER OUTCOMES ( revision of : PREDICTION OF BREAST CANCER STATES AND OUTCOMES BY INCORPORATING GENE EXPRESSION PROFILES ) , 2003 .

[37]  K. Furge,et al.  Identification of frequent cytogenetic aberrations in hepatocellular carcinoma using gene-expression microarray data , 2002, Genome Biology.

[38]  M. Ringnér,et al.  Impact of DNA amplification on gene expression patterns in breast cancer. , 2002, Cancer research.

[39]  Christian A. Rees,et al.  Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Roded Sharan,et al.  Discovering statistically significant biclusters in gene expression data , 2002, ISMB.

[41]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[42]  O. Monni,et al.  New amplified and highly expressed genes discovered in the ERBB2 amplicon in breast cancer by cDNA microarrays. , 2001, Cancer research.

[43]  G. Yang,et al.  DNA hypermethylation is a mechanism for loss of expression of the HLA class I genes in human esophageal squamous cell carcinomas. , 2001, Carcinogenesis.

[44]  R. Spang,et al.  Predicting the clinical status of human breast cancer by using gene expression profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[45]  T. Liesegang The human transcriptome map: Clustering of highly expressed genes in chromosomal domains. Caron H, ∗ van Schaik B, van der Mee M, et al. Science 2001;291:1289–1292. , 2001 .

[46]  Joe W. Gray,et al.  Quantitative analysis of chromosomal CGH in human breast tumors associates copy number abnormalities with p53 status and patient survival , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[47]  F. Baas,et al.  The Human Transcriptome Map: Clustering of Highly Expressed Genes in Chromosomal Domains , 2001, Science.

[48]  M. Hung,et al.  Grb2 downregulation leads to Akt inactivation in heregulin-stimulated and ErbB2-overexpressing breast cancer cells , 2000, Oncogene.

[49]  C. Theillet,et al.  Comparative genomic hybridization analysis of breast tumors with predetermined profiles of DNA amplification. , 1997, Cancer research.

[50]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[51]  I. Bièche,et al.  Two distinct amplified regions at 17q11-q21 involved in human primary breast cancer. , 1996, Cancer research.