arrayMap: A Reference Resource for Genomic Copy Number Imbalances in Human Malignancies

Background The delineation of genomic copy number abnormalities (CNAs) from cancer samples has been instrumental for identification of tumor suppressor genes and oncogenes and proven useful for clinical marker detection. An increasing number of projects have mapped CNAs using high-resolution microarray based techniques. So far, no single resource does provide a global collection of readily accessible oncogenomic array data. Methodology/Principal Findings We here present arrayMap, a curated reference database and bioinformatics resource targeting copy number profiling data in human cancer. The arrayMap database provides a platform for meta-analysis and systems level data integration of high-resolution oncogenomic CNA data. To date, the resource incorporates more than 40,000 arrays in 224 cancer types extracted from several resources, including the NCBI’s Gene Expression Omnibus (GEO), EBI’s ArrayExpress (AE), The Cancer Genome Atlas (TCGA), publication supplements and direct submissions. For the majority of the included datasets, probe level and integrated visualization facilitate gene level and genome wide data review. Results from multi-case selections can be connected to downstream data analysis and visualization tools. Conclusions/Significance To our knowledge, currently no data source provides an extensive collection of high resolution oncogenomic CNA data which readily could be used for genomic feature mining, across a representative range of cancer entities. arrayMap represents our effort for providing a long term platform for oncogenomic CNA data independent of specific platform considerations or specific project dependence. The online database can be accessed at http//www.arraymap.org.

[1]  M. Lubin,et al.  Selective Killing of Tumors Deficient in Methylthioadenosine Phosphorylase: A Novel Strategy , 2009, PloS one.

[2]  C. James,et al.  PTEN mutation, EGFR amplification, and outcome in patients with anaplastic astrocytoma and glioblastoma multiforme. , 2001, Journal of the National Cancer Institute.

[3]  Samuel Myllykangas,et al.  CanGEM: mining gene copy number changes in cancer , 2007, Nucleic Acids Res..

[4]  M. Shapero,et al.  High-resolution analysis of DNA copy number using oligonucleotide microarrays. , 2004, Genome research.

[5]  A. Krasinskas,et al.  CDKN2A and MTAP deletions in peritoneal mesotheliomas are correlated with loss of p16 protein expression and poor survival , 2010, Modern Pathology.

[6]  C. Croce,et al.  Human c-myc onc gene is located on the region of chromosome 8 that is translocated in Burkitt lymphoma cells. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Thomas D. Wu,et al.  Genetic Alterations and Oncogenic Pathways Associated with Breast Cancer Subtypes , 2009, Molecular Cancer Research.

[8]  R. Stallings Are chromosomal imbalances important in cancer? , 2007, Trends in genetics : TIG.

[9]  M. Wigler,et al.  PTEN, a Putative Protein Tyrosine Phosphatase Gene Mutated in Human Brain, Breast, and Prostate Cancer , 1997, Science.

[10]  J. Gearhart,et al.  Pluripotency redux--advances in stem-cell research. , 2007, The New England journal of medicine.

[11]  Jonathan W. Pillow,et al.  POSTER PRESENTATION Open Access , 2013 .

[12]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[13]  A. Friedman,et al.  Comparative genetic patterns of glioblastoma multiforme: potential diagnostic tool for tumor classification. , 2000, Neuro-oncology.

[14]  James Hadfield,et al.  The pitfalls of platform comparison: DNA copy number array technologies assessed , 2009, BMC Genomics.

[15]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[16]  Ibrahim Emam,et al.  ArrayExpress update—an archive of microarray and high-throughput sequencing-based functional genomics experiments , 2010, Nucleic Acids Res..

[17]  Webster K. Cavenee,et al.  Feedback Circuit among INK4 Tumor Suppressors Constrains Human Glioblastoma Development , 2008, Cancer cell.

[18]  N. Carter Methods and strategies for analyzing copy number variation using DNA microarrays , 2007, Nature Genetics.

[19]  Michael Baudis,et al.  Progenetix.net: an online repository for molecular cytogenetic aberration data , 2001, Bioinform..

[20]  J. Cigudosa,et al.  A large scale survey reveals that chromosomal copy-number alterations significantly affect gene modules involved in cancer initiation and progression , 2011, BMC Medical Genomics.

[21]  L. Chin,et al.  A comparison of DNA copy number profiling platforms. , 2008, Cancer research.

[22]  P. Rogan,et al.  Structural and genic characterization of stable genomic regions in breast cancer: Relevance to chemotherapy , 2012, Molecular oncology.

[23]  Xiao-Jun Ma,et al.  The HOXB7 protein renders breast cancer cells resistant to tamoxifen through activation of the EGFR pathway , 2011, Proceedings of the National Academy of Sciences.

[24]  H. Putter,et al.  Clinical and histologic characteristics of malignant melanoma in families with a germline mutation in CDKN2A. , 2011, Journal of the American Academy of Dermatology.

[25]  Wei Zhang,et al.  Comparison of the inhibitory effects of three transcriptional variants of CDKN2A in human lung cancer cell line A549 , 2010, Journal of experimental & clinical cancer research : CR.

[26]  Robert Tibshirani,et al.  Distinct patterns of DNA copy number alteration are associated with different clinicopathological features and gene‐expression subtypes of breast cancer , 2006, Genes, chromosomes & cancer.

[27]  Peter Kraft,et al.  Quality control and quality assurance in genotypic data for genome‐wide association studies , 2010, Genetic epidemiology.

[28]  Derek Y. Chiang,et al.  Characterizing the cancer genome in lung adenocarcinoma , 2007, Nature.

[29]  Christopher B. Miller,et al.  Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia , 2007, Nature.

[30]  Ajay N. Jain,et al.  Genomic and transcriptional aberrations linked to breast cancer pathophysiologies. , 2006, Cancer cell.

[31]  Zhi Chen,et al.  CaSNP: a database for interrogating copy number alterations of cancer genome from SNP array data , 2010, Nucleic Acids Res..

[32]  J. Fridlyand,et al.  Deletion of chromosome 11q predicts response to anthracycline-based chemotherapy in early breast cancer. , 2007, Cancer research.

[33]  M. Baudis Genomic imbalances in 5918 malignant epithelial tumors: an explorative meta-analysis of chromosomal CGH data , 2007, BMC Cancer.

[34]  K. Gunderson,et al.  Comparison of the Agilent, ROMA/NimbleGen and Illumina platforms for classification of copy number alterations in human breast tumors , 2008, BMC Genomics.

[35]  Terence P. Speed,et al.  A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 , 2009, Bioinform..

[36]  Ash A. Alizadeh,et al.  Genome-wide analysis of DNA copy number variation in breast cancer using DNA microarrays , 1999, Nature Genetics.

[37]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[38]  James H. Bullard,et al.  aroma.affymetrix: A generic framework in R for analyzing small to very large Affymetrix data sets in bounded memory , 2008 .

[39]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[40]  S. Knuutila,et al.  DNA copy number amplification profiling of human neoplasms , 2006, Oncogene.

[41]  Ash A. Alizadeh,et al.  Genome-wide analysis of DNA copy-number changes using cDNA microarrays , 1999, Nature Genetics.

[42]  D. Pinkel,et al.  Comparative Genomic Hybridization for Molecular Cytogenetic Analysis of Solid Tumors , 2022 .

[43]  Stefan Heinrichs,et al.  Identification of structural aberrations in cancer by SNP array analysis , 2007, Genome Biology.

[44]  O. Delattre,et al.  Homozygous PTEN deletion in neuroblastoma arising in a child with Cowden syndrome , 2011, American journal of medical genetics. Part A.

[45]  Y. Leea,et al.  Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular target , 2006 .

[46]  Daniel J. Park,et al.  19p13.1 is a triple-negative-specific breast cancer susceptibility locus. , 2012, Cancer research.

[47]  Dennis B. Troup,et al.  NCBI GEO: archive for functional genomics data sets—10 years on , 2010, Nucleic Acids Res..

[48]  Manuel Corpas,et al.  DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. , 2009, American journal of human genetics.

[49]  Terence P. Speed,et al.  A single-sample method for normalizing and combining full-resolution copy numbers from multiple platforms, labs and analysis methods , 2009, Bioinform..