Association of imputed prostate cancer transcriptome with disease risk reveals novel mechanisms

Here we train cis-regulatory models of prostate tissue gene expression and impute expression transcriptome-wide for 233,955 European ancestry men (14,616 prostate cancer (PrCa) cases, 219,339 controls) from two large cohorts. Among 12,014 genes evaluated in the UK Biobank, we identify 38 associated with PrCa, many replicating in the Kaiser Permanente RPGEH. We report the association of elevated TMPRSS2 expression with increased PrCa risk (independent of a previously-reported risk variant) and with increased tumoral expression of the TMPRSS2:ERG fusion-oncogene in The Cancer Genome Atlas, suggesting a novel germline-somatic interaction mechanism. Three novel genes, HOXA4, KLK1, and TIMM23, additionally replicate in the RPGEH cohort. Furthermore, 4 genes, MSMB, NCOA4, PCAT1, and PPP1R14A, are associated with PrCa in a trans-ethnic meta-analysis (N = 9117). Many genes exhibit evidence for allele-specific transcriptional activation by PrCa master-regulators (including androgen receptor) in Position Weight Matrix, Chip-Seq, and Hi-C experimental data, suggesting common regulatory mechanisms for the associated genes.In prostate cancer, investigating aberrant gene expression may shed light on disease etiology. Here, the authors imputed expression transcriptome-wide for 233,955 European ancestry men, discovering and replicating the associations between prostatic expression for select genes and prostate cancer risk, including the highly prevalent gene fusion partner TMPRSS2. The authors furthermore integrate diverse functional genomic datasets to interpret the epigenetic mechanisms by which the implicated risk variants and genes modulate disease risk.

[1]  Peter Kraft,et al.  Association of Prostate Cancer Risk Variants with Gene Expression in Normal and Tumor Tissue , 2014, Cancer Epidemiology, Biomarkers & Prevention.

[2]  J. Marchini,et al.  Genotype Imputation with Thousands of Genomes , 2011, G3: Genes | Genomes | Genetics.

[3]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[4]  P. Leung,et al.  Cell Motility and Spreading Are Suppressed by HOXA4 in Ovarian Cancer Cells: Possible Involvement of β1 Integrin , 2008, Molecular Cancer Research.

[5]  Zhiyong Lu,et al.  tmVar 2.0: integrating genomic variant information from literature with dbSNP and ClinVar for precision medicine , 2018, Bioinform..

[6]  P. Donnelly,et al.  Genome-wide genetic data on ~500,000 UK Biobank participants , 2017, bioRxiv.

[7]  John T. Wei,et al.  Transcriptome sequencing across a prostate cancer cohort identifies PCAT-1, an unannotated lincRNA implicated in disease progression , 2011, Nature Biotechnology.

[8]  Ming Tang,et al.  TumorFusions: an integrative resource for cancer-associated transcript fusions , 2017, Nucleic Acids Res..

[9]  Peter Kraft,et al.  A meta-analysis of 87,040 individuals identifies 23 new susceptibility loci for prostate cancer , 2014, Nature Genetics.

[10]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[11]  Simon G. Coetzee,et al.  Comprehensive Functional Annotation of 77 Prostate Cancer Risk Loci , 2014, PLoS genetics.

[12]  Jeff S. Jasper,et al.  ELF3 is a repressor of androgen receptor action in prostate cancer cells , 2014, Oncogene.

[13]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[14]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[15]  Asha A. Nair,et al.  Identification of candidate genes for prostate cancer-risk SNPs utilizing a normal prostate tissue eQTL data set , 2015, Nature Communications.

[16]  S. Thibodeau,et al.  Chromatin interactions and candidate genes at ten prostate cancer risk loci , 2016, Scientific Reports.

[17]  Y. Hayashizaki,et al.  FOXP1 is an androgen-responsive transcription factor that negatively regulates androgen receptor signaling in prostate cancer cells. , 2008, Biochemical and biophysical research communications.

[18]  S. Natsugoe,et al.  ZFP36L2 promotes cancer cell aggressiveness and is regulated by antitumor microRNA‐375 in pancreatic ductal adenocarcinoma , 2017, Cancer science.

[19]  Stefan Nickels,et al.  Haplotype reference consortium panel: Practical implications of imputations with large reference panels , 2017, Human mutation.

[20]  Judy H. Cho,et al.  Transcriptional Risk Scores link GWAS to eQTL and Predict Complications in Crohn's Disease , 2017, Nature Genetics.

[21]  M. van Iterson,et al.  Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution , 2016, Genome Biology.

[22]  G. Coetzee,et al.  4C-seq revealed long-range interactions of a functional enhancer at the 8q24 prostate cancer risk locus , 2016, Scientific Reports.

[23]  Pui-Yan Kwok,et al.  Genome-wide association study of prostate-specific antigen levels identifies novel loci independent of prostate cancer , 2017, Nature Communications.

[24]  H. Zhang,et al.  Knockdown of zinc finger protein X-linked inhibits prostate cancer cell proliferation and induces apoptosis by activating caspase-3 and caspase-9 , 2012, Cancer Gene Therapy.

[25]  M. Bollen,et al.  The gene encoding the prostatic tumor suppressor PSP94 is a target for repression by the Polycomb group protein EZH2 , 2007, Oncogene.

[26]  Steven J. M. Jones,et al.  The Molecular Taxonomy of Primary Prostate Cancer , 2015, Cell.

[27]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[28]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[29]  Thawfeek M. Varusai,et al.  The Reactome Pathway Knowledgebase , 2017, Nucleic acids research.

[30]  Martin Vingron,et al.  Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions , 2008, PLoS Comput. Biol..

[31]  Hongshan Zhao,et al.  Differential expression of MST4, STK25 and PDCD10 between benign prostatic hyperplasia and prostate cancer. , 2014, International journal of clinical and experimental pathology.

[32]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[33]  J. Tchinda,et al.  Recurrent Fusion of TMPRSS2 and ETS Transcription Factor Genes in Prostate Cancer , 2005, Science.

[34]  W. Hahn,et al.  Genetic and functional analyses implicate the NUDT11, HNF1B, and SLC22A3 genes in prostate cancer pathogenesis , 2012, Proceedings of the National Academy of Sciences.

[35]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[36]  A. Jemal,et al.  Cancer statistics, 2017 , 2017, CA: a cancer journal for clinicians.

[37]  Ye Ding,et al.  Identification of novel genes that regulate androgen receptor signaling and growth of androgen-deprived prostate cancer cells , 2015, Oncotarget.

[38]  Mary Goldman,et al.  The UCSC Cancer Genomics Browser: update 2015 , 2014, Nucleic Acids Res..

[39]  Ramana V. Davuluri,et al.  Identification and validation of regulatory SNPs that modulate transcription factor chromatin binding and gene expression in prostate cancer , 2016, Oncotarget.

[40]  J. Lindberg,et al.  Gene regulatory mechanisms underpinning prostate cancer susceptibility , 2016, Nature Genetics.

[41]  R. Eeles,et al.  Detection of TMPRSS2-ERG translocations in human prostate cancer by expression profiling using GeneChip Human Exon 1.0 ST arrays. , 2008, The Journal of molecular diagnostics : JMD.

[42]  M. Menon,et al.  Androgen receptor and E2F-1 targeted thymoquinone therapy for hormone-refractory prostate cancer. , 2007, Cancer research.

[43]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[44]  Alexander Gusev,et al.  Large-scale transcriptome-wide association study identifies new prostate cancer risk regions , 2018, Nature Communications.

[45]  Xueying Mao,et al.  The complexity of prostate cancer: genomic alterations and heterogeneity , 2012, Nature Reviews Urology.

[46]  Alan M. Kwong,et al.  A reference panel of 64,976 haplotypes for genotype imputation , 2015, Nature Genetics.

[47]  Mary Goldman,et al.  The UCSC cancer genomics browser: update 2011 , 2010, Nucleic Acids Res..

[48]  S. Gabriel,et al.  Assessing the impact of population stratification on genetic association studies , 2004, Nature Genetics.

[49]  M. Clynes,et al.  The prognostic utility of the transcription factor SRF in docetaxel-resistant prostate cancer: in-vitro discovery and in-vivo validation , 2017, BMC Cancer.

[50]  Sebastian M. Armasu,et al.  HNF1B variants associate with promoter methylation and regulate gene networks activated in prostate and ovarian cancer , 2016, Oncotarget.

[51]  M. Rubin,et al.  TMPRSS2:ERG gene fusion predicts subsequent detection of prostate cancer in patients with high-grade prostatic intraepithelial neoplasia. , 2014, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[52]  Mitchell J. Machiela,et al.  LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants , 2015, Bioinform..

[53]  Anushya Muruganujan,et al.  PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements , 2016, Nucleic Acids Res..

[54]  Robert Brown,et al.  Inactivation of HOXA Genes by Hypermethylation in Myeloid and Lymphoid Malignancy is Frequent and Associated with Poor Prognosis , 2007, Clinical Cancer Research.

[55]  C. Sabatti,et al.  Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[56]  P. Blackshear,et al.  Tristetraprolin Impairs Myc-Induced Lymphoma and Abolishes the Malignant State , 2012, Cell.

[57]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[58]  Nawaid Usmani,et al.  Fine-mapping of prostate cancer susceptibility loci in a large meta-analysis identifies candidate causal variants , 2018, Nature Communications.

[59]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[60]  Andrew R. Gehrke,et al.  Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo , 2010, The EMBO journal.

[61]  Pui-Yan Kwok,et al.  A large multiethnic genome-wide association study of prostate cancer identifies novel risk variants and substantial ethnic differences. , 2015, Cancer discovery.

[62]  Eleazar Eskin,et al.  Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. , 2011, American journal of human genetics.

[63]  P. Eline Slagboom,et al.  Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution , 2016, bioRxiv.

[64]  A. Spurdle,et al.  Kallikrein-Related Peptidase 3 (KLK3/PSA) Single Nucleotide Polymorphisms and Ovarian Cancer Survival , 2011, Twin Research and Human Genetics.

[65]  C. Morrison,et al.  High SPDEF may identify patients who will have a prolonged response to androgen deprivation therapy , 2014, The Prostate.

[66]  Alan D. Lopez,et al.  Global, Regional, and National Cancer Incidence, Mortality, Years of Life Lost, Years Lived With Disability, and Disability-Adjusted Life-years for 32 Cancer Groups, 1990 to 2015: A Systematic Analysis for the Global Burden of Disease Study , 2017, JAMA oncology.

[67]  Andrew Menzies,et al.  Analysis of the Genetic Phylogeny of Multifocal Prostate Cancer Identifies Multiple Independent Clonal Expansions in Neoplastic and Morphologically Normal Prostate Tissue , 2015, Nature Genetics.

[68]  Allison P. Heath,et al.  Toward a Shared Vision for Cancer Genomic Data. , 2016, The New England journal of medicine.

[69]  F. Claessens,et al.  Androgen regulation of the TMPRSS2 gene and the effect of a SNP in an androgen response element. , 2013, Molecular endocrinology.

[70]  Matthew L. Freedman,et al.  Analysis of the 10q11 Cancer Risk Locus Implicates MSMB and NCOA4 in Human Prostate Tumorigenesis , 2010, PLoS genetics.

[71]  L. Ashworth,et al.  Molecular Cloning of the Human Kallikrein 15 Gene (KLK15) , 2001, The Journal of Biological Chemistry.

[72]  Bin Yu,et al.  Superheat: An R Package for Creating Beautiful and Extendable Heatmaps for Visualizing Complex Data , 2015, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[73]  M. Loda,et al.  Vitamin D receptor protein expression in tumor tissue and prostate cancer progression. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[74]  Shane A. McCarthy,et al.  Reference-based phasing using the Haplotype Reference Consortium panel , 2016, Nature Genetics.

[75]  Brian L Browning,et al.  Genotype Imputation with Millions of Reference Samples. , 2016, American journal of human genetics.

[76]  Takafumi N. Yamaguchi,et al.  TMPRSS2–ERG fusion co-opts master transcription factors and activates NOTCH signaling in primary prostate cancer , 2017, Nature Genetics.

[77]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[78]  Ting Wang,et al.  The 3D Genome Browser: a web-based browser for visualizing 3D genome organization and long-range chromatin interactions , 2017, Genome Biology.