Human polymorphisms at long non-coding RNAs (lncRNAs) and association with prostate cancer risk.

Long non-coding RNAs (lncRNAs), representing a large proportion of non-coding transcripts across the human genome, are evolutionally conserved and biologically functional. At least one-third of the phenotype-related loci identified by genome-wide association studies (GWAS) are mapped to non-coding intervals. However, the relationships between phenotype-related loci and lncRNAs are largely unknown. Utilizing the 1000 Genomes data, we compared single-nucleotide polymorphisms (SNPs) within the sequences of lncRNA and protein-coding genes as defined in the Ensembl database. We further annotated the phenotype-related SNPs reported by GWAS at lncRNA intervals. Because prostate cancer (PCa) risk-related loci were enriched in lncRNAs, we then performed meta-analysis of two existing GWAS for discovery and an additional sample set for replication, revealing PCa risk-related loci at lncRNA regions. The SNP density in regions of lncRNA was similar to that in protein-coding regions, but they were less polymorphic than surrounding regions. Among the 1998 phenotype-related SNPs identified by GWAS, 52 loci were located directly in lncRNA intervals with a 1.5-fold enrichment compared with the entire genome. More than a 5-fold enrichment was observed for eight PCa risk-related loci in lncRNA genes. We also identified a new PCa risk-related SNP rs3787016 in an lncRNA region at 19q13 (per allele odds ratio = 1.19; 95% confidence interval: 1.11-1.27) with P value of 7.22 × 10(-7). lncRNAs may be important for interpreting and mining GWAS data. However, the catalog of lncRNAs needs to be better characterized in order to fully evaluate the relationship of phenotype-related loci with lncRNAs.

[1]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[2]  E. Gillanders,et al.  Genome‐wide scan of Swedish families with hereditary prostate cancer: Suggestive evidence of linkage at 5q11.2 and 19p13.3 , 2003, The Prostate.

[3]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[4]  Kevin M. Bradley,et al.  Common sequence variants on 2p15 and Xp11.22 confer susceptibility to prostate cancer , 2008, Nature Genetics.

[5]  D. Gudbjartsson,et al.  Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24 , 2007, Nature Genetics.

[6]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[7]  D. Bartel,et al.  The impact of microRNAs on protein output , 2008, Nature.

[8]  Kari Stefansson,et al.  Genome-wide association and replication studies identify four variants associated with prostate cancer susceptibility , 2009, Nature Genetics.

[9]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[10]  P. Fearnhead,et al.  Genome-wide association study of prostate cancer identifies a second risk locus at 8q24 , 2007, Nature Genetics.

[11]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[12]  T. Derrien,et al.  Long Noncoding RNAs with Enhancer-like Function in Human Cells , 2010, Cell.

[13]  A. Gylfason,et al.  A common variant associated with prostate cancer in European and African populations , 2006, Nature Genetics.

[14]  Jin Woo Kim,et al.  Sequence variants at 22q13 are associated with prostate cancer risk. , 2009, Cancer research.

[15]  N. Rajewsky,et al.  Widespread changes in protein synthesis induced by microRNAs , 2008, Nature.

[16]  Jianfeng Xu,et al.  Prostate cancer risk‐associated variants reported from genome‐wide association studies: Meta‐analysis and their contribution to genetic Variation , 2010, The Prostate.

[17]  C. Ponting,et al.  Evolution and Functions of Long Noncoding RNAs , 2009, Cell.

[18]  D. Gudbjartsson,et al.  Two variants on chromosome 17 confer prostate cancer risk, and the one in TCF2 protects against type 2 diabetes , 2007, Nature Genetics.

[19]  J. Mattick The Genetic Signatures of Noncoding RNAs , 2009, PLoS genetics.

[20]  J. Carpten,et al.  Inherited genetic variant predisposes to aggressive but not indolent prostate cancer , 2010, Proceedings of the National Academy of Sciences.

[21]  J. Carpten,et al.  Evidence for two independent prostate cancer risk–associated loci in the HNF1B gene at 17q12 , 2008, Nature Genetics.

[22]  Gregory J. Hannon,et al.  Small RNAs as Guardians of the Genome , 2009, Cell.

[23]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[24]  P. Fraser,et al.  No-Nonsense Functions for Long Noncoding RNAs , 2011, Cell.

[25]  Xia Yang,et al.  Integrating pathway analysis and genetics of gene expression for genome-wide association studies. , 2010, American journal of human genetics.

[26]  Peter Kraft,et al.  Identification of a new prostate cancer susceptibility locus on chromosome 8q24 , 2009, Nature Genetics.

[27]  Ali Amin Al Olama,et al.  Multiple newly identified loci associated with prostate cancer susceptibility , 2008, Nature Genetics.

[28]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[29]  J. Carpten,et al.  Two genome-wide association studies of aggressive prostate cancer implicate putative prostate tumor suppressor gene DAB2IP. , 2007, Journal of the National Cancer Institute.

[30]  J. Carpten,et al.  A novel prostate cancer susceptibility locus at 19q13. , 2009, Cancer research.

[31]  C. Ponting,et al.  Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. , 2007, Genome research.

[32]  A. Whittemore,et al.  A genome screen of families with multiple cases of prostate cancer: evidence of genetic heterogeneity. , 2001, American journal of human genetics.

[33]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[34]  A. Visel,et al.  Genomic Views of Distant-Acting Enhancers , 2009, Nature.

[35]  Yusuke Nakamura,et al.  Association of a novel long non‐coding RNA in 8q24 with prostate cancer susceptibility , 2011, Cancer science.

[36]  Ali Amin Al Olama,et al.  Identification of seven new prostate cancer susceptibility loci through a genome-wide association study , 2009, Nature Genetics.

[37]  W. Willett,et al.  Multiple loci identified in a genome-wide association study of prostate cancer , 2008, Nature Genetics.

[38]  E. Sontheimer,et al.  Origins and Mechanisms of miRNAs and siRNAs , 2009, Cell.

[39]  P. Visscher,et al.  A versatile gene-based test for genome-wide association studies. , 2010, American journal of human genetics.

[40]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[41]  P. Stadler,et al.  RNA Maps Reveal New RNA Classes and a Possible Function for Pervasive Transcription , 2007, Science.