Pancan-meQTL: a database to systematically evaluate the effects of genetic variants on methylation in human cancer

Abstract DNA methylation is an important epigenetic mechanism for regulating gene expression. Aberrant DNA methylation has been observed in various human diseases, including cancer. Single-nucleotide polymorphisms can contribute to tumor initiation, progression and prognosis by influencing DNA methylation, and DNA methylation quantitative trait loci (meQTL) have been identified in physiological and pathological contexts. However, no database has been developed to systematically analyze meQTLs across multiple cancer types. Here, we present Pancan-meQTL, a database to comprehensively provide meQTLs across 23 cancer types from The Cancer Genome Atlas by integrating genome-wide genotype and DNA methylation data. In total, we identified 8 028 964 cis-meQTLs and 965 050 trans-meQTLs. Among these, 23 432 meQTLs are associated with patient overall survival times. Furthermore, we identified 2 214 458 meQTLs that overlap with known loci identified through genome-wide association studies. Pancan-meQTL provides a user-friendly web interface (http://bioinfo.life.hust.edu.cn/Pancan-meQTL/) that is convenient for browsing, searching and downloading data of interest. This database is a valuable resource for investigating the roles of genetics and epigenetics in cancer.

[1]  Tom R. Gaunt,et al.  Systematic identification of genetic influences on methylation across the human life course , 2016, Genome Biology.

[2]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[3]  Lucia Altucci,et al.  Cancer epigenetics: Moving forward , 2018, PLoS genetics.

[4]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[5]  Helen E. Parkinson,et al.  The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog) , 2016, Nucleic Acids Res..

[6]  Luigi Ferrucci,et al.  Abundant Quantitative Trait Loci Exist for DNA Methylation and Gene Expression in Human Brain , 2010, PLoS genetics.

[7]  A. Hofman,et al.  Identification of context-dependent expression quantitative trait loci in whole blood , 2016, Nature Genetics.

[8]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[9]  William Wheeler,et al.  Common genetic variants in epigenetic machinery genes and risk of upper gastrointestinal cancers. , 2015, International journal of epidemiology.

[10]  Holger Heyn,et al.  Quantitative Trait Loci Identify Functional Noncoding Variation in Cancer , 2016, PLoS genetics.

[11]  William Wheeler,et al.  Characterizing the genetic basis of methylome diversity in histologically normal human lung tissue , 2014, Nature Communications.

[12]  K. Frazer,et al.  Human genetic variation and its contribution to complex traits , 2009, Nature Reviews Genetics.

[13]  Benno Pütz,et al.  Genome-wide mapping of genetic determinants influencing DNA methylation and gene expression in human hippocampus , 2017, Nature Communications.

[14]  Patrick F. Sullivan,et al.  High density methylation QTL analysis in human blood via next-generation sequencing of the methylated genomic DNA fraction , 2015, Genome Biology.

[15]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of human colon and rectal cancer , 2012, Nature.

[16]  Emmanouil T. Dermitzakis,et al.  Putative cis-regulatory drivers in colorectal cancer , 2014, Nature.

[17]  Shannon E. Ellis,et al.  Cross-tissue integration of genetic and epigenetic data offers insight into autism spectrum disorder , 2016, Nature Communications.

[18]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[19]  D. Schübeler Function and information content of DNA methylation , 2015, Nature.

[20]  R. Weksberg,et al.  Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray , 2013, Epigenetics.

[21]  Hu Chen,et al.  An update of miRNASNP database for better SNP selection by GWAS data, miRNA expression and online tools , 2015, Database J. Biol. Databases Curation.

[22]  Andrew D. Johnson,et al.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap , 2008, Bioinform..

[23]  Daniel R Weinberger,et al.  Mapping DNA methylation across development, genotype, and schizophrenia in the human frontal cortex , 2015, Nature Neuroscience.

[24]  A. Hofman,et al.  Disease variants alter transcription factor levels and methylation of their binding sites , 2016, Nature Genetics.

[25]  X. Miao,et al.  A polymorphic MYC response element in KBTBD11 influences colorectal cancer risk, especially in interaction with an MYC-regulated SNP rs6983267 , 2017, Annals of oncology : official journal of the European Society for Medical Oncology.

[26]  Zhao Zhang,et al.  PancanQTL: systematic identification of cis-eQTLs and trans-eQTLs in 33 cancer types , 2017, Nucleic Acids Res..

[27]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[28]  A E Giuliano,et al.  Assessment of DNA methylation status in early stages of breast cancer development , 2013, British Journal of Cancer.

[29]  Frank Alber,et al.  CancerDetector: ultrasensitive and non-invasive cancer detection at the resolution of individual reads using cell-free DNA methylation sequencing data , 2018, Nucleic acids research.

[30]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.