Enhanced methods to detect haplotypic effects on gene expression

Motivation: Expression quantitative trait loci (eQTLs), genetic variants associated with gene expression levels, are identified in eQTL mapping studies. Such studies typically test for an association between single nucleotide polymorphisms (SNPs) and expression under an additive model, which ignores interaction and haplotypic effects. A mismatch between the model tested and the underlying genetic architecture can reduce the power to detect an association. Here we introduce a new haplotype‐based test for eQTL studies that tests for haplotypic effects on expression levels. Our test is motivated by compound heterozygous architectures, a common disease model for recessive monogenic disorders, in which two different alleles can have the same effect on a gene's function.

Results: When the true causal architecture for a simulated gene is a compound heterozygote, our method captures the signal better than the marginal SNP method. When the underlying model is a single SNP, our method and the marginal SNP method show no difference in power. We apply our method to empirical gene expression data measured in 373 European individuals from the GEUVADIS study and find 29 more eGenes (genes with at least one association) than the standard marginal SNP method. Furthermore, in 974 of the 3529 total eGenes, our haplotype‐based method yields a stronger association signal than the standard marginal SNP method. This demonstrates that our method increases power over the standard method and provides evidence of haplotypic architectures regulating gene expression.

Availability and Implementation: http://bogdan.bioinformatics.ucla.edu/software/

Contact: rob.brown@ucla.edu or pasaniuc@ucla.edu
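To make the compound heterozygous idea in the Motivation concrete, the following is a minimal Python sketch of how a haplotype-aware coding can differ from additive single-SNP coding. It is an illustration only and not the authors' test: the function names (`both_copies_hit`, `additive_dosage`, `r_squared`), the two-SNP toy data, and the "both copies carry an alternate allele" coding are all assumptions introduced here. Note that this coding also counts individuals homozygous for the same alternate allele, not only strict compound heterozygotes.

```python
from statistics import mean


def both_copies_hit(hap1, hap2):
    """Hypothetical haplotype coding (an assumption, not the paper's statistic).

    hap1, hap2: per-individual tuples of 0/1 alleles on each phased haplotype.
    Returns 1 when each haplotype carries at least one alternate allele.
    """
    return [int(any(a) and any(b)) for a, b in zip(hap1, hap2)]


def additive_dosage(hap1, hap2, snp):
    """Standard additive coding: alternate-allele count (0, 1 or 2) at one SNP."""
    return [a[snp] + b[snp] for a, b in zip(hap1, hap2)]


def r_squared(x, y):
    """Squared Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy * sxy / (sxx * syy)


# Toy phased genotypes at two SNPs for six individuals (made-up data).
hap1 = [(1, 0), (0, 0), (1, 0), (0, 1), (0, 0), (0, 1)]
hap2 = [(0, 1), (0, 1), (0, 0), (0, 1), (0, 0), (1, 0)]

# Toy expression values that shift only when both gene copies are hit.
expression = [2.0, 1.0, 1.0, 2.0, 1.0, 2.0]

hit = both_copies_hit(hap1, hap2)
print("haplotype coding r^2:", r_squared(hit, expression))
for snp in (0, 1):
    dosage = additive_dosage(hap1, hap2, snp)
    print(f"SNP {snp} additive r^2:", r_squared(dosage, expression))
```

In this constructed example the haplotype coding tracks expression exactly while neither single-SNP dosage does, which is the kind of model mismatch the abstract describes. Real analyses would need phased genotypes, covariates, and a proper significance test.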
