Targeted screening of cis-regulatory variation in human haplotypes.

Regulatory cis-acting variants account for a large proportion of gene expression variability in populations. Cis-acting differences can be specifically measured by comparing relative levels of allelic transcripts within a sample. Allelic expression (AE) mapping for cis-regulatory variant discovery has been hindered by the requirements of having informative or heterozygous single nucleotide polymorphisms (SNPs) within genes in order to assign the allelic origin of each transcript. In this study we have developed an approach to systematically screen for heritable cis-variants in common human haplotypes across >1,000 genes. In order to achieve the highest level of information per haplotype studied, we carried out allelic expression measurements by using both intronic and exonic SNPs in primary transcripts. We used a novel RNA pooling strategy in immortalized lymphoblastoid cell lines (LCLs) and primary human osteoblast cell lines (HObs) to allow for high-throughput AE. Screening hits from RNA pools were further validated by performing allelic expression mapping in individual samples. Our results indicate that >10% of expressed genes in human LCLs show genotype-linked AE. In addition, we have validated cis-acting variants in over 20 genes linked with common disease susceptibility in recent genome-wide studies. More generally, our results indicate that RNA pooling coupled with AE read-out by second generation sequencing or by other methods provides a high-throughput tool for cataloging the impact of common noncoding variants in the human genome.

[1]  Andrew D. Johnson,et al.  Polymorphisms affecting gene transcription and mRNA processing in pharmacogenetic candidate genes: detection through allelic expression imbalance in human target tissues , 2008, Pharmacogenetics and genomics.

[2]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[3]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[4]  B. Tycko,et al.  Genomic surveys by methylation-sensitive SNP analysis identify sequence-dependent allele-specific DNA methylation , 2008, Nature Genetics.

[5]  Richard C Trembath,et al.  Identification of ZNF313/RNF114 as a novel psoriasis susceptibility gene. , 2008, Human molecular genetics.

[6]  A. Syvänen,et al.  A risk haplotype of STAT4 for systemic lupus erythematosus is over-expressed, correlates with anti-dsDNA and shows additive effects with two risk alleles of IRF5 , 2008, Human molecular genetics.

[7]  Jean Tichet,et al.  A Polymorphism Within the G6PC2 Gene Is Associated with Fasting Plasma Glucose Levels , 2008, Science.

[8]  A Hofman,et al.  Bone mineral density, osteoporosis, and osteoporotic fractures: a genome-wide association study , 2008, The Lancet.

[9]  John D. Storey,et al.  Mapping the Genetic Architecture of Gene Expression in Human Liver , 2008, PLoS biology.

[10]  T. Pastinen,et al.  Systematic assessment of the human osteoblast transcriptome in resting and induced primary cells. , 2008, Physiological genomics.

[11]  A. Feinberg,et al.  SNP-specific array-based allele-specific expression analysis. , 2008, Genome research.

[12]  M. McPeek,et al.  Complex genetic interactions underlying expression differences between Drosophila races: Analysis of chromosome substitutions , 2008, Proceedings of the National Academy of Sciences.

[13]  David S Sanders,et al.  Newly identified genetic risk variants for celiac disease related to the immune response , 2008, Nature Genetics.

[14]  Catarina D. Campbell,et al.  A survey of allelic imbalance in F1 mice. , 2008, Genome research.

[15]  Bing Ren,et al.  Genome-wide mapping of allele-specific protein-DNA interactions in human cells , 2008, Nature Methods.

[16]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[17]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[18]  Geoffrey Hom,et al.  Association of systemic lupus erythematosus with C8orf13-BLK and ITGAM-ITGAX. , 2008, The New England journal of medicine.

[19]  Marta E Alarcón-Riquelme,et al.  Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants in ITGAM, PXK, KIAA1542 and other loci , 2008, Nature Genetics.

[20]  Sandra D'Alfonso,et al.  Functional variants in the B-cell gene BANK1 are associated with systemic lupus erythematosus , 2008, Nature Genetics.

[21]  Thomas J. Hudson,et al.  Differential Allelic Expression in the Human Genome: A Robust Approach To Identify Genetic and Epigenetic Cis-Acting Mechanisms Regulating Gene Expression , 2008, PLoS genetics.

[22]  Jacek Majewski,et al.  Genome-wide analysis of transcript isoform variation in humans , 2008, Nature Genetics.

[23]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[24]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[25]  D. Strachan,et al.  Rheumatoid arthritis association at 6q23 , 2007, Nature Genetics.

[26]  John N. Hutchinson,et al.  Widespread Monoallelic Expression on Human Autosomes , 2007, Science.

[27]  Simon C. Potter,et al.  Association scan of 14,500 nonsynonymous SNPs in four diseases identifies autoimmunity variants , 2007, Nature Genetics.

[28]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[29]  D. Koller,et al.  Population genomics of human gene expression , 2007, Nature Genetics.

[30]  L. Liang,et al.  A genome-wide association study of global gene expression , 2007, Nature Genetics.

[31]  L. Almasy,et al.  Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes , 2007, Nature Genetics.

[32]  Anbupalam Thalamuthu,et al.  TRAF1-C5 as a risk locus for rheumatoid arthritis--a genomewide study. , 2007, The New England journal of medicine.

[33]  Mark Atkinson,et al.  Large-scale genetic fine mapping and genotype-phenotype associations implicate polymorphism in the IL2RA region in type 1 diabetes , 2007, Nature Genetics.

[34]  Gonçalo R. Abecasis,et al.  Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma , 2007, Nature.

[35]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[36]  Xiaofeng Zhu,et al.  The Association of a SNP Upstream of INSIG2 with Body Mass Index is Reproduced in Several but Not All Cohorts , 2007, PLoS genetics.

[37]  T. Hudson,et al.  A genome-wide approach to identifying novel-imprinted genes , 2007, Human Genetics.

[38]  J. Hajnal,et al.  Variations due to analysis technique in intracellular pH measurements in simulated and in vivo 31P MR spectra of the human brain , 2006, Journal of magnetic resonance imaging : JMRI.

[39]  D. Cox,et al.  Analysis of allelic differential expression in human white blood cells. , 2006, Genome research.

[40]  T. Hudson,et al.  Mapping common regulatory variants to human haplotypes. , 2005, Human molecular genetics.

[41]  Thomas J Hudson,et al.  Survey of allelic expression using EST mining. , 2005, Genome research.

[42]  Joshua T. Burdick,et al.  Mapping determinants of human gene expression by regional and genome-wide association , 2005, Nature.

[43]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[44]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[45]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[46]  Thomas J. Hudson,et al.  Cis-Acting Regulatory Variation in the Human Genome , 2004, Science.

[47]  C. Molony,et al.  Genetic analysis of genome-wide variation in human gene expression , 2004, Nature.

[48]  Bill Newman,et al.  Functional variants of OCTN cation transporter genes are associated with Crohn disease , 2004, Nature Genetics.

[49]  Huda Akil,et al.  Systematic changes in gene expression in postmortem human brains associated with tissue pH and terminal medical conditions. , 2004, Human molecular genetics.

[50]  Daniel Sinnett,et al.  A survey of genetic and epigenetic variation affecting human gene expression. , 2004, Physiological genomics.

[51]  M. Schalling,et al.  Pyrosequencing™‐based SNP allele frequency estimation in DNA pools , 2004, Human mutation.

[52]  K. Buetow,et al.  Allelic variation in gene expression is common in the human genome. , 2003, Genome research.

[53]  M. Owen,et al.  Cis-acting variation in the expression of a high proportion of genes in human brain , 2003, Human Genetics.

[54]  Bert Vogelstein,et al.  Allelic Variation in Human Gene Expression , 2002, Science.

[55]  Howard C. Berg,et al.  Genetic analysis , 1957, Nature Biotechnology.