Fine-mapping inflammatory bowel disease loci to single variant resolution

Inflammatory bowel diseases are chronic gastrointestinal inflammatory disorders that affect millions of people worldwide. Genome-wide association studies have identified 200 inflammatory bowel disease-associated loci, but few have been conclusively resolved to specific functional variants. Here we report fine-mapping of 94 inflammatory bowel disease loci using high-density genotyping in 67,852 individuals. We pinpoint 18 associations to a single causal variant with greater than 95% certainty, and an additional 27 associations to a single variant with greater than 50% certainty. These 45 variants are significantly enriched for protein-coding changes (n = 13), direct disruption of transcription-factor binding sites (n = 3), and tissue-specific epigenetic marks (n = 10), with the last category showing enrichment in specific immune cells among associations stronger in Crohn’s disease and in gut mucosa among associations stronger in ulcerative colitis. The results of this study suggest that high-resolution fine-mapping in large samples can convert many discoveries from genome-wide association studies into statistically convincing causal variants, providing a powerful substrate for experimental elucidation of disease mechanisms.

[1]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[2]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[3]  Buhm Han,et al.  Chromatin marks identify critical cell types for fine mapping complex trait variants , 2012 .

[4]  J. Todd,et al.  Rare Variants of IFIH1, a Gene Implicated in Antiviral Responses, Protect Against Type 1 Diabetes , 2009, Science.

[5]  Andre Franke,et al.  1000 Genomes-based imputation identifies novel and refined associations for the Wellcome Trust Case Control Consortium phase 1 Data , 2012, European Journal of Human Genetics.

[6]  Wieslawa I. Mentzen,et al.  Genetic Variants Regulating Immune Cell Levels in Health and Disease , 2013, Cell.

[7]  Jake K. Byrnes,et al.  Bayesian refinement of association signals for 14 loci in 3 common diseases , 2012, Nature Genetics.

[8]  Bin Wu,et al.  Cooperative assembly and dynamic disassembly of MDA5 filaments for viral dsRNA recognition , 2011, Proceedings of the National Academy of Sciences.

[9]  Hailiang Huang,et al.  High density mapping of the MHC identifies a shared role for HLA-DRB1*01:03 in inflammatory bowel diseases and heterozygous advantage in ulcerative colitis , 2014, Nature Genetics.

[10]  C. Fiocchi,et al.  TGF-beta/Smad signaling defects in inflammatory bowel disease: mechanisms and possible novel therapies for chronic inflammation. , 2001, The Journal of clinical investigation.

[11]  Gilean McVean,et al.  Trinculo: Bayesian and frequentist multinomial logistic regression for genome-wide association studies of multi-category phenotypes , 2016, Bioinform..

[12]  K. Shianna,et al.  Long-range LD can confound genome scans in admixed populations. , 2008, American journal of human genetics.

[13]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[14]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[15]  M. Pirinen,et al.  Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis , 2013, Nature Genetics.

[16]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[17]  C. Porter,et al.  Direct health care costs of Crohn's disease and ulcerative colitis in US children and adults. , 2008, Gastroenterology.

[18]  Maria Fichera,et al.  Mongersen, an oral SMAD7 antisense oligonucleotide, and Crohn's disease. , 2015, The New England journal of medicine.

[19]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[20]  J. Barrett,et al.  Strategies for fine-mapping complex traits , 2015, Human molecular genetics.

[21]  Stefan Rose-John,et al.  Strawberry notch homolog 2 is a novel inflammatory response factor predominantly but not exclusively expressed by astrocytes in the central nervous system , 2015, Glia.

[22]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[23]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[24]  Isabelle Cleynen,et al.  Resequencing of positional candidates identifies low frequency IL23R coding variants protecting against inflammatory bowel disease , 2011, Nature Genetics.

[25]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[26]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[27]  Lijun Pu,et al.  Improved LASSO priors for shrinkage quantitative trait loci mapping , 2012, Theoretical and Applied Genetics.

[28]  Joel Eriksson,et al.  FTO genotype is associated with phenotypic variability of body mass index , 2012, Nature.

[29]  Manolis Kellis,et al.  Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers , 2015, Nature Genetics.

[30]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[31]  R. Andrews,et al.  Innate Immune Activity Conditions the Effect of Regulatory Variants upon Monocyte Gene Expression , 2014, Science.

[32]  E. Dermitzakis,et al.  Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations , 2010, PLoS genetics.

[33]  James A. Morris,et al.  Evoker: a visualization tool for genotype intensity data , 2010, Bioinform..

[34]  Mitchell Kronenberg,et al.  The tumor necrosis factor family member TNFSF14 (LIGHT) is required for resolution of intestinal inflammation in mice. , 2014, Gastroenterology.

[35]  P. Deloukas,et al.  Multiple common variants for celiac disease influencing immune gene expression , 2010, Nature Genetics.

[36]  Ian Diamond,et al.  Analysis of Binary Data. 2nd Edn. , 1990 .

[37]  Bindu Nanduri,et al.  Transcriptomic analysis of peritoneal cells in a mouse model of sepsis: confirmatory and novel results in early and late sepsis , 2012, BMC Genomics.

[38]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[39]  John D. Storey A direct approach to false discovery rates , 2002 .

[40]  Joshua M. Korn,et al.  Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease , 2011, Nature Genetics.

[41]  Subrata Ghosh,et al.  Increasing incidence and prevalence of the inflammatory bowel diseases with time, based on systematic review. , 2012, Gastroenterology.

[42]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[43]  Manolis Kellis,et al.  Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments , 2013, Nucleic acids research.

[44]  Mark I. McCarthy,et al.  Evaluating the Performance of Fine-Mapping Strategies at Common Variant GWAS Loci , 2015, PLoS genetics.

[45]  J. Marchini,et al.  Genotype Imputation with Thousands of Genomes , 2011, G3: Genes | Genomes | Genetics.

[46]  N. Woychik,et al.  Regulating the regulators. , 1994, Trends in biochemical sciences.

[47]  Ole F. Christensen,et al.  Proceedings, 10 World Congress of Genetics Applied to Livestock Production DMU - A Package for Analyzing Multivariate Mixed Models in quantitative Genetics and Genomics , 2014 .

[48]  P. D’haeseleer What are DNA sequence motifs? , 2006, Nature Biotechnology.

[49]  W. G. Hill,et al.  Genome partitioning of genetic variation for complex traits using common SNPs , 2011, Nature Genetics.

[50]  Hailiang Huang,et al.  Gene-Based Tests of Association , 2011, PLoS genetics.

[51]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[52]  Pan Du,et al.  lumi: a pipeline for processing Illumina microarray , 2008, Bioinform..

[53]  David C. Wilson,et al.  Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease , 2016, Nature Genetics.

[54]  Osamu Takeuchi,et al.  Strawberry notch homologue 2 regulates osteoclast fusion by enhancing the expression of DC-STAMP , 2013, The Journal of experimental medicine.

[55]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[56]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[57]  D. Cox,et al.  Analysis of Binary Data (2nd ed.). , 1990 .

[58]  O. Delaneau,et al.  Supplementary Information for ‘ Improved whole chromosome phasing for disease and population genetic studies ’ , 2012 .

[59]  S. Targan,et al.  Inhibition of a novel fibrogenic factor Tl1a reverses established colonic fibrosis , 2014, Mucosal Immunology.

[60]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[61]  P. Sullivan,et al.  Heritability and Genomics of Gene Expression in Peripheral Blood , 2014, Nature Genetics.

[62]  G. Fonseca-Camarillo,et al.  Expression of interleukin (IL)‐19 and IL‐24 in inflammatory bowel disease patients: a cross‐sectional study , 2014, Clinical and experimental immunology.

[63]  Fang Yan,et al.  Kinase suppressor of Ras-1 protects intestinal epithelium from cytokine-mediated apoptosis during inflammation. , 2004, The Journal of clinical investigation.

[64]  Wan-Wan Lin,et al.  Decoy Receptor 3 Increases Monocyte Adhesion to Endothelial Cells via NF-κB-Dependent Up-Regulation of Intercellular Adhesion Molecule-1, VCAM-1, and IL-8 Expression1 , 2005, The Journal of Immunology.

[65]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[66]  James A. Morris,et al.  optiCall: a robust genotype-calling algorithm for rare, low-frequency and common variants , 2012, Bioinform..

[67]  John A. Todd,et al.  Statistical colocalization of monocyte gene expression and genetic risk variants for type 1 diabetes , 2012, Human molecular genetics.

[68]  Wei Lu,et al.  The kinase LRRK2 is a regulator of the transcription factor NFAT that modulates the severity of inflammatory bowel disease , 2011, Nature Immunology.

[69]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[70]  Rudolf Grosschedl,et al.  Transcription factor EBF1 is essential for the maintenance of B cell identity and prevention of alternative fates in committed cells , 2013, Nature Immunology.

[71]  Peter J. Murray,et al.  Cutting Edge: A Transcriptional Repressor and Corepressor Induced by the STAT3-Regulated Anti-Inflammatory Signaling Pathway1 , 2007, The Journal of Immunology.

[72]  Michel Georges,et al.  BayesFM: a software program to fine-map multiple causative variants in GWAS identified risk loci , 2016, bioRxiv.

[73]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[74]  Ruslan Medzhitov,et al.  Toll Pathway-Dependent Blockade of CD4+CD25+ T Cell-Mediated Suppression by Dendritic Cells , 2003, Science.

[75]  M. Stephens,et al.  Bayesian variable selection regression for genome-wide association studies and other large-scale problems , 2011, 1110.6019.

[76]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[77]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[78]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[79]  Mark J Daly,et al.  LRRK2 Is Involved in the IFN-γ Response and Host Response to Pathogens , 2010, The Journal of Immunology.

[80]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2016 .

[81]  W. Huber,et al.  Model-based variance-stabilizing transformation for Illumina microarray data , 2008, Nucleic acids research.

[82]  I. Metón,et al.  Interleukin-19 Impairment in Active Crohn’s Disease Patients , 2014, PloS one.

[83]  M. Metzker,et al.  Overexpression of M68/DcR3 in human gastrointestinal tract tumors independent of gene amplification and its location in a four-gene cluster. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[84]  Søren Brunak,et al.  Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci , 2016, Nature Genetics.