Identifying novel high-impact rare disease-causing mutations, genes and pathways in exomes of Ashkenazi Jewish inflammatory bowel disease patients

Inflammatory bowel disease (IBD) is a group of chronic diseases, affecting different parts of the gastrointestinal tract, that mainly comprises Crohn's Disease (CD) and Ulcerative Colitis (UC). Most IBD genomic research to date has involved genome-wide association studies (GWAS) of common genetic variants, mostly in Europeans, resulting in the identification of over 200 risk loci. The incidence of IBD in Ashkenazi Jews (AJ) is particularly high compared to other population groups and rare protein-coding variants are significantly enriched in AJ. These variants are expected to have a larger phenotypic effect and are hypothesized to complement the missing heritability that cannot be fully addressed by GWAS in IBD. Therefore, we genetically identified 4,974 AJs IBD cases and controls from whole exome sequencing (WES) data from the NIDDK IBD Genetics Consortium (IBDGC). We selected credible rare variants with high predicted impact, aggregated them into genes, and performed gene burden and pathway enrichment analyses to identify 7 novel plausible IBD-causing genesNCF1, CES1, ICAM1, INPP5D, ABCB1, IL33 and TLR4. We further perform bulk and single-cell RNA sequencing, demonstrating the likely relatedness of the novel genes to IBD. Importantly, we demonstrate that the rare and high impact genetic architecture of AJ adult IBD displays a significant overlap with very early onset IBD (VEOIBD) genetics. At the variant level, we performed Phenome-wide association studies (PheWAS) in the UK Biobank to replicate risk sites in IBD and reveal shared risk sites with other diseases. Finally, we showed that a polygenic risk score (PRS) has high power to differentiate AJ IBD cases from controls when using rare and high impact variants.

[1]  Noor B. Dawany,et al.  Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease. , 2015, Gastroenterology.

[2]  Judy H. Cho,et al.  MDR1 Ala893 polymorphism is associated with inflammatory bowel disease. , 2003, American journal of human genetics.

[3]  Karin Breuer,et al.  InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation , 2012, Nucleic Acids Res..

[4]  Hailiang Huang,et al.  Fine-mapping inflammatory bowel disease loci to single variant resolution , 2017, Nature.

[5]  Lillian Maggio-Price,et al.  Helicobacter bilis infection accelerates and H. hepaticus infection delays the development of colitis in multiple drug resistance-deficient (mdr1a-/-) mice. , 2002, The American journal of pathology.

[6]  A. Forbes,et al.  NCF1 (p47phox) and NCF1 Pseudogenes Are Not Associated with Inflammatory Bowel Disease , 2004, Inflammatory bowel diseases.

[7]  †The International HapMap Consortium The International HapMap Project , 2003, Nature.

[8]  P. Stenson,et al.  The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies , 2017, Human Genetics.

[9]  Zahra Bagheri,et al.  A comparative study of the impacts of unbalanced sample sizes on the four synthesized methods of meta-analytic structural equation modeling , 2017, BMC Research Notes.

[10]  M. Stephens,et al.  fastSTRUCTURE: Variational Inference of Population Structure in Large SNP Data Sets , 2014, Genetics.

[11]  M. Daly,et al.  Searching for missing heritability: Designing rare variant association studies , 2014, Proceedings of the National Academy of Sciences.

[12]  Mauricio O. Carneiro,et al.  From FastQ Data to High‐Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline , 2013, Current protocols in bioinformatics.

[13]  Judy H. Cho,et al.  Functional variants in the LRRK2 gene confer shared effects on risk for Crohn’s disease and Parkinson’s disease , 2018, Science Translational Medicine.

[14]  Jianguo Xia,et al.  NetworkAnalyst for statistical, visual and network-based meta-analysis of gene expression data , 2015, Nature Protocols.

[15]  Lluis Quintana-Murci,et al.  The mutation significance cutoff: gene-level thresholds for variant predictions , 2016, Nature Methods.

[16]  Itsik Pe'er,et al.  Abraham's children in the genome era: major Jewish diaspora populations comprise distinct genetic clusters with shared Middle Eastern Ancestry. , 2010, American journal of human genetics.

[17]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[18]  Jacques Fellay,et al.  Blacklisting variants common in private cohorts but not in public databases optimizes human exome analysis , 2018, Proceedings of the National Academy of Sciences.

[19]  P. Rutgeerts,et al.  Intestinal expression of SHIP in inflammatory bowel diseases , 2011, Gut.

[20]  E. Zintzaras Is there evidence to claim or deny association between variants of the multidrug resistance gene (MDR1 or ABCB1) and inflammatory bowel disease? , 2012, Inflammatory bowel diseases.

[21]  M. Rieder,et al.  Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. , 2012, American journal of human genetics.

[22]  Andreas Krämer,et al.  Causal analysis approaches in Ingenuity Pathway Analysis , 2013, Bioinform..

[23]  Mamta Giri,et al.  Prioritizing Crohn’s disease genes by integrating association signals with gene expression implicates monocyte subsets , 2019, Genes & Immunity.

[24]  Paul Hoffman,et al.  Integrating single-cell transcriptomic data across different conditions, technologies, and species , 2018, Nature Biotechnology.

[25]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[26]  P. Visscher,et al.  Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores , 2015, bioRxiv.

[27]  W. Reinisch,et al.  Targeting Endothelial Ligands: ICAM-1/alicaforsen, MAdCAM-1. , 2018, Journal of Crohn's & colitis.

[28]  J. Casanova,et al.  Whole-exome sequencing to analyze population structure, parental inbreeding, and familial linkage , 2016, Proceedings of the National Academy of Sciences.

[29]  Lei Shang,et al.  Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants , 2014, Proceedings of the National Academy of Sciences.

[30]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[31]  Timothy L. Tickle,et al.  Pediatric Crohn disease patients exhibit specific ileal transcriptome and microbiome signature. , 2014, The Journal of clinical investigation.

[32]  E. Seidman,et al.  Associations between variants in the ABCB1 (MDR1) gene and corticosteroid dependence in children with Crohn's disease , 2011, Inflammatory bowel diseases.

[33]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[34]  P. Lefèvre,et al.  Clinical Pharmacology of Janus Kinase Inhibitors in Inflammatory Bowel Disease , 2020, Journal of Crohn's & colitis.

[35]  Shuang Feng,et al.  RAREMETAL: fast and powerful meta-analysis for rare variants , 2014, Bioinform..

[36]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[37]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[38]  Jean-Michel Claverie,et al.  The human gene damage index as a gene-level approach to prioritizing exome variants , 2015, Proceedings of the National Academy of Sciences.

[39]  Jennifer Mulle,et al.  A Genome-Wide Scan of Ashkenazi Jewish Crohn's Disease Suggests Novel Susceptibility Loci , 2012, PLoS genetics.

[40]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[41]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[42]  S. Paczesny,et al.  The ST2/IL-33 Axis in Immune Cells during Inflammatory Diseases , 2017, Front. Immunol..

[43]  Alastair Forbes,et al.  The contribution of NOD2 gene mutations to the risk and site of disease in inflammatory bowel disease. , 2002, Gastroenterology.

[44]  Bruce J. Aronow,et al.  ToppCluster: a multiple gene list feature analyzer for comparative enrichment clustering and network-based dissection of biological systems , 2010, Nucleic Acids Res..

[45]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[46]  Judy H. Cho,et al.  Insights into the genetic epidemiology of Crohn's and rare diseases in the Ashkenazi Jewish population , 2016, bioRxiv.

[47]  P. Visscher,et al.  Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data. , 2014, Human molecular genetics.

[48]  J. C. Jones,et al.  A novel model of inflammatory bowel disease: mice deficient for the multiple drug resistance gene, mdr1a, spontaneously develop colitis. , 1998, Journal of immunology.

[49]  R. Holmdahl,et al.  The protective role of ROS in autoimmune disease. , 2009, Trends in immunology.

[50]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[51]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[52]  Guillaume Vogt,et al.  The human gene connectome as a map of short cuts for morbid allele discovery , 2013, Proceedings of the National Academy of Sciences.

[53]  Gregory M. Cooper,et al.  CADD: predicting the deleteriousness of variants throughout the human genome , 2018, Nucleic Acids Res..

[54]  David Díaz-Jiménez,et al.  Characterization of the novel ST2/IL‐33 system in patients with inflammatory bowel disease , 2010, Inflammatory bowel diseases.

[55]  E. Loftus,et al.  Risk of cancer in inflammatory bowel disease: going up, going down, or still the same? , 2016, Current opinion in gastroenterology.

[56]  Paul Theodor Pyl,et al.  HTSeq—a Python framework to work with high-throughput sequencing data , 2014, bioRxiv.

[57]  Judy H. Cho,et al.  High-Throughput Identification of the Plasma Proteomic Signature of Inflammatory Bowel Disease , 2018, Journal of Crohn's & colitis.

[58]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[59]  Joshua C. Denny,et al.  TYK2 Protein-Coding Variants Protect against Rheumatoid Arthritis and Autoimmunity, with No Evidence of Major Pleiotropic Effects on Non-Autoimmune Complex Traits , 2015, PloS one.

[60]  P. Visscher,et al.  Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores , 2015, bioRxiv.

[61]  F. Middleton,et al.  P-176 SHIP1 Deficiency in Human IBD: Molecular Basis, Prognosis and Agonist Development , 2016 .

[62]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[63]  Xinrui Li,et al.  Toll-like Receptors and Inflammatory Bowel Disease , 2018, Front. Immunol..

[64]  S. Bloom,et al.  Association between p47phox pseudogenes and inflammatory bowel disease. , 2003, Blood.

[65]  A. Zellos,et al.  Dental Caries and Periodontal Disease in Children and Adolescents with Inflammatory Bowel Disease: A Case–Control Study , 2015, Inflammatory bowel diseases.

[66]  H. Ostrer,et al.  The population genetics of the Jewish people , 2012, Human Genetics.

[67]  M. Silverberg,et al.  Variants in nicotinamide adenine dinucleotide phosphate oxidase complex components determine susceptibility to very early onset inflammatory bowel disease. , 2014, Gastroenterology.

[68]  B. Müller-Myhsok,et al.  The ATG16L1 Gene Variants rs2241879 and rs2241880 (T300A) Are Strongly Associated With Susceptibility to Crohn's Disease in the German Population , 2008, The American Journal of Gastroenterology.

[69]  David C. Wilson,et al.  Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease , 2016, Nature Genetics.