Poly-omic risk scores predict inflammatory bowel disease diagnosis

Inflammatory Bowel Disease (IBD) is characterized by complex etiology and a disrupted colonic ecosystem. We provide a framework for the analysis of multi-omic data, which we apply to study the gut ecosystem in IBD. Specifically, we train and validate models using data on the metagenome metatranscriptome, virome, and metabolome from the Human Microbiome Project 2 IBD Multi-omics Database, with 1,785 repeated samples from 131 individuals (103 cases, 27 controls). After splitting the participants into training and testing groups, we used mixed effects least absolute shrinkage and selection operator (LASSO) regression to select features for each -omic. These features, with demographic covariates, were used to generate separate single-omic prediction scores. All four single-omic scores were then combined into a final regression to assess the relative importance of the individual -omics and the predictive benefits when considered together. We identified several species, pathways, and metabolites known to be associated with IBD risk, and we explored the connections between datasets. Individually, metabolomics and viromics scores were more predictive than metagenomics or metatranscriptomics, and when all four scores were combined, we predicted disease diagnosis with a Nagelkerke’s R2 of 0.46 and an AUC of 0.80 [95% CI: 0.63, 0.98]. Our work suggests that some single-omic models for complex traits are more predictive than others, that incorporating multiple -omics datasets may improve prediction, and that each -omic data type provides a combination of unique and redundant information. This modeling framework can be extended to other complex traits and multi-omic datasets. Importance Complex traits are characterized by many biological and environmental factors, such that multi-omics datasets are well-positioned to help us understand their underlying etiologies. We applied a prediction framework across multiple -omics (metagenomics, metatranscriptomics, metabolomics, and viromics) from the gut ecosystem to predict inflammatory bowel disease (IBD) diagnosis. The predicted scores from our models highlighted key features and allowed us to compare the relative utility of each -omic dataset in single-omic versus multi-omics models. Our results emphasized the importance of metabolomics and viromics over metagenomics and metatranscriptomics for predicting IBD status. The greater predictive capability of metabolomics and viromics is likely because these -omics serve as markers of lifestyle factors such as diet. This study provides a modeling framework for multi-omic data, and our results show the utility of combining multiple -omic data types to disentangle complex disease etiologies and biological signatures.

[1]  K. Estrada,et al.  A systematic review and functional bioinformatics analysis of genes associated with Crohn’s disease identify more than 120 related genes , 2022, BMC genomics.

[2]  Danfeng Chen,et al.  Gut microbial metabolome in inflammatory bowel disease: From association to therapeutic perspectives , 2022, Computational and structural biotechnology journal.

[3]  S. Sankararaman,et al.  Methylation risk scores are associated with a collection of phenotypes within electronic health record systems , 2022, npj Genomic Medicine.

[4]  S. Kelley,et al.  Gut Metabolites Are More Predictive of Disease and Cohoused States than Gut Bacterial Features in a Polycystic Ovary Syndrome-Like Mouse Model , 2021, mSystems.

[5]  L. Ohno-Machado,et al.  Machine Learning-based Prediction Models for Diagnosis and Prognosis in Inflammatory Bowel Diseases: A Systematic Review. , 2021, Journal of Crohn's & colitis.

[6]  Keishi Fujio,et al.  Multi-omics approach to precision medicine for immune-mediated diseases , 2021, Inflammation and Regeneration.

[7]  Phillip M. Mach,et al.  Utilizing machine learning with knockoff filtering to extract significant metabolites in Crohn’s disease with a publicly available untargeted metabolomics dataset , 2021, PloS one.

[8]  P. Schmitt‐Kopplin,et al.  The role of fecal sulfur metabolome in inflammatory bowel diseases. , 2021, International journal of medical microbiology : IJMM.

[9]  J. Crespo,et al.  Propionate Fermentative Genes of the Gut Microbiome Decrease in Inflammatory Bowel Disease , 2021, Journal of clinical medicine.

[10]  V. Moreno,et al.  Identifying environmental risk factors for inflammatory bowel diseases: a Mendelian randomization study , 2020, Scientific Reports.

[11]  Krishna R. Kalari,et al.  Longitudinal Multi-omics Reveals Subset-Specific Mechanisms Underlying Irritable Bowel Syndrome , 2020, Cell.

[12]  P. Wearsch,et al.  The Genus Alistipes: Gut Bacteria With Emerging Implications to Inflammation, Cancer, and Mental Health , 2020, Frontiers in Immunology.

[13]  H. Ochman,et al.  Resurrection of a global, metagenomically defined gokushovirus , 2020, eLife.

[14]  E. Quigley,et al.  The microbiome and inflammatory bowel disease. , 2020, The Journal of allergy and clinical immunology.

[15]  D. Plichta,et al.  Multi-"-Omics" Profiling in Patients With Quiescent Inflammatory Bowel Disease Identifies Biomarkers Predicting Relapse. , 2020, Inflammatory bowel diseases.

[16]  T. H. Nguyen,et al.  The global, regional, and national burden of inflammatory bowel disease in 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017 , 2019, The lancet. Gastroenterology & hepatology.

[17]  S. Bonovas,et al.  Environmental Risk Factors for Inflammatory Bowel Diseases: an Umbrella Review of Meta-analyses. , 2019, Gastroenterology.

[18]  Zhiyuan Dai,et al.  Docosapentaenoic acid (DPA, 22:5n-3) ameliorates inflammation in an ulcerative colitis model. , 2019, Food & function.

[19]  H. Vlamakis,et al.  Ruminococcus gnavus, a member of the human gut microbiome associated with Crohn’s disease, produces an inflammatory polysaccharide , 2019, Proceedings of the National Academy of Sciences.

[20]  Colin J. Brislawn,et al.  Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases , 2019, Nature.

[21]  Alicia R. Martin,et al.  Clinical use of current polygenic risk scores may exacerbate health disparities , 2019, Nature Genetics.

[22]  Thomas P. Quinn,et al.  A field guide for the compositional analysis of any-omics data , 2018, bioRxiv.

[23]  H. Flint,et al.  Mechanistic Insights Into the Cross-Feeding of Ruminococcus gnavus and Ruminococcus bromii on Host and Dietary Carbohydrates , 2018, Front. Microbiol..

[24]  Shengtao Zhu,et al.  Helicobacter pylori infection and inflammatory bowel disease: a crosstalk between upper and lower digestive tract , 2018, Cell Death & Disease.

[25]  Shengtao Zhu,et al.  Helicobacter pylori infection and inflammatory bowel disease: a crosstalk between upper and lower digestive tract , 2018, Cell Death & Disease.

[26]  Timothy Shin Heng Mak,et al.  Tutorial: a guide to performing polygenic risk score analyses , 2018, bioRxiv.

[27]  G. Kaplan,et al.  Co-occurrence of Asthma and the Inflammatory Bowel Diseases: A Systematic Review and Meta-analysis , 2018, Clinical and Translational Gastroenterology.

[28]  D. Ojcius,et al.  Gut commensal Parabacteroides goldsteinii plays a predominant role in the anti-obesity effects of polysaccharides isolated from Hirsutella sinensis , 2018, Gut.

[29]  Kenneth G. C. Smith,et al.  Genome‐wide association studies in Crohn's disease: Past, present and future , 2018, Clinical & translational immunology.

[30]  D. Spring,et al.  Loving the poison: the methylcitrate cycle and bacterial pathogenesis. , 2018, Microbiology.

[31]  W. Reindl,et al.  P796 Adverse childhood events and psychiatric comorbidity in a single-centre IBD-cohort , 2018 .

[32]  E. Mohammadi,et al.  Barriers and facilitators related to the implementation of a physiological track and trigger system: A systematic review of the qualitative evidence , 2017, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[33]  Jean M. Macklaim,et al.  Microbiome Datasets Are Compositional: And This Is Not Optional , 2017, Front. Microbiol..

[34]  M. Delgado-Rodríguez,et al.  Systematic review and meta-analysis. , 2017, Medicina intensiva.

[35]  A. Sandvik,et al.  Gammadelta T Cells in Crohn’s Disease: A New Player in the Disease Pathogenesis? , 2017, Journal of Crohn's & colitis.

[36]  J. Schneider,et al.  An umbrella review of meta-analyses of interventions to improve maternal outcomes for teen mothers. , 2017, Journal of adolescence.

[37]  P. Patil,et al.  Biochemical and genome sequence analyses of Megasphaera sp. strain DISK18 from dental plaque of a healthy individual reveals commensal lifestyle , 2016, Scientific Reports.

[38]  V. Andersen,et al.  Heritability in Inflammatory Bowel Disease: From the First Twin Study to Genome-Wide Association Studies , 2015, Inflammatory bowel diseases.

[39]  T. To,et al.  Asthma, Type 1 and Type 2 Diabetes Mellitus, and Inflammatory Bowel Disease amongst South Asian Immigrants to Canada and Their Children: A Population-Based Cohort Study , 2015, PloS one.

[40]  T. Nakai,et al.  Complete Genome Sequence Analysis of Two Pseudomonas plecoglossicida Phages, Potential Therapeutic Agents , 2014, Applied and Environmental Microbiology.

[41]  P. Visscher,et al.  Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data. , 2014, Human molecular genetics.

[42]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[43]  Gerhard Tutz,et al.  Variable selection for generalized linear mixed models by L1-penalized estimation , 2012, Statistics and Computing.

[44]  F. Dudbridge Power and Predictive Accuracy of Polygenic Risk Scores , 2013, PLoS genetics.

[45]  H. Gaskins,et al.  Microbial pathways in colonic sulfur metabolism and links with health and disease , 2012, Front. Physio..

[46]  Timothy L. Tickle,et al.  Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment , 2012, Genome Biology.

[47]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[48]  S. Tedelind,et al.  Anti-inflammatory properties of the short-chain fatty acids acetate and propionate: a study with relevance to inflammatory bowel disease. , 2007, World journal of gastroenterology.

[49]  Tao Zhang,et al.  RNA Viral Community in Human Feces: Prevalence of Plant Pathogenic Viruses , 2005, PLoS biology.

[50]  H. Jomaa,et al.  Cutting Edge: Human γδ T Cells Are Activated by Intermediates of the 2-C-methyl-d-erythritol 4-phosphate Pathway of Isoprenoid Biosynthesis1 , 2001, The Journal of Immunology.

[51]  S. Carlson,et al.  Prevalence of Inflammatory Bowel Disease Among Medicare Fee-For-Service Beneficiaries — United States, 2001−2018 , 2021, MMWR. Morbidity and mortality weekly report.

[52]  S. Larsson,et al.  Alcohol Consumption and Cardiovascular Disease , 2020, Circulation. Genomic and precision medicine.

[53]  Subhasis Chaudhuri,et al.  Current State of Art , 2019, Visual and Text Sentiment Analysis through Hierarchical Deep Learning Networks.

[54]  C. Fiocchi,et al.  Immunopathogenesis of IBD: current state of the art , 2016, Nature Reviews Gastroenterology &Hepatology.

[55]  J. Wardle,et al.  Sleep and nighttime energy consumption in early childhood: a population‐based cohort study , 2015, Pediatric obesity.

[56]  Sanjeet Kumar,et al.  Virus diseases of peppers (Capsicum spp.) and their control. , 2014, Advances in virus research.

[57]  Subrata Ghosh,et al.  Increasing incidence and prevalence of the inflammatory bowel diseases with time, based on systematic review. , 2012, Gastroenterology.

[58]  D. Holdstock Past, present--and future? , 2005, Medicine, conflict, and survival.

[59]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[60]  Evon M. O. Abu-Taieh,et al.  Comparative Study , 2020, Definitions.