Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: A soft clustering analysis

Background Type 2 diabetes (T2D) is a heterogeneous disease for which (1) disease-causing pathways are incompletely understood and (2) subclassification may improve patient management. Unlike other biomarkers, germline genetic markers do not change with disease progression or treatment. In this paper, we test whether a germline genetic approach informed by physiology can be used to deconstruct T2D heterogeneity. First, we aimed to categorize genetic loci into groups representing likely disease mechanistic pathways. Second, we asked whether the novel clusters of genetic loci we identified have any broad clinical consequence, as assessed in four separate subsets of individuals with T2D. Methods and findings In an effort to identify mechanistic pathways driven by established T2D genetic loci, we applied Bayesian nonnegative matrix factorization (bNMF) clustering to genome-wide association study (GWAS) results for 94 independent T2D genetic variants and 47 diabetes-related traits. We identified five robust clusters of T2D loci and traits, each with distinct tissue-specific enhancer enrichment based on analysis of epigenomic data from 28 cell types. Two clusters contained variant-trait associations indicative of reduced beta cell function, differing from each other by high versus low proinsulin levels. The three other clusters displayed features of insulin resistance: obesity mediated (high body mass index [BMI] and waist circumference [WC]), “lipodystrophy-like” fat distribution (low BMI, adiponectin, and high-density lipoprotein [HDL] cholesterol, and high triglycerides), and disrupted liver lipid metabolism (low triglycerides). Increased cluster genetic risk scores were associated with distinct clinical outcomes, including increased blood pressure, coronary artery disease (CAD), and stroke. We evaluated the potential for clinical impact of these clusters in four studies containing individuals with T2D (Metabolic Syndrome in Men Study [METSIM], N = 487; Ashkenazi, N = 509; Partners Biobank, N = 2,065; UK Biobank [UKBB], N = 14,813). Individuals with T2D in the top genetic risk score decile for each cluster reproducibly exhibited the predicted cluster-associated phenotypes, with approximately 30% of all individuals assigned to just one cluster top decile. Limitations of this study include that the genetic variants used in the cluster analysis were restricted to those associated with T2D in populations of European ancestry. Conclusion Our approach identifies salient T2D genetically anchored and physiologically informed pathways, and supports the use of genetics to deconstruct T2D heterogeneity. Classification of patients by these genetic pathways may offer a step toward genetically informed T2D patient management.

[1]  Thomas W. Mühleisen,et al.  Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease , 2011, Nature Genetics.

[2]  Christian Gieger,et al.  Edinburgh Research Explorer Common variants at 10 genomic loci influence hemoglobin A(C) levels via glycemic and nonglycemic pathways , 2010 .

[3]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[4]  Sara M. Willems,et al.  The impact of low-frequency and rare variants on lipid levels , 2015, Nature Genetics.

[5]  David M. Evans,et al.  A novel common variant in DCST2 is associated with length in early life and height in adulthood , 2014, Human molecular genetics.

[6]  Claude Bouchard,et al.  Identification of heart rate-associated loci and their effects on cardiac conduction and rhythm disorders , 2014 .

[7]  Cathie Sudlow,et al.  Algorithms for the Capture and Adjudication of Prevalent and Incident Diabetes in UK Biobank , 2016, PloS one.

[8]  Jon Wakefield,et al.  A Bayesian measure of the probability of false discovery in genetic epidemiology studies. , 2007, American journal of human genetics.

[9]  May E. Montasser,et al.  Genome-Wide Association Study of the Modified Stumvoll Insulin Sensitivity Index Identifies BCL2 and FAM19A2 as Novel Insulin Sensitivity Loci , 2016, Diabetes.

[10]  Jonathan C. Cohen,et al.  Pnpla3I148M knockin mice accumulate PNPLA3 on lipid droplets and develop hepatic steatosis , 2014, Hepatology.

[11]  Cook,et al.  Fine-mapping of an expanded set of type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps , 2018, bioRxiv.

[12]  Tomoaki Hishida,et al.  Crucial roles of D-type cyclins in the early stage of adipocyte differentiation. , 2008, Biochemical and biophysical research communications.

[13]  Y. Jang,et al.  Standards of Medical Care in Diabetes-2010 by the American Diabetes Association: Prevention and Management of Cardiovascular Disease , 2010 .

[14]  Tom R. Gaunt,et al.  Edinburgh Research Explorer Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function , 2022 .

[15]  P. Munroe,et al.  Genetic Evidence for a Normal-Weight “Metabolically Obese” Phenotype Linking Insulin Resistance, Hypertension, Coronary Artery Disease, and Type 2 Diabetes , 2014, Diabetes.

[16]  Anne Tybjærg-Hansen,et al.  Exome-wide association study identifies a TM6SF2 variant that confers susceptibility to nonalcoholic fatty liver disease , 2014, Nature Genetics.

[17]  D. MacArthur,et al.  An eMERGE Clinical Center at Partners Personalized Medicine , 2016, Journal of personalized medicine.

[18]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2016 .

[19]  Christian Gieger,et al.  New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk , 2010, Nature Genetics.

[20]  Christian Gieger,et al.  Genetic fine-mapping and genomic annotation defines causal mechanisms at type 2 diabetes susceptibility loci , 2016 .

[21]  Andrew D. Johnson,et al.  Multiethnic genome-wide meta-analysis of ectopic fat depots identifies loci associated with adipocyte development and differentiation , 2016, Nature Genetics.

[22]  K. Fox Hypertension and heart disease. , 1996, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[23]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[24]  Audrey Y. Chu,et al.  Genetic loci associated with circulating levels of very long-chain saturated fatty acids[S] , 2015, Journal of Lipid Research.

[25]  Kyle J. Gaulton,et al.  Genome-wide associations for birth weight and correlations with adult disease , 2016 .

[26]  Vincent Y. F. Tan,et al.  Automatic Relevance Determination in Nonnegative Matrix Factorization with the /spl beta/-Divergence , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  A. Gloyn,et al.  Glucokinase regulatory protein: complexity at the crossroads of triglyceride and glucose metabolism , 2015, Current opinion in lipidology.

[28]  Tanya M. Teslovich,et al.  An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans , 2017, Diabetes.

[29]  B. Glaser,et al.  Predicting Diabetic Nephropathy Using a Multifactorial Genetic Model , 2011, PloS one.

[30]  Rosa M. Badia,et al.  Re-analysis of public genetic data reveals a rare X-chromosomal variant associated with type 2 diabetes , 2018, Nature Communications.

[31]  Tom R. Gaunt,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[32]  J. Kushner,et al.  Cyclins D2 and D1 Are Essential for Postnatal Pancreatic β-Cell Growth , 2005, Molecular and Cellular Biology.

[33]  M. Fornage,et al.  Genetic Loci Associated with Plasma Phospholipid n-3 Fatty Acids: A Meta-Analysis of Genome-Wide Association Studies from the CHARGE Consortium , 2011, PLoS genetics.

[34]  Christian Gieger,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[35]  Tamara S. Roman,et al.  New genetic loci link adipose and insulin biology to body fat distribution , 2014, Nature.

[36]  Inês Barroso,et al.  Impact of Type 2 Diabetes Susceptibility Variants on Quantitative Glycemic Traits Reveals Mechanistic Heterogeneity , 2014, Diabetes.

[37]  O. Delaneau,et al.  Supplementary Information for ‘ Improved whole chromosome phasing for disease and population genetic studies ’ , 2012 .

[38]  Alex Doney,et al.  Genetic variation in GIPR influences the glucose and insulin responses to an oral glucose challenge , 2010, Nature Genetics.

[39]  Claude Bouchard,et al.  A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance , 2012, Nature Genetics.

[40]  Jonathan C. Cohen,et al.  Inactivation of Tm6sf2, a Gene Defective in Fatty Liver Disease, Impairs Lipidation but Not Secretion of Very Low Density Lipoproteins* , 2016, The Journal of Biological Chemistry.

[41]  Ellen M. Schmidt,et al.  New loci for body fat percentage reveal link between adiposity and cardiometabolic disease risk , 2016, Nature Communications.

[42]  L. Groop,et al.  Novel subgroups of adult-onset diabetes and their association with outcomes: a data-driven cluster analysis of six variables. , 2018, The lancet. Diabetes & endocrinology.

[43]  M. Boehnke,et al.  Recent advances in understanding the genetic architecture of type 2 diabetes. , 2015, Human molecular genetics.

[44]  P. Donnelly,et al.  Genome-wide genetic data on ~500,000 UK Biobank participants , 2017, bioRxiv.

[45]  Tom R. Gaunt,et al.  Genome-wide meta-analysis uncovers novel loci influencing circulating leptin levels , 2016, Nature Communications.

[46]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[47]  Stephen C. J. Parker,et al.  The genetic architecture of type 2 diabetes , 2016, Nature.

[48]  Tom R. Gaunt,et al.  The UK10K project identifies rare variants in health and disease , 2015, Nature.

[49]  Mark I. McCarthy,et al.  A Central Role for GRB10 in Regulation of Islet Function in Man , 2014, PLoS genetics.

[50]  Fabian J Theis,et al.  Genome-wide association analyses identify 18 new loci associated with serum urate concentrations , 2012, Nature Genetics.

[51]  Inês Barroso,et al.  Genome-Wide Association Identifies Nine Common Variants Associated With Fasting Proinsulin Levels and Provides New Insights Into the Pathophysiology of Type 2 Diabetes , 2011, Diabetes.

[52]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[53]  W. Guan,et al.  Genome-Wide Association Study Identifies Novel Loci Associated With Concentrations of Four Plasma Phospholipid Fatty Acids in the De Novo Lipogenesis Pathway: Results From the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium , 2013, Circulation. Cardiovascular genetics.

[54]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[55]  Michael T. McManus,et al.  A Whole-Genome RNA Interference Screen Reveals a Role for Spry2 in Insulin Transcription and the Unfolded Protein Response , 2017, Diabetes.

[56]  Gad Getz,et al.  Somatic ERCC2 Mutations Are Associated with a Distinct Genomic Signature in Urothelial Tumors , 2016, Nature Genetics.

[57]  D. van der Kooy,et al.  β‐Cell evolution: How the pancreas borrowed from the brain , 2011, BioEssays : news and reviews in molecular, cellular and developmental biology.

[58]  P. Froguel,et al.  Disentangling the Role of Melatonin and its Receptor MTNR1B in Type 2 Diabetes: Still a Long Way to Go? , 2017, Current Diabetes Reports.

[59]  C. Sudlow,et al.  Low-frequency and common genetic variation in ischemic stroke , 2016, Neurology.

[60]  S. Gabriel,et al.  Whole-genome sequencing reveals activation-induced cytidine deaminase signatures during indolent chronic lymphocytic leukaemia evolution , 2015, Nature Communications.

[61]  Benjamin S. Glicksberg,et al.  Identification of type 2 diabetes subgroups through topological analysis of patient similarity , 2015, Science Translational Medicine.

[62]  Giovanni Malerba,et al.  Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes , 2017, Nature Genetics.

[63]  W. Guan,et al.  Genome-Wide Association Study of Plasma N6 Polyunsaturated Fatty Acids Within the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium , 2014, Circulation. Cardiovascular genetics.

[64]  Philippe Froguel,et al.  Decreased STARD10 Expression Is Associated with Defective Insulin Secretion in Humans and Mice , 2017, American journal of human genetics.

[65]  Alan M. Kwong,et al.  A reference panel of 64,976 haplotypes for genotype imputation , 2015, Nature Genetics.

[66]  Steven J. M. Jones,et al.  Comprehensive Molecular Characterization of Muscle-Invasive Bladder Cancer , 2017, Cell.

[67]  Laura J. Scott,et al.  Genetic regulatory signatures underlying islet gene expression and type 2 diabetes , 2017, Proceedings of the National Academy of Sciences.

[68]  Karen L. Mohlke,et al.  Novel Loci for Adiponectin Levels and Their Influence on Type 2 Diabetes and Metabolic Traits: A Multi-Ethnic Meta-Analysis of 45,891 Individuals , 2012, PLoS genetics.

[69]  Udo Hoffmann,et al.  Genome-Wide Association Analysis Identifies Variants Associated with Nonalcoholic Fatty Liver Disease That Have Distinct Effects on Metabolic Traits , 2011, PLoS genetics.

[70]  Samuel E. Jones,et al.  Genetic Evidence for a Link Between Favorable Adiposity and Lower Risk of Type 2 Diabetes, Hypertension, and Heart Disease , 2016, Diabetes.

[71]  J. Kushner,et al.  Cyclins D2 and D1 are essential for postnatal pancreatic beta-cell growth. , 2005, Molecular and cellular biology.

[72]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[73]  M. McCarthy,et al.  Reduced Insulin Exocytosis in Human Pancreatic β-Cells With Gene Variants Linked to Type 2 Diabetes , 2012, Diabetes.

[74]  William W. Greenwald,et al.  Decreased STARD 10 Expression Is Associated with Defective Insulin Secretion in Humans and Mice , 2017 .

[75]  Peter Szolovits,et al.  Toward high-throughput phenotyping: unbiased automated feature extraction and selection from knowledge sources , 2015, J. Am. Medical Informatics Assoc..

[76]  S. Lewitzky,et al.  A genome scan for type 2 diabetes susceptibility loci in a genetically isolated population. , 2001, Diabetes.

[77]  A. Hamsten,et al.  TM6SF2 is a regulator of liver fat metabolism influencing triglyceride secretion and hepatic lipid droplet content , 2014, Proceedings of the National Academy of Sciences.

[78]  2. Classification and Diagnosis of Diabetes: Standards of Medical Care in Diabetes—2018 , 2017, Diabetes Care.

[79]  Johanna Kuusisto,et al.  Changes in Insulin Sensitivity and Insulin Release in Relation to Glycemia and Glucose Tolerance in 6,414 Finnish Men , 2009, Diabetes.