Identification of type 2 diabetes subgroups through topological analysis of patient similarity

Patient networks constructed from genotype data and electronic medical records pinpointed three type 2 diabetes subtypes. Networks work for diabetes Big problems require big solutions, and for complex diseases such as cancer or diabetes, the big solution is big data. One long-term goal of U.S. President Barack Obama’s Precision Medicine Initiative is to assemble medical and genetic data from at least one million volunteers. But how might researchers use all those data? Li et al. provide one answer by using patient electronic medical records (EMRs) and genotype data from Mount Sinai Medical Center in New York to characterize new subtypes of type 2 diabetes (T2D). The group first clustered EMR data to identify T2D patients within the larger group. Topological analysis of the T2D group identified three new T2D subtypes on the basis of distinct patterns of clinical characteristics and disease comorbidities. Genetic association analysis identified more than 300 single nucleotide polymorphisms (SNPs) specific to each subtype. The authors found that classical T2D features such as obesity, high blood sugar, kidney disease, and eye disease, were limited to subtype 1, whereas other comorbidities such as cancer and neurological diseases were specific to subtypes 2 and 3, respectively. These distinctions might call for tailored treatment regimens rather than a one-size-fits-all approach for T2D. Although a larger sample size is needed to determine causal relationships, this study demonstrates the potential of precision medicine. Type 2 diabetes (T2D) is a heterogeneous complex disease affecting more than 29 million Americans alone with a rising prevalence trending toward steady increases in the coming decades. Thus, there is a pressing clinical need to improve early prevention and clinical management of T2D and its complications. Clinicians have understood that patients who carry the T2D diagnosis have a variety of phenotypes and susceptibilities to diabetes-related complications. We used a precision medicine approach to characterize the complexity of T2D patient populations based on high-dimensional electronic medical records (EMRs) and genotype data from 11,210 individuals. We successfully identified three distinct subgroups of T2D from topology-based patient-patient networks. Subtype 1 was characterized by T2D complications diabetic nephropathy and diabetic retinopathy; subtype 2 was enriched for cancer malignancy and cardiovascular diseases; and subtype 3 was associated most strongly with cardiovascular diseases, neurological diseases, allergies, and HIV infections. We performed a genetic association analysis of the emergent T2D subtypes to identify subtype-specific genetic markers and identified 1279, 1227, and 1338 single-nucleotide polymorphisms (SNPs) that mapped to 425, 322, and 437 unique genes specific to subtypes 1, 2, and 3, respectively. By assessing the human disease–SNP association for each subtype, the enriched phenotypes and biological functions at the gene level for each subtype matched with the disease comorbidities and clinical differences that we identified through EMRs. Our approach demonstrates the utility of applying the precision medicine paradigm in T2D and the promise of extending the approach to the study of other complex, multifactorial diseases.

[1]  [DEVELOPMENTAL ASPECTS OF DIABETIC RETINOPATHY. (RETINOGRAPHIC STUDY)]. , 1965, Le Diabete.

[2]  H. Rifkin,et al.  Unilateral nodular diabetic glomerulosclerosis (Kimmelstiel-Wilson): report of a case. , 1973, Metabolism: clinical and experimental.

[3]  S. Person,et al.  Mutation production from tritium decay: a local effect for (3H)a-adenosine and (3H)6-thymidine decays. , 1976, Mutation research.

[4]  J. Bailey,et al.  Effect of alteration of the acetic acid synthesis pathway on the fermentation pattern of escherichia coli , 1991, Biotechnology and bioengineering.

[5]  Clifford M. Hurvich,et al.  A CORRECTED AKAIKE INFORMATION CRITERION FOR VECTOR AUTOREGRESSIVE MODEL SELECTION , 1993 .

[6]  S. Ichihara,et al.  Construction of Pta-Ack pathway deletion mutants of Escherichia coli and characteristic growth profiles of the mutants in a rich medium. , 1994, Bioscience, biotechnology, and biochemistry.

[7]  D. Linden,et al.  Long-term synaptic depression. , 1995, Annual review of neuroscience.

[8]  L. Tartaglia,et al.  Evidence That the Diabetes Gene Encodes the Leptin Receptor: Identification of a Mutation in the Leptin Receptor Gene in db/db Mice , 1996, Cell.

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  M. Laakso,et al.  Predictors of stroke in middle-aged patients with non-insulin-dependent diabetes. , 1996, Stroke.

[11]  A. Krolewski,et al.  Epidemiology of late diabetic complications. A basis for the development and evaluation of preventive programs. , 1996, Endocrinology and metabolism clinics of North America.

[12]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[13]  R. Hayward,et al.  Estimated Benefits of Glycemic Control in Microvascular Complications in Type 2 Diabetes , 1997, Annals of Internal Medicine.

[14]  M. Cowen,et al.  Casemix adjustment of managed care claims data using the clinical classification for health policy research method. , 1998, Medical care.

[15]  J. Kable,et al.  In vivo gene modification elucidates subtype-specific functions of alpha(2)-adrenergic receptors. , 2000, The Journal of pharmacology and experimental therapeutics.

[16]  G. Wolf,et al.  Cell cycle regulation in diabetic nephropathy. , 2000, Kidney international. Supplement.

[17]  J. Kaprio,et al.  Familial association between allergic disorders and depression in adult Finnish twins. , 2000, American journal of medical genetics.

[18]  J. Kable,et al.  In Vivo Gene Modification Elucidates Subtype-Specific Functions of a 2-Adrenergic Receptors 1 , 2000 .

[19]  Ş. Kirazlı,et al.  Thrombopoietin and mean platelet volume in coronary artery disease , 2001, Clinical cardiology.

[20]  M. P. McDonald,et al.  The α2A-Adrenergic Receptor Plays a Protective Role in Mouse Behavioral Models of Depression and Anxiety , 2001, The Journal of Neuroscience.

[21]  Marc Montminy,et al.  Transcriptional regulation by the phosphorylation-dependent factor CREB , 2001, Nature Reviews Molecular Cell Biology.

[22]  M. P. McDonald,et al.  The a 2 A-Adrenergic Receptor Plays a Protective Role in Mouse Behavioral Models of Depression and Anxiety , 2001 .

[23]  Wolfgang Schmid,et al.  Disruption of CREB function in brain leads to neurodegeneration , 2002, Nature Genetics.

[24]  Peter Libby,et al.  Diabetes and atherosclerosis: epidemiology, pathophysiology, and management. , 2002, JAMA.

[25]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[26]  V. Meyer-Rochow,et al.  Atopy and depression: results from the Northern Finland 1966 Birth Cohort Study , 2003, Molecular Psychiatry.

[27]  V. Meyer-Rochow,et al.  Presence of atopy in first-degree relatives as a predictor of a female proband's depression: results from the Northern Finland 1966 Birth Cohort. , 2003, The Journal of allergy and clinical immunology.

[28]  I. Singh,et al.  Increased peroxisomal fatty acid β-oxidation and enhanced expression of peroxisome proliferator-activated receptor-α in diabetic rat liver , 1999, Molecular and Cellular Biochemistry.

[29]  V. Wiwanitkit Angiotensin-converting Enzyme Gene Polymorphism: I and D Alleles From Some Different Countries , 2004, Clinical and applied thrombosis/hemostasis : official journal of the International Academy of Clinical and Applied Thrombosis/Hemostasis.

[30]  Dan Ziegler,et al.  Diabetic neuropathies: a statement by the American Diabetes Association. , 2005, Diabetes care.

[31]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[32]  A. Gyulkhandanyan,et al.  cAMP-mediated signaling normalizes glucose-stimulated insulin secretion in uncoupling protein-2 overexpressing beta-cells. , 2006, The Journal of endocrinology.

[33]  Joost Schymkowitz,et al.  Bioinformatics Applications Note Snpeffect V2.0: a New Step in Investigating the Molecular Phenotypic Effects of Human Non-synonymous Snps , 2022 .

[34]  H. Parving,et al.  Mannose-binding lectin and mortality in type 2 diabetes. , 2006, Archives of internal medicine.

[35]  H. Qian,et al.  An electrophysiological study of retinal function in the diabetic female rat. , 2006, Investigative ophthalmology & visual science.

[36]  S. Hofer,et al.  Diabetic Nephropathy in 27,805 Children, Adolescents, and Adults With Type 1 Diabetes , 2007, Diabetes Care.

[37]  V. Basevi Standards of medical care in diabetes--2007. , 2009, Diabetes care.

[38]  H. Qian,et al.  Streptozotocin-induced diabetes modulates GABA receptor activity of rat retinal neurons. , 2007, Experimental eye research.

[39]  A. Butte,et al.  AILUN: reannotating gene expression data automatically , 2007, Nature Methods.

[40]  T. Postolache,,et al.  Allergy: A risk factor for suicide? , 2008, Current treatment options in neurology.

[41]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[42]  A. Classification,et al.  Standards of Medical Care in Diabetes—2009 , 2009, Diabetes Care.

[43]  P. D. de Groot,et al.  β2‐Glycoprotein I is incorrectly named apolipoprotein H , 2009, Journal of thrombosis and haemostasis : JTH.

[44]  Gunnar E. Carlsson,et al.  Topology and data , 2009 .

[45]  R. V. Vardhan,et al.  Angiotensin converting enzyme gene polymorphism in type II diabetics with nephropathy , 2009, Indian journal of nephrology.

[46]  K. Kaushansky,et al.  Molecular mechanisms of thrombopoietin signaling , 2009, Journal of thrombosis and haemostasis : JTH.

[47]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[48]  D. Harlan,et al.  Diabetes and Cancer , 2010, Diabetes Care.

[49]  D. Leroith,et al.  Type 2 diabetes and cancer: what is the connection? , 2010, The Mount Sinai journal of medicine, New York.

[50]  P. O S I T I O N S T A T E M E N T,et al.  Diagnosis and Classification of Diabetes Mellitus , 2011, Diabetes Care.

[51]  Edward Giovannucci,et al.  Diabetes and Cancer , 2010, Diabetes Care.

[52]  G. Müller Personalized Prognosis and Diagnosis of Type 2 Diabetes – Vision or Fiction? , 2010, Pharmacology.

[53]  S. Cregan,et al.  Group I metabotropic glutamate receptor signalling and its implication in neurological disease. , 2010, CNS & neurological disorders drug targets.

[54]  D. Meltzer,et al.  Type II diabetes mellitus is associated with decreased measures of lung function in a clinical setting. , 2011, Respiratory medicine.

[55]  M. S. Kirkman,et al.  Comment on: American Diabetes Association. Standards of Medical Care in Diabetes—2011. Diabetes Care 2011;34(Suppl. 1):S11–S61 , 2011 .

[56]  Robert J. Smith,et al.  Personalized medicine in diabetes. , 2011, Clinical chemistry.

[57]  V. Basevi Diagnosis and Classification of Diabetes Mellitus , 2011, Diabetes Care.

[58]  I. Deary,et al.  Genome‐wide association uncovers shared genetic effects among personality traits and mood states , 2012, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[59]  Patrick F. Sullivan,et al.  zCall: a rare variant caller for array-based genotyping: Genetics and population analysis , 2012, Bioinform..

[60]  I. Frank,et al.  HIV infection and glycemic response to newly initiated diabetic medical therapy , 2012, AIDS.

[61]  Alexander A. Morgan,et al.  Type 2 Diabetes Risk Alleles Demonstrate Extreme Directional Differentiation among Human Populations, Compared to Other Diseases , 2012, PLoS genetics.

[62]  E. Grove,et al.  Platelet aggregation is dependent on platelet count in patients with coronary artery disease. , 2012, Thrombosis research.

[63]  Suzette J. Bielinski,et al.  Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study , 2012, J. Am. Medical Informatics Assoc..

[64]  K. Shianna,et al.  Genome-wide mapping for clinically relevant predictors of lamotrigine- and phenytoin-induced hypersensitivity reactions. , 2012, Pharmacogenomics.

[65]  A. Goffi,et al.  Thrombopoietin as Biomarker and Mediator of Cardiovascular Damage in Critical Diseases , 2012, Mediators of inflammation.

[66]  S. Batzoglou,et al.  Linking disease associations with regulatory information in the human genome , 2012, Genome research.

[67]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[68]  Tanya M. Teslovich,et al.  Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes , 2012, Nature Genetics.

[69]  Pedro J. Caraballo,et al.  Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus , 2012, J. Am. Medical Informatics Assoc..

[70]  P. Chou,et al.  Impact of increasing alanine aminotransferase levels within normal range on incident diabetes. , 2012, Journal of the Formosan Medical Association = Taiwan yi zhi.

[71]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[72]  J. Manson,et al.  Blood 25-Hydroxy Vitamin D Levels and Incident Type 2 Diabetes , 2013, Diabetes Care.

[73]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[74]  M. Kivimäki,et al.  Trajectories of cardiometabolic risk factors before diagnosis of three subtypes of type 2 diabetes: a post-hoc analysis of the longitudinal Whitehall II cohort study. , 2013, The lancet. Diabetes & endocrinology.

[75]  G. Bray,et al.  Cardiovascular effects of intensive lifestyle intervention in type 2 diabetes. , 2013, The New England journal of medicine.

[76]  C. Apovian,et al.  B cells promote inflammation in obesity and type 2 diabetes through regulation of T-cell function and an inflammatory cytokine profile , 2013, Proceedings of the National Academy of Sciences.

[77]  N. Wray,et al.  A mega-analysis of genome-wide association studies for major depressive disorder , 2013, Molecular Psychiatry.

[78]  E. Clambey,et al.  Protective role for netrin-1 during diabetic nephropathy , 2013, Journal of Molecular Medicine.

[79]  Melissa A. Basford,et al.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[80]  P. Y. Lum,et al.  Extracting insights from the shape of complex data using topology , 2013, Scientific Reports.

[81]  S. McGuire,et al.  Centers for Disease Control and Prevention. State indicator report on Physical Activity, 2014. Atlanta, GA: U.S. Department of Health and Human Services; 2014. , 2014, Advances in nutrition.

[82]  G. Bray,et al.  Impact of an Intensive Lifestyle Intervention on Use and Cost of Medical Services Among Overweight and Obese Adults With Type 2 Diabetes: The Action for Health in Diabetes , 2014, Diabetes Care.

[83]  Nicholette D. Palmer,et al.  Meta-Analysis of Genome-Wide Association Studies in African Americans Provides Insights into the Genetic Architecture of Type 2 Diabetes , 2014, PLoS genetics.

[84]  Ross M. Fraser,et al.  A General Approach for Haplotype Phasing across the Full Spectrum of Relatedness , 2014, PLoS genetics.

[85]  Jennette P. Moreno,et al.  Cardiovascular Effects of Intensive Lifestyle Intervention in Type 2 Diabetes , 2014, Current Atherosclerosis Reports.

[86]  Li Li,et al.  An Integrative Pipeline for Multi-Modal Discovery of Disease Relationships , 2014, Pacific Symposium on Biocomputing.

[87]  Teresa A. Webster,et al.  Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.