Glucodensities: A new representation of glucose profiles using distributional data analysis

Biosensor data have the potential to improve disease control and detection. However, the analysis of these data under free-living conditions is not feasible with current statistical techniques. To address this challenge, we introduce a new functional representation of biosensor data, termed the glucodensity, together with a data analysis framework based on distances between them. The new data analysis procedure is illustrated through an application in diabetes with continuous-time glucose monitoring (CGM) data. In this domain, we show marked improvement with respect to state-of-the-art analysis methods. In particular, our findings demonstrate that (i) the glucodensity possesses an extraordinary clinical sensitivity to capture the typical biomarkers used in the standard clinical practice in diabetes; (ii) previous biomarkers cannot accurately predict glucodensity, so that the latter is a richer source of information and; (iii) the glucodensity is a natural generalization of the time in range metric, this being the gold standard in the handling of CGM data. Furthermore, the new method overcomes many of the drawbacks of time in range metrics and provides more in-depth insight into assessing glucose metabolism.

[1]  A. McCall,et al.  Need for Regulatory Change to Incorporate Beyond A1C Glycemic Metrics , 2018, Diabetes Care.

[2]  Karin Kraft,et al.  [Type-2 diabetes]. , 2010, MMW Fortschritte der Medizin.

[3]  Frédéric Ferraty,et al.  Nonparametric Functional Data Analysis: Theory and Practice (Springer Series in Statistics) , 2006 .

[4]  J. Škrha,et al.  Glucose variability, HbA1c and microvascular complications , 2016, Reviews in Endocrine and Metabolic Disorders.

[5]  6. Glycemic Targets: Standards of Medical Care in Diabetes—2018 , 2017, Diabetes Care.

[6]  Badih Ghattas,et al.  Classifying densities using functional regression trees: Applications in oceanology , 2007, Comput. Stat. Data Anal..

[7]  Manuel Febrero-Bande,et al.  Statistical Computing in Functional Data Analysis: The R Package fda.usc , 2012 .

[8]  D. Klonoff,et al.  Recommendations for standardizing glucose reporting and analysis to optimize clinical decision making in diabetes: the Ambulatory Glucose Profile (AGP). , 2013, Diabetes technology & therapeutics.

[9]  I. Hirsch,et al.  Connecting the Dots: Validation of Time in Range Metrics With Microvascular Outcomes , 2019, Diabetes Care.

[10]  H. Müller,et al.  Additive Functional Regression for Densities as Responses , 2020, Journal of the American Statistical Association.

[11]  Karel Hron,et al.  Compositional Data Analysis in Time-Use Epidemiology: What, Why, How , 2020, International journal of environmental research and public health.

[12]  C. Cobelli,et al.  In Silico Preclinical Trials: A Proof of Concept in Closed-Loop Control of Type 1 Diabetes , 2009, Journal of diabetes science and technology.

[13]  V. Pawlowsky-Glahn,et al.  Bayes Hilbert Spaces , 2014 .

[14]  M. Fréchet Les éléments aléatoires de nature quelconque dans un espace distancié , 1948 .

[15]  Alessandra Menafoglio,et al.  Compositional regression with functional response , 2018, Comput. Stat. Data Anal..

[16]  Gábor J. Székely,et al.  The Energy of Data , 2017 .

[17]  P. Cryer Glycemic Goals in Diabetes: Trade-off Between Glycemic Control and Iatrogenic Hypoglycemia , 2014, Diabetes.

[18]  R. Turner,et al.  Homeostasis model assessment: insulin resistance and β-cell function from fasting plasma glucose and insulin concentrations in man , 1985, Diabetologia.

[19]  A. Barden,et al.  Advanced Glycation End Products: A Review , 2013 .

[20]  F. Doyle,et al.  Clinical Targets for Continuous Glucose Monitoring Data Interpretation: Recommendations From the International Consensus on Time in Range , 2019, Diabetes Care.

[21]  L. Monnier,et al.  Glycemic Variability: Can We Bridge the Divide Between Controversies? , 2011, Diabetes Care.

[22]  Y. Bao,et al.  Association of Time in Range, as Assessed by Continuous Glucose Monitoring, With Diabetic Retinopathy in Type 2 Diabetes , 2018, Diabetes Care.

[23]  Z. Q. John Lu,et al.  Nonparametric Functional Data Analysis: Theory And Practice , 2007, Technometrics.

[24]  Peter Filzmoser,et al.  Simplicial principal component analysis for density functions in Bayes spaces , 2016, Comput. Stat. Data Anal..

[25]  David R. Owens,et al.  Glycemic Variability: The Third Component of the Dysglycemia in Diabetes. Is it Important? How to Measure it? , 2008, Journal of diabetes science and technology.

[26]  D. Nathan,et al.  Relationship between glycated haemoglobin levels and mean glucose levels over time , 2007, Diabetologia.

[27]  E. Boyko,et al.  Insulin Resistance Predicts Mortality in Nondiabetic Individuals in the U.S. , 2010, Diabetes Care.

[28]  Y. Jang,et al.  Standards of Medical Care in Diabetes-2010 by the American Diabetes Association: Prevention and Management of Cardiovascular Disease , 2010 .

[29]  Michelle Perkins,et al.  Continuous glucose monitoring in pregnant women with type 1 diabetes (CONCEPTT): a multicentre international randomised controlled trial , 2017, The Lancet.

[30]  G. Hu,et al.  Time in Range in Relation to All-Cause and Cardiovascular Mortality in Patients With Type 2 Diabetes: A Prospective Cohort Study , 2020, Diabetes Care.

[31]  Arthur Gretton,et al.  Learning Theory for Distribution Regression , 2014, J. Mach. Learn. Res..

[32]  V. Pawlowsky-Glahn,et al.  Modeling and Analysis of Compositional Data , 2015 .

[33]  A. Antoniadis Wavelets in statistics: A review , 1997 .

[34]  R. Bergenstal Glycemic Variability and Diabetes Complications: Does It Matter? Simply Put, There Are Better Glycemic Markers! , 2015, Diabetes Care.

[35]  Ann. Probab Distance Covariance in Metric Spaces , 2017 .

[36]  The Use of Continuous Glucose Monitoring to Evaluate the Glycemic Response to Food , 2008 .

[37]  Tracey McLaughlin,et al.  Glucotypes reveal new patterns of glucose dysregulation , 2018, PLoS biology.

[38]  Karel Hron,et al.  Compositional data analysis for physical activity, sedentary time and sleep research , 2018, Statistical methods in medical research.

[39]  H. Muller,et al.  Fréchet analysis of variance for random objects , 2017, Biometrika.

[40]  David C Klonoff,et al.  Recommendations for Standardizing Glucose Reporting and Analysis to Optimize Clinical Decision Making in Diabetes: The Ambulatory Glucose Profile , 2013, Journal of diabetes science and technology.

[41]  W. F. Taylor,et al.  Day-to-day variation of continuously monitored glycaemia: A further measure of diabetic instability , 1972, Diabetologia.

[42]  H. Muller,et al.  Fréchet regression for random objects with Euclidean predictors , 2016, The Annals of Statistics.

[43]  Hans-Georg Müller Functional Data Analysis. , 2011 .

[44]  Roy W Beck,et al.  The Fallacy of Average: How Using HbA1c Alone to Assess Glycemic Control Can Be Misleading , 2017, Diabetes Care.

[45]  Anne Leucht,et al.  Dependent wild bootstrap for degenerate U- and V-statistics , 2013, J. Multivar. Anal..

[46]  Hans-Georg Müller,et al.  Density Estimation including Examples , 2014 .

[47]  Young K. Truong,et al.  On bandwidth choice for density estimation with dependent data , 1995 .

[48]  C. McDonnell,et al.  A novel approach to continuous glucose analysis utilizing glycemic variation. , 2005, Diabetes technology & therapeutics.

[49]  A. Izenman Review Papers: Recent Developments in Nonparametric Density Estimation , 1991 .

[50]  Ciprian M Crainiceanu,et al.  Short-term variability in measures of glycemia and implications for the classification of diabetes. , 2007, Archives of internal medicine.

[51]  Cas Weykamp,et al.  IFCC reference system for measurement of hemoglobin A1c in human blood and the national standardization schemes in the United States, Japan, and Sweden: a method-comparison study. , 2004, Clinical chemistry.

[52]  Jae Ho Shin,et al.  Biocompatible materials for continuous glucose monitoring devices. , 2013, Chemical reviews.

[53]  P. Reiss,et al.  Generalized reliability based on distances , 2019, Biometrics.

[54]  V. Pawlowsky-Glahn,et al.  on Compositional Data Analysis , 2007 .

[55]  K. Khunti,et al.  Glucose dysregulation phenotypes — time to improve outcomes , 2018, Nature Reviews Endocrinology.

[56]  H. Muller,et al.  Functional data analysis for density functions by transformation to a Hilbert space , 2016, 1601.02869.

[57]  B. Caffo,et al.  MULTILEVEL FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS. , 2009, The annals of applied statistics.

[58]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[59]  Robin Genuer,et al.  Fréchet random forests , 2019, ArXiv.

[60]  Sean M Ewings,et al.  A Bayesian network for modelling blood glucose concentration and exercise in type 1 diabetes , 2015, Statistical methods in medical research.

[61]  Robin Genuer,et al.  Fr\'echet random forests for metric space valued regression with non euclidean predictors. , 2020 .

[62]  C. Cadarso-Suárez,et al.  Glycemic Variability and Its Association With Demographics and Lifestyles in a General Adult Population , 2016, Journal of diabetes science and technology.

[63]  E. Segal,et al.  Personalized Nutrition by Prediction of Glycemic Responses , 2015, Cell.

[64]  F. J. Ariza-López,et al.  Approximating the null distribution of a class of statistics for testing independence , 2019, J. Comput. Appl. Math..

[65]  C. Villani Optimal Transport: Old and New , 2008 .

[66]  R. Beck,et al.  Validation of Time in Range as an Outcome Measure for Diabetes Clinical Trials , 2018, Diabetes Care.

[67]  C. Preda Regression models for functional data by reproducing kernel Hilbert spaces methods , 2007 .

[68]  Yoav Zemel,et al.  Statistical Aspects of Wasserstein Distances , 2018, Annual Review of Statistics and Its Application.

[69]  W. F. Taylor,et al.  Mean Amplitude of Glycemic Excursions, a Measure of Diabetic Instability , 1970, Diabetes.

[70]  E. Kilpatrick,et al.  Glycated haemoglobin in the year 2000 , 2000, Journal of clinical pathology.

[71]  D. Klonoff,et al.  A Review of Continuous Glucose Monitoring Based Composite Metrics for Glycemic Control. , 2020, Diabetes technology & therapeutics.

[72]  L. Kessler,et al.  Glucose Variability , 2016, Journal of diabetes science and technology.

[73]  C. Crainiceanu,et al.  Modeling continuous glucose monitoring (CGM) data during sleep. , 2020, Biostatistics.

[74]  L. Dimeglio,et al.  Type 1 Diabetes , 2019, Epidemiology of Diabetes.

[75]  Maria L. Rizzo,et al.  Kernel k-Groups via Hartigan’s Method , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.