Improving precision of glomerular filtration rate estimating model by ensemble learning

BackgroundAccurate assessment of kidney function is clinically important, but estimates of glomerular filtration rate (GFR) by regression are imprecise.MethodsWe hypothesized that ensemble learning could improve precision. A total of 1419 participants were enrolled, with 1002 in the development dataset and 417 in the external validation dataset. GFR was independently estimated from age, sex and serum creatinine using an artificial neural network (ANN), support vector machine (SVM), regression, and ensemble learning. GFR was measured by 99mTc-DTPA renal dynamic imaging calibrated with dual plasma sample 99mTc-DTPA GFR.ResultsMean measured GFRs were 70.0 ml/min/1.73 m2 in the developmental and 53.4 ml/min/1.73 m2 in the external validation cohorts. In the external validation cohort, precision was better in the ensemble model of the ANN, SVM and regression equation (IQR = 13.5 ml/min/1.73 m2) than in the new regression model (IQR = 14.0 ml/min/1.73 m2, P < 0.001). The precision of ensemble learning was the best of the three models, but the models had similar bias and accuracy. The median difference ranged from 2.3 to 3.7 ml/min/1.73 m2, 30% accuracy ranged from 73.1 to 76.0%, and P was > 0.05 for all comparisons of the new regression equation and the other new models.ConclusionsAn ensemble learning model including three variables, the average ANN, SVM, and regression equation values, was more precise than the new regression model. A more complex ensemble learning strategy may further improve GFR estimates.

[1]  Hua Tang,et al.  A comparison of the performances of an artificial neural network and a regression model for GFR estimation. , 2013, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[2]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[3]  J. Coresh,et al.  GFR estimation: from physiology to public health. , 2014, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[4]  Jianqiang Li,et al.  Exploiting ensemble learning for automatic cataract detection and grading , 2016, Comput. Methods Programs Biomed..

[5]  Harold I Feldman,et al.  Estimating glomerular filtration rate from serum creatinine and cystatin C. , 2012, The New England journal of medicine.

[6]  T. Lou,et al.  A New Modified CKD-EPI Equation for Chinese Patients with Type 2 Diabetes , 2014, PloS one.

[7]  Bertram L Kasiske,et al.  The definition, classification, and prognosis of chronic kidney disease: a KDIGO Controversies Conference report. , 2011, Kidney international.

[8]  Josef Coresh,et al.  Chronic kidney disease , 2012, The Lancet.

[9]  G. Beck,et al.  Non-GFR Determinants of Low-Molecular-Weight Serum Protein Filtration Markers in CKD. , 2016, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[10]  G. Beck,et al.  GFR Estimation Using β-Trace Protein and β2-Microglobulin in CKD. , 2016, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[11]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[12]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[13]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .