Machine learning algorithms for predicting malnutrition among under-five children in Bangladesh.

OBJECTIVE The aim of this study was is to predict malnutrition status in under-five children in Bangladesh by using various machine learning (ML) algorithms. METHODS For analysis purposes, the nationally representative secondary records from the 2014 Bangladesh Demographic and Health Survey (BDHS) were used. Five well-known ML algorithms such as linear discriminant analysis (LDA), k-nearest neighbors (k-NN), support vector machines (SVM), random forest (RF), and logistic regression (LR) have been considered to accurately predict malnutrition status among children. Additionally, a systematic assessment of the algorithms was performed by using accuracy, sensitivity, specificity, and Cohen's κ statistic. RESULTS Based on various performance parameters, the best results were accomplished with the RF algorithm, which demonstrated an accuracy of 68.51%, a sensitivity of 94.66%, and a specificity of 69.76%. Additionally, a most extreme discriminative ability appeared by RF classification (Cohen's κ = 0.2434). CONCLUSION On the basis of the findings, we can presume that the RF algorithm was moderately superior to any other ML algorithms used in this study to predict malnutrition status among under-five children in Bangladesh. Finally, the present research recommends applying RF classification with RF feature selection when the prediction of malnutrition is the core interest.

[1]  M. Ruel,et al.  Child feeding practices are associated with child nutritional status in Latin America: innovative uses of the demographic and health surveys. , 2002, The Journal of nutrition.

[2]  Nepal.,et al.  Nepal Demographic and Health Survey 2011 , 2002 .

[3]  Manal Alghamdi,et al.  Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project , 2017, PloS one.

[4]  Mohammad Shahed Masud,et al.  Association of Low-Birth Weight with Malnutrition in Children under Five Years in Bangladesh: Do Mother’s Education, Socio-Economic Status, and Birth Interval Matter? , 2016, PloS one.

[5]  Fengxi Song,et al.  Feature Selection Based on Linear Discriminant Analysis , 2010, 2010 International Conference on Intelligent System Design and Engineering Application.

[6]  Dinesh Kumar,et al.  Influence of infant-feeding practices on nutritional status of under-five children , 2006, Indian journal of pediatrics.

[7]  Mz Hossain,et al.  PREDICTORS OF CHILD CHRONIC MALNUTRITION IN BANGLADESH , 2008 .

[8]  C. Brodley,et al.  Exploration of machine learning techniques in predicting multiple sclerosis disease course , 2017, PloS one.

[9]  M. Onis,et al.  Socioeconomic and demographic factors are associated with worldwide patterns of stunting and wasting of children. , 1997, The Journal of nutrition.

[10]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[11]  S. Vollmer,et al.  Association between economic growth and early childhood undernutrition: evidence from 121 Demographic and Health Surveys from 36 low-income and middle-income countries. , 2014, The Lancet. Global health.

[12]  Deok Won Kim,et al.  Screening for Prediabetes Using Machine Learning Models , 2014, Comput. Math. Methods Medicine.

[13]  Chung-Ho Hsieh,et al.  Novel solutions for an old disease: diagnosis of acute appendicitis with random forest, support vector machines, and artificial neural networks. , 2011, Surgery.

[14]  Amalendu Jyotishi,et al.  Investigation of Nutritional Status of Children based on Machine Learning Techniques using Indian Demographic and Health Survey Data , 2017 .

[15]  E. Nartey,et al.  Factors affecting malnutrition in children and the uptake of interventions to prevent the condition , 2015, BMC Pediatrics.

[16]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[17]  Muin J. Khoury,et al.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes , 2010, BMC Medical Informatics Decis. Mak..

[18]  Xuehui Meng,et al.  Comparison of three data mining models for predicting diabetes or prediabetes by risk factors , 2013, The Kaohsiung journal of medical sciences.

[19]  Md. Israt Rayhan,et al.  Factors Causing Malnutrition among under Five Children in Bangladesh , 2006 .

[20]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[21]  C. Mathers,et al.  Maternal and child undernutrition: global and regional exposures and health consequences , 2008, The Lancet.

[22]  B. Dhaka Bangladesh Demographic and Health Survey 2014 , 2015 .

[23]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.