Quantitative population-health relationship (QPHR) for assessing metabolic syndrome

Metabolic syndrome (MS) is a condition that predisposes individuals to the development of cardiovascular diseases and type 2 diabetes mellitus. A cross-sectional investigation of 15,365 participants residing in metropolitan Bangkok who had received an annual health checkup in 2007 was used in this study. Individuals were classified as MS or non-MS according to the International Diabetes Federation criteria using BMI cutoff of ≥ 25 kg/m2 plus two or more MS components. This study explores the utility of quantitative population-health relationship (QPHR) for predicting MS status as well as discovers variables that frequently occur together. The former was achieved by decision tree (DT) analysis, artificial neural network (ANN), support vector machine (SVM) and principal component analysis (PCA) while the latter was obtained by association analysis (AA). DT outperformed both ANN and SVM in MS classification as deduced from its accuracy value of 99 % as compared to accuracies of 98 % and 91 % for ANN and SVM, respectively. Furthermore, PCA was able to effectively classify individuals as MS and non-MS as observed from the scores plot. Moreover, AA was employed to analyze individuals with MS in order to elucidate pertinent rule from MS components that occur frequently together, which included TG+BP, BP+FPG and TG+FPG where TG, BP and FPG corresponds to triglyceride, blood pressure and fasting plasma glucose, respectively. QPHR was demonstrated to be useful in predicting the MS status of individuals from an urban Thai population. Rules obtained from AA analysis provided general guidelines (i.e. co-occurrences of TG, BP and FPG) that may be used in the prevention of MS in at risk individuals.

[1]  Shu-Hsien Liao,et al.  Mining information users' knowledge for one-to-one marketing on information appliance , 2009, Expert Syst. Appl..

[2]  Virapong Prachayasittikul,et al.  Quantitative prediction of imprinting factor of molecularly imprinted polymers by artificial neural network , 2005, J. Comput. Aided Mol. Des..

[3]  Shu-Hsien Liao,et al.  Mining marketing maps for business alliances , 2008, Expert Syst. Appl..

[4]  Mark Woodward,et al.  The metabolic syndrome identifies a heterogeneous group of metabolic component combinations in the Asia-Pacific region. , 2008, Diabetes research and clinical practice.

[5]  Sattar Hashemi,et al.  A decision tree-based approach for determining low bone mineral density in inflammatory bowel disease using WEKA software , 2007, European journal of gastroenterology & hepatology.

[6]  Régis Beuscart,et al.  Assessing association rules and decision trees on analysis of diabetes data from the DiabCare program in France. , 2002, Studies in health technology and informatics.

[7]  Mi Kyung Kim,et al.  Comorbidity Study on Type 2 Diabetes Mellitus Using Data Mining , 2012, The Korean journal of internal medicine.

[8]  W. B. Lee,et al.  Data Mining in Biomedicine: Current Applications and Further Directions for Research , 2009, J. Softw. Eng. Appl..

[9]  Yi-Fan Wang,et al.  A personalized recommender system for the cosmetic business , 2004, Expert Syst. Appl..

[10]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Modeling the LPS Neutralization Activity of Anti-Endotoxins , 2009, Molecules.

[11]  P. Zimmet,et al.  Diagnosis and classification of diabetes mellitus , 2002 .

[12]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Advances in computational methods to predict the biological activity of compounds , 2010, Expert opinion on drug discovery.

[13]  Mary K Obenshain Application of Data Mining Techniques to Healthcare Data , 2004, Infection Control & Hospital Epidemiology.

[14]  I N Lee,et al.  Data mining techniques applied to medical information , 2000, Medical informatics and the Internet in medicine.

[15]  Bernard C. Jiang,et al.  Using data mining techniques for multi-diseases prediction modeling of hypertension and hyperlipidemia by common risk factors , 2011, Expert Syst. Appl..

[16]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Quantitative structure-imprinting factor relationship of molecularly imprinted polymers. , 2007, Biosensors & bioelectronics.

[17]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Prediction of GFP spectral properties using artificial neural network , 2007, J. Comput. Chem..

[18]  Chartchalerm Isarankura-Na-Ayudhya,et al.  Quantitative structure–property relationship study of spectral properties of green fluorescent protein with support vector machine , 2013 .

[19]  Apilak Worachartcheewan,et al.  Elucidating the Structure-Activity Relationships of the Vasorelaxation and Antioxidation Properties of Thionicotinic Acid Derivatives , 2010, Molecules.

[20]  Ian H. Witten,et al.  Data Mining: Practical Machine Learning Tools and Techniques, 3/E , 2014 .

[21]  R. Chang,et al.  Data mining with decision trees for diagnosis of breast tumor in medical ultrasonic images , 2001, Breast Cancer Research and Treatment.

[22]  Apilak Worachartcheewan,et al.  Lower BMI cutoff for assessing the prevalence of metabolic syndrome in Thai population , 2010, Acta Diabetologica.

[23]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[24]  C. Nantasenamat,et al.  QSAR study of amidino bis-benzimidazole derivatives as potent anti-malarial agents against Plasmodium falciparum , 2013, Chemical Papers.

[25]  A. B. M. Shawkat Ali,et al.  Significant Cancer Prevention Factor Extraction: An Association Rule Discovery Approach , 2011, Journal of Medical Systems.

[26]  B. Balkau,et al.  Comment on the provisional report from the WHO consultation , 1999, Diabetic medicine : a journal of the British Diabetic Association.

[27]  C. Nantasenamat,et al.  Antioxidant, cytotoxicity, and QSAR study of 1-adamantylthio derivatives of 3-picoline and phenylpyridines , 2011, Medicinal Chemistry Research.

[28]  N. Unwin,et al.  Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III) Detection, Evaluation, and Treatment of High Blood Cholesterol Education Program (NCEP) Expert Panel on Executive Summary of the Third Report of the National , 2009 .

[29]  Apilak Worachartcheewan,et al.  Identification of metabolic syndrome using decision tree analysis. , 2010, Diabetes research and clinical practice.

[30]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[31]  Frances Talaska Fischbach,et al.  A manual of laboratory diagnostic tests , 1980 .

[32]  S. Grundy,et al.  Harmonizing the metabolic syndrome: a joint interim statement of the International Diabetes Federation Task Force on Epidemiology and Prevention; National Heart, Lung, and Blood Institute; American Heart Association; World Heart Federation; International Atherosclerosis Society; and International As , 2009, Circulation.

[33]  Gyeong Ho Lee,et al.  Diagnostic Analysis of Patients with Essential Hypertension Using Association Rule Mining , 2010, Healthcare informatics research.

[34]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[35]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[36]  E. V. Thomas,et al.  Partial least-squares methods for spectral analyses. 1. Relation to other quantitative calibration methods and the extraction of qualitative information , 1988 .

[37]  Illhoi Yoo,et al.  Data Mining in Healthcare and Biomedicine: A Survey of the Literature , 2012, Journal of Medical Systems.

[38]  Ching-Hsue Cheng,et al.  A predictive model for cerebrovascular disease using data mining , 2011, Expert Syst. Appl..

[39]  Apilak Worachartcheewan,et al.  Predicting the free radical scavenging activity of curcumin derivatives , 2011 .

[40]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[41]  Sou-Sen Leu,et al.  Use of association rules to explore cause-effect relationships in occupational accidents in the Taiwan construction industry , 2010 .

[42]  J. Mckenney,et al.  Executive Summary of The Third Report of The National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, And Treatment of High Blood Cholesterol In Adults (Adult Treatment Panel III). , 2001, JAMA.

[43]  S. Ólafsson,et al.  Data mining for recognizing patterns in foodborne disease outbreaks , 2010 .

[44]  B. Rhee,et al.  A decision tree-based approach for identifying urban-rural differences in metabolic syndrome risk factors in the adult Korean population , 2012, Journal of Endocrinological Investigation.

[45]  Chia-Wen Liao,et al.  Data mining for occupational injuries in the Taiwan construction industry , 2008 .

[46]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[47]  Chartchalerm Isarankura-Na-Ayudhya,et al.  A practical overview of quantitative structure-activity relationship , 2009 .

[48]  L. Fogelfeld,et al.  Metabolic syndrome and prediabetes. , 2006, Disease-a-month : DM.