Data Mining for Lifestyle Risk Factors Associated with Overweight and Obesity among Adolescents

Data mining techniques have been applied to many areas in the business world and our daily life, including healthcare and clinical health services. One of the mostly watched health problems is obesity and overweight, particularly for children and adolescents. In this paper, we try to find the most significant lifestyle risk factors associated with overweight and obesity among high school students in the US. Lifestyle survey data from the 2011 National Youth Risk Behavior Survey (YRBS) was used with the students' body weight statuses, overweight or obesity, considered as two target variables. Both logistic regression models and decision tree models were created for each target variable. Both the logistic regression and decision tree method show that frequently doing physical activity and having breakfast everyday were protective factors against being overweight or obese. Smoking and drinking sugar-sweeten beverage frequently were found to be associated with an increased risk to be obese.

[1]  Hong Qiao,et al.  Comparing data mining methods with logistic regression in childhood obesity prediction , 2009, Inf. Syst. Frontiers.

[2]  A. Morrill,et al.  The Obesity Epidemic in the United States , 2004, Journal of public health policy.

[3]  Katherine M Flegal,et al.  Prevalence and trends in overweight among US children and adolescents, 1999-2000. , 2002, JAMA.

[4]  Riccardo Bellazzi,et al.  Predictive data mining in clinical medicine: a focus on selected methods and applications , 2011, WIREs Data Mining Knowl. Discov..

[5]  Peter J.F. Lucas Bayesian analysis, pattern analysis, and data mining in health care , 2004, Current opinion in critical care.

[6]  P. Katzmarzyk,et al.  Comparison of overweight and obesity prevalence in school‐aged youth from 34 countries and their relationships with physical activity and dietary patterns , 2005, Obesity reviews : an official journal of the International Association for the Study of Obesity.

[7]  J Gittelsohn,et al.  Overweight among children and adolescents in a Native Canadian community: prevalence and associated factors. , 2000, The American journal of clinical nutrition.

[8]  M. Beydoun,et al.  The obesity epidemic in the United States--gender, age, socioeconomic, racial/ethnic, and geographic characteristics: a systematic review and meta-regression analysis. , 2007, Epidemiologic reviews.

[9]  R. Uauy,et al.  Obesity in children and young people: a crisis in public health. , 2004, Obesity reviews : an official journal of the International Association for the Study of Obesity.

[10]  K. Flegal,et al.  Prevalence of overweight and obesity among US children, adolescents, and adults, 1999-2002. , 2004, JAMA.

[11]  S. Lemon,et al.  Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression , 2003, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[12]  N. Krebs,et al.  Assessment of Child and Adolescent Overweight and Obesity , 2007, Pediatrics.

[13]  B. Saboo,et al.  Prevalence of overweight and obesity in Indian adolescent school going children: its relationship with socioeconomic status and associated lifestyle factors. , 2010, The Journal of the Association of Physicians of India.

[14]  Wahidah Husain,et al.  Parameter Identification and Selection for Childhood Obesity Prediction Using Data Mining , 2012 .

[15]  T. Lobstein,et al.  Worldwide trends in childhood overweight and obesity. , 2006, International journal of pediatric obesity : IJPO : an official journal of the International Association for the Study of Obesity.

[16]  Yang Li,et al.  Prevalence and risk factors of being overweight or obese among children and adolescents in northeast China , 2013, Pediatric Research.

[17]  H. Al-Hazzaa,et al.  Lifestyle factors associated with overweight and obesity among Saudi adolescents , 2012, BMC Public Health.