Adaptive regression modeling of biomarkers of potential harm in a population of U.S. adult cigarette smokers and nonsmokers

BackgroundThis article describes the data mining analysis of a clinical exposure study of 3585 adult smokers and 1077 nonsmokers. The analysis focused on developing models for four biomarkers of potential harm (BOPH): white blood cell count (WBC), 24 h urine 8-epi-prostaglandin F2α (EPI8), 24 h urine 11-dehydro-thromboxane B2 (DEH11), and high-density lipoprotein cholesterol (HDL).MethodsRandom Forest was used for initial variable selection and Multivariate Adaptive Regression Spline was used for developing the final statistical modelsResultsThe analysis resulted in the generation of models that predict each of the BOPH as function of selected variables from the smokers and nonsmokers. The statistically significant variables in the models were: platelet count, hemoglobin, C-reactive protein, triglycerides, race and biomarkers of exposure to cigarette smoke for WBC (R-squared = 0.29); creatinine clearance, liver enzymes, weight, vitamin use and biomarkers of exposure for EPI8 (R-squared = 0.41); creatinine clearance, urine creatinine excretion, liver enzymes, use of Non-steroidal antiinflammatory drugs, vitamins and biomarkers of exposure for DEH11 (R-squared = 0.29); and triglycerides, weight, age, sex, alcohol consumption and biomarkers of exposure for HDL (R-squared = 0.39).ConclusionsLevels of WBC, EPI8, DEH11 and HDL were statistically associated with biomarkers of exposure to cigarette smoking and demographics and life style factors. All of the predictors togather explain 29%-41% of the variability in the BOPH.

[1]  Binbing Yu Approximating the risk score for disease diagnosis using MARS , 2009, Journal of applied statistics.

[2]  K. Tatara,et al.  Association between lifestyle and white blood cell count: a study of Japanese male office workers. , 2003, Occupational medicine.

[3]  W. Pryor,et al.  Oxidants in Cigarette Smoke Radicals, Hydrogen Peroxide, Peroxynitrate, and Peroxynitrite a , 1993, Annals of the New York Academy of Sciences.

[4]  I. Emerit,et al.  Oxidative stress in chronic hepatitis C: a preliminary study on the protective effects of antioxidant flavonoids. , 2005, Hepato-gastroenterology.

[5]  B. Palumbo,et al.  Increase of isoprostane 8-epi-PGF2 αafter restarting smoking , 2001 .

[6]  G. Abel,et al.  Effects of biochemically confirmed smoking cessation on white blood cell count. , 2005, Mayo Clinic proceedings.

[7]  H. M. Vinkers,et al.  Multivariate adaptive regression splines—studies of HIV reverse transcriptase inhibitors , 2004 .

[8]  Leann Myers,et al.  Comparison of multivariate adaptive regression splines and logistic regression in detecting SNP-SNP interactions and their application in prostate cancer , 2008, Journal of Human Genetics.

[9]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[10]  K. Williams,et al.  Atherosclerosis--an inflammatory disease. , 1999, The New England journal of medicine.

[11]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[12]  Thomas P. Erlinger,et al.  Smoking Cessation and Cardiovascular Disease Risk Factors: Results from the Third National Health and Nutrition Examination Survey , 2005, PLoS medicine.

[13]  J. Cooper,et al.  Activation of the Coagulant Pathway in Cigarette Smokers , 1998, Thrombosis and Haemostasis.

[14]  Q. Liang,et al.  The relationship between smoking machine derived tar yields and biomarkers of exposure in adult cigarette smokers in the US. , 2009, Regulatory toxicology and pharmacology : RTP.

[15]  P. Langenberg,et al.  Impact of lowering triglycerides on raising HDL-C in hypertriglyceridemic and non-hypertriglyceridemic subjects. , 2007, International journal of cardiology.

[16]  J. Cracowski,et al.  Isoprostanes as a biomarker of lipid peroxidation in humans: physiology, pharmacology and clinical implications. , 2002, Trends in pharmacological sciences.

[17]  D. Ávila,et al.  Potentially adverse interactions between haloperidol and valerian. , 2008, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[18]  S. Feng,et al.  Population estimates for biomarkers of exposure to cigarette smoke in adult U.S. cigarette smokers. , 2009, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[19]  G. Ciabattoni,et al.  Excretion of thromboxane metabolites in healthy women after cessation of smoking. , 1993, Arteriosclerosis and thrombosis : a journal of vascular biology.

[20]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[21]  Rui Xiong,et al.  APPLICATION OF MULTIVARIATE ADAPTIVE REGRESSION SPLINES (MARS) TO THE PREFERENCE MAPPING OF CHEESE STICKS , 2004 .

[22]  Å. Westin,et al.  Effect of smoking reduction and cessation on cardiovascular risk factors. , 2001, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[23]  J. Oates,et al.  Cigarette smoking and hemostatic function. , 1988, American heart journal.

[24]  T. Fischer,et al.  Particulate and vapor phase constituents of cigarette mainstream smoke and risk of myocardial infarction. , 2001, Atherosclerosis.

[25]  Office on Smoking The Health Consequences of Smoking: A Report of the Surgeon General , 2004 .

[26]  J. Haddow,et al.  Cigarette smoking and serum lipid and lipoprotein concentrations: an analysis of published data. , 1989, BMJ.

[27]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[28]  R. Kinser,et al.  Biomarkers of exposure and potential harm in adult smokers of 3–7 mg tar yield (Federal Trade Commission) cigarettes and in adult non-smokers , 2006, Biomarkers : biochemical indicators of exposure, response, and susceptibility to chemicals.

[29]  J. Ozer,et al.  The current state of serum biomarkers of hepatotoxicity. , 2008, Toxicology.

[30]  B. Palumbo,et al.  Increase of isoprostane 8-epi-PGF(2alpha)after restarting smoking. , 2001, Prostaglandins, leukotrienes, and essential fatty acids.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  S. Goto,et al.  Platelet spontaneous aggregation in platelet-rich plasma is increased in habitual smokers. , 1999, Thrombosis research.