Data Mining Methods Find Demographic Predictors of Preterm Birth

BackgroundPreterm births in the United States increased from 11.0% to 11.4% between 1996 and 1997; they continue to be a complex healthcare problem in the United States. ObjectiveThe objective of this research was to compare traditional statistical methods with emerging new methods called data mining or knowledge discovery in databases in identifying accurate predictors of preterm births. MethodAn ethnically diverse sample (N = 19,970) of pregnant women provided data (1,622 variables) for new methods of analysis. Preterm birth predictors were evaluated using traditional statistical and newer data mining analyses. ResultsSeven demographic variables (maternal age and binary coding for county of residence, education, marital status, payer source, race, and religion) yielded a .72 area under the curve using Receiving Operating Characteristic curves to test predictive accuracy. The addition of hundreds of other variables added only a .03 to the area under the curve. ConclusionSimilar results across data mining methods suggest that results are data-driven and not method-dependent, and that demographic variables offer a small set of parsimonious variables with reasonable accuracy in predicting preterm birth outcomes in a racially diverse population.

[1]  A. Germain,et al.  Preterm labor: placental pathology and clinical correlation. , 1999, Obstetrics and gynecology.

[2]  Vimla L. Patel,et al.  Viewpoint: Science and Practice: A Case for Medical Informatics as a Local Science of Design , 1998, J. Am. Medical Informatics Assoc..

[3]  D. Savitz,et al.  Case-control study of caffeinated beverages and preterm delivery. , 1995, American journal of epidemiology.

[4]  N. Hasaniya,et al.  Direct laparoscopic entry using a sharp and dull trocar technique. , 1996, Obstetrics and gynecology.

[5]  R. Mittendorf,et al.  The control of labor. , 1999, The New England journal of medicine.

[6]  H. Hoffman,et al.  Medical, psychosocial, and behavioral risk factors do not explain the increased risk for low birth weight among black women. , 1996, American journal of obstetrics and gynecology.

[7]  Christopher G. Chute,et al.  Position Paper: A Framework for Comprehensive Health Terminology Systems in the United States: Development Guidelines, Criteria for Selection, and Public Policy Implications , 1998, J. Am. Medical Informatics Assoc..

[8]  R. Creasy,et al.  Preterm birth prevention: where are we? , 1993, American journal of obstetrics and gynecology.

[9]  V. Cokkinides,et al.  Physical violence during pregnancy: maternal complications and birth outcomes. , 1999, Obstetrics and gynecology.

[10]  M. Bracken,et al.  Low-to-moderate gestational alcohol use and intrauterine growth retardation, low birthweight, and preterm delivery. , 1997, Annals of epidemiology.

[11]  A. Samadi,et al.  Maternal Hypertension and Spontaneous Preterm Births Among Black Women , 1998, Obstetrics and gynecology.

[12]  J. Feldman,et al.  An association between the heat-humidity index and preterm labor and delivery: a preliminary analysis. , 1997, American journal of public health.

[13]  E. Thom,et al.  The Preterm Prediction Study: Fetal Fibronectin, Bacterial Vaginosis, and Peripartum Infection , 1996, Obstetrics and gynecology.

[14]  J. Berger,et al.  Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence , 1987 .

[15]  M. Ryynänen,et al.  The effects on fetal development of high alpha-fetoprotein and maternal smoking. , 1999, American journal of public health.

[16]  C. Lockwood,et al.  Stress-associated preterm delivery: the role of corticotropin-releasing hormone. , 1999, American journal of obstetrics and gynecology.

[17]  Edward B. Fowlkes,et al.  Risk analysis of the space shuttle: Pre-Challenger prediction of failure , 1989 .

[18]  K. Ali,et al.  High altitude and spontaneous preterm birth , 1996, International journal of gynaecology and obstetrics: the official organ of the International Federation of Gynaecology and Obstetrics.

[19]  M. Cogswell,et al.  Maternal occupational and hobby chemical exposures as risk factors for neural tube defects. , 1999 .

[20]  Jerzy W. Grzymala-Busse,et al.  Machine learning for an expert system to predict preterm birth risk. , 1994, Journal of the American Medical Informatics Association : JAMIA.

[21]  J. Iams,et al.  Prevention of preterm birth. , 1988, Seminars in perinatology.

[22]  M. Klebanoff,et al.  A review of risk scoring for preterm birth. , 1993, Clinics in perinatology.

[23]  Joyce A. Mitchell,et al.  Research Paper: An Expert System for Performance-based Direct Delivery of Published Clinical evidence , 1996, J. Am. Medical Informatics Assoc..

[24]  W. James Excess males in preterm birth: interactions with gestational age, race, and multiple birth. , 1997, Obstetrics and gynecology.

[25]  H. Coovadia,et al.  Randomized trial testing the effect of vitamin A supplementation on pregnancy outcomes and early mother-to-child HIV-1 transmission in Durban, South Africa. South African Vitamin A Study Group. , 1999, AIDS.

[26]  C. Lockwood,et al.  Cervical length in uncomplicated pregnancy: A study of sociodemographic predictors of cervical changes across gestation. , 1999, American journal of obstetrics and gynecology.

[27]  J. Iams,et al.  Cervical ultrasonography , 1997, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[28]  W E Hammond,et al.  Converting a legacy system database into relational format to enhance query efficiency. , 1995, Proceedings. Symposium on Computer Applications in Medical Care.

[29]  D R Lovell,et al.  Design, construction and evaluation of systems to predict risk in obstetrics. , 1997, International journal of medical informatics.

[30]  Creasy Rk,et al.  Prevention of preterm birth. , 1981 .

[31]  T. Nas,et al.  Are sociodemographic factors predictive of preterm birth? A reappraisal of the 1958 British Perinatal Mortality Survey , 1997, British journal of obstetrics and gynaecology.

[32]  W. Feldman,et al.  The economic impact of high-risk pregnancies. , 1997, Journal of health care finance.

[33]  M. Mclean,et al.  Prediction and early diagnosis of preterm labor: a critical review. , 1993, Obstetrical & gynecological survey.

[34]  D. Savitz,et al.  Employment, job strain, and preterm delivery among women in North Carolina. , 1997, American journal of public health.

[35]  D A Savitz,et al.  Risk Factors for Preterm Birth Subtypes , 1998, Epidemiology.

[36]  W O Thompson,et al.  Validity of the Creasy Risk Appraisal Instrument for Prediction of Preterm Labor , 1995, Nursing research.

[37]  M. Cogswell,et al.  Maternal weight gain and preterm delivery: differential effects by body mass index. , 1999, Epidemiology.

[38]  J R Campbell,et al.  A framework for comprehensive health terminology systems in the United States: development guidelines, criteria for selection, and public policy implications. ANSI Healthcare Informatics Standards Board Vocabulary Working Group and the Computer-Based Patient Records Institute Working Group on Codes , 1998, Journal of the American Medical Informatics Association : JAMIA.

[39]  G. G. Nahum,et al.  Fetal weight gain at term: linear with minimal dependence on maternal obesity. , 1995, American journal of obstetrics and gynecology.

[40]  J. Beck,et al.  Periodontal Infection as a Possible Risk Factor for Preterm Low Birth Weight. , 1996, Journal of periodontology.