Public health application of predictive modeling: an example from farm vehicle crashes

Background: The goal of predictive modelling is to estimate the likelihood of future events; in climate science, for example, it is used to forecast weather patterns and significant weather events. In public health, increasingly sophisticated predictive models are used to predict health events in patients and to screen high-risk individuals, such as for cardiovascular disease and breast cancer. Although causal modelling is frequently used in epidemiology to identify risk factors, predictive modelling provides highly useful information for individual risk prediction and for informing courses of treatment. Such predictive knowledge is often of great utility to physicians, counsellors, health education specialists, policymakers and other professionals, who may then advise course correction or interventions to prevent adverse health outcomes. In this manuscript, we use an example dataset that documents farm vehicle crashes, together with conventional statistical methods, to forecast the risk of an injury or death in a farm vehicle crash for a specific individual or scenario.

Results: Using data from 7094 farm crashes that occurred between 2005 and 2010 in nine Midwestern states, we demonstrate and discuss predictive model fitting approaches, model validation techniques using external datasets, and the calculation and interpretation of predicted probabilities. We then develop two automated risk prediction tools using readily available software packages. We discuss best practices and common limitations associated with predictive models built from observational datasets.

Conclusions: Predictive analysis offers tools that could aid the decision making of policymakers, physicians, and environmental health practitioners to improve public health.
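To illustrate what calculating a predicted probability for a specific scenario can look like, the following is a minimal sketch that assumes a logistic regression model. The abstract does not name the model form, so that choice is an assumption. The coefficient values and predictor names (`night`, `high_speed_road`) are hypothetical placeholders chosen for illustration and are not estimates from the farm crash data.

```python
import math

# Hypothetical coefficients on the log-odds scale, for illustration only.
# These are NOT estimates from the farm vehicle crash dataset.
COEFFICIENTS = {
    "intercept": -1.0,
    "night": 0.5,            # hypothetical predictor: crash occurred at night
    "high_speed_road": 0.8,  # hypothetical predictor: high posted speed limit
}


def predicted_probability(scenario, coefficients=COEFFICIENTS):
    """Return the predicted probability of injury or death for one scenario.

    `scenario` maps predictor names to values (0/1 for indicator variables).
    The linear predictor (log-odds) is converted to a probability with the
    inverse logit function.
    """
    log_odds = coefficients["intercept"]
    for name, value in scenario.items():
        log_odds += coefficients[name] * value
    return 1.0 / (1.0 + math.exp(-log_odds))


# Compare a baseline scenario with a higher-risk scenario.
baseline = predicted_probability({"night": 0, "high_speed_road": 0})
risky = predicted_probability({"night": 1, "high_speed_road": 1})
print(f"baseline: {baseline:.3f}, night on high-speed road: {risky:.3f}")
```

In a model of this form, each scenario is a set of predictor values, and the output is read as the estimated probability of the outcome for a crash with those characteristics, not as a causal effect of any single predictor.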
