Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics

Data-driven healthcare policy discussions are gaining traction after the COVID-19 outbreak and ahead of the 2020 US presidential elections. The US has a hybrid healthcare system that does not provide universal coverage, although a mandate enacted a few years ago (the Affordable Care Act, ACA) provides coverage for the majority of Americans. The US has the highest health expenditure per capita of all western and developed countries; however, most Americans do not tap into the benefits of preventive healthcare. It is estimated that only 8% of Americans undergo routine preventive screenings. At the state level, only 15 of the 50 states have above-average preventive healthcare metrics. In the literature, many studies focus on the cure of diseases (research areas such as drug discovery and disease prediction), while only a minority have examined data-driven preventive measures, a matter that Americans and policy makers ought to place at the forefront of national issues. In this work, we present solutions for preventive practices and policies through Machine Learning (ML) methods. ML is morally neutral: its outcomes depend on the data used to train the models. We therefore make the case that Big Data is an imperative paradigm for healthcare. We examine disparities in clinical data for US patients by developing correlation and imputation methods for data completeness, and we identify non-conventional patterns. The data lifecycle followed is methodical and deliberate; 1000+ clinical, demographic, and laboratory variables are collected from the Centers for Disease Control and Prevention (CDC). Multiple statistical methods are applied (Pearson correlations, Cramér's V, MICE, and ANOVA), and unsupervised ML models are also examined (K-modes and K-prototypes for clustering). The results presented in the paper provide pointers to preventive chronic disease tests, and the models are tested and evaluated.
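As an illustration of one of the association measures named above, the following is a minimal sketch of Cramér's V for two categorical variables. It is not the authors' implementation: the function name `cramers_v` and the toy inputs are assumptions made for this example, and it uses the plain (uncorrected) chi-squared statistic with NumPy only.

```python
import numpy as np


def cramers_v(x, y):
    """Cramér's V between two categorical sequences (hypothetical helper).

    V = sqrt(chi2 / (n * (min(rows, cols) - 1))), ranging from 0 (no
    association) to 1 (perfect association). No bias correction is applied.
    """
    x = np.asarray(x)
    y = np.asarray(y)
    if x.shape != y.shape or x.ndim != 1:
        raise ValueError("x and y must be 1-D sequences of equal length")

    # Map each category to an integer code.
    _, xi = np.unique(x, return_inverse=True)
    _, yi = np.unique(y, return_inverse=True)
    xi, yi = xi.ravel(), yi.ravel()
    rows, cols = xi.max() + 1, yi.max() + 1

    # Build the contingency table of observed counts.
    observed = np.zeros((rows, cols))
    np.add.at(observed, (xi, yi), 1)

    # Expected counts under independence; every row and column total is
    # positive because each category occurs at least once.
    n = observed.sum()
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0)) / n
    chi2 = ((observed - expected) ** 2 / expected).sum()

    k = min(rows, cols) - 1
    if k == 0:
        # A variable with a single category carries no association signal.
        return 0.0
    return float(np.sqrt(chi2 / (n * k)))


# Toy data (made up for illustration only).
insured = ["yes", "yes", "no", "no"]
screened = ["yes", "yes", "no", "no"]
print(cramers_v(insured, screened))
```

In practice, a measure like this would be computed pairwise across categorical survey variables to flag strongly associated pairs; for small samples a bias-corrected variant may be preferable.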
