Appalachian Informatics Platform – A Multicentric Data Warehouse with Embedded Data Analytics and Interactive Visualization for Clinical and Translational Research (Preprint)

BACKGROUND The Appalachian population is distinct, not just culturally and geographically but also in its health care needs, facing the most health care disparities in the United States. To meet these unique demands, Appalachian medical centers need an arsenal of analytics and data science tools with the foundation of a centralized data warehouse to transform health care data into actionable clinical interventions. However, this is an especially challenging task given the fragmented state of medical data within Appalachia and the need for integration of other types of data such as environmental, social, and economic with medical data. OBJECTIVE This paper aims to present the structure and process of the development of an integrated platform at a midlevel Appalachian academic medical center along with its initial uses. METHODS The Appalachian Informatics Platform was developed by the Appalachian Clinical and Translational Science Institute's Division of Clinical Informatics and consists of 4 major components: a centralized clinical data warehouse, modeling (statistical and machine learning), visualization, and model evaluation. Data from different clinical systems, billing systems, and state- or national-level data sets were integrated into a centralized data warehouse. The platform supports research efforts by enabling curation and analysis of data using the different components, as appropriate. RESULTS The Appalachian Informatics Platform is functional and has supported several research efforts since its implementation for a variety of purposes, such as increasing knowledge of the pathophysiology of diseases, risk identification, risk prediction, and health care resource utilization research and estimation of the economic impact of diseases. CONCLUSIONS The platform provides an inexpensive yet seamless way to translate clinical and translational research ideas into clinical applications for regions similar to Appalachia that have limited resources and a largely rural population.

[1]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[2]  Amar K. Das,et al.  A simple heuristic for blindfolded record linkage , 2012, J. Am. Medical Informatics Assoc..

[3]  Paul L. Huang A comprehensive definition for metabolic syndrome , 2009, Disease Models & Mechanisms.

[4]  M. Hendryx,et al.  Hospitalization Patterns Associated with Appalachian Coal Mining , 2007, Journal of toxicology and environmental health. Part A.

[5]  Korine N. Kolivras,et al.  Environmental health disparities in the Central Appalachian region of the United States , 2017, Reviews on environmental health.

[6]  J. Takeda,et al.  Identification of individuals with non-alcoholic fatty liver disease by the diagnostic criteria for the metabolic syndrome. , 2012, World journal of gastroenterology.

[7]  Tomohiro Tanaka,et al.  Decision tree analysis to stratify risk of de novo non-melanoma skin cancer following liver transplantation , 2018, Journal of Cancer Research and Clinical Oncology.

[8]  A. Statnikov,et al.  Bridging a translational gap: using machine learning to improve the prediction of PTSD , 2015, BMC Psychiatry.

[9]  N. Paynter,et al.  The Rural vs Urban Practice Decision , 2002 .

[10]  M. Hendryx,et al.  Health Disparities and Environmental Competence: A Case Study of Appalachian Coal Mining , 2008 .

[11]  R. Ohsfeldt,et al.  Hospital Characteristics are Associated With Readiness to Attain Stage 2 Meaningful Use of Electronic Health Records , 2017, The Journal of rural health : official journal of the American Rural Health Association and the National Rural Health Care Association.

[12]  M. Kogan,et al.  Widening Disparities In Infant Mortality And Life Expectancy Between Appalachia And The Rest Of The United States, 1990-2013. , 2017, Health affairs.

[13]  Dean F Sittig,et al.  A Survey of Informatics Platforms That Enable Distributed Comparative Effectiveness Research Using Multi-institutional Heterogenous Clinical Data , 2012, Medical care.

[14]  P. Harris,et al.  Research electronic data capture (REDCap) - A metadata-driven methodology and workflow process for providing translational research informatics support , 2009, J. Biomed. Informatics.

[15]  William A. Mattingly,et al.  Predicting 30-Day Mortality in Hospitalized Patients with Community-Acquired Pneumonia Using Statistical and Machine Learning Approaches , 2017 .

[16]  D. Bates,et al.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients. , 2014, Health affairs.

[17]  M. R. Simpson,et al.  "God brought all these churches together": issues in developing religion-health partnerships in an Appalachian community. , 1999, Public health nursing.

[18]  Peter F. Liddle,et al.  Clinical Utility of Machine-Learning Approaches in Schizophrenia: Improving Diagnostic Confidence for Translational Neuroimaging , 2013, Front. Psychiatry.

[19]  Erin Holve,et al.  Opportunities and Challenges for Comparative Effectiveness Research (CER) With Electronic Clinical Data: A Perspective From the EDM Forum , 2012, Medical care.

[20]  M. Hendryx,et al.  Health Disparities In Appalachia. , 2017, Health affairs.

[21]  Jennifer D. Lenardson,et al.  Rural and Remote Food Environments and Obesity , 2015, Current Obesity Reports.

[22]  J. Raper,et al.  Characteristics of HIV-infected adults in the Deep South and their utilization of mental health services: A rural vs. urban comparison , 2006, AIDS care.

[23]  Susan L Flesher,et al.  Is Fever a Red Flag for Bacterial Pneumonia in Children With Viral Bronchiolitis? , 2019, Global pediatric health.

[24]  Alvaro D. Oijuela-Canon,et al.  Respiratory Diseases discrimination based on acoustic lung signals and neural networks , 2015, 2015 20th Symposium on Signal Processing, Images and Computer Vision (STSIVA).

[25]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[26]  R. C. Jisha,et al.  A Real Time Patient Monitoring System for Heart Disease Prediction Using Random Forest Algorithm , 2015, SIRS.

[27]  Min Chen,et al.  Disease Prediction by Machine Learning Over Big Data From Healthcare Communities , 2017, IEEE Access.

[28]  P. C. Joshi,et al.  ESTIMATION OF WAIST AND HIP CIRCUMFERENCES USING BODY MASS INDEX IN DELHI SCHOOL GIRLS (6–11 YEARS) , 2014, Journal of Biosocial Science.

[29]  Shelly C. Lu,et al.  NASH Leading Cause of Liver Transplant in Women: Updated Analysis of Indications For Liver Transplant and Ethnic and Gender Variances , 2018, The American Journal of Gastroenterology.

[30]  A. Coustasse,et al.  The Impact of Big Data on Chronic Disease Management , 2017, The health care manager.

[31]  Sterling C. Johnson,et al.  Multimodal and Multiscale Deep Neural Networks for the Early Diagnosis of Alzheimer's Disease using structural MR and FDG-PET images , 2017, ArXiv.

[32]  D. Hoaglin,et al.  Predicting waist circumference from body mass index , 2012, BMC Medical Research Methodology.

[33]  S. Woolf The meaning of translational research and why it matters. , 2008, JAMA.

[34]  Natarajan Sriraam,et al.  Classification of non-chronic and chronic kidney disease using SVM neural networks , 2017 .

[35]  Steven E Hyler,et al.  Problems with the Electronic Medical Record in Clinical Psychiatry: A Hidden Cost , 2005, Journal of psychiatric practice.

[36]  S. Bauer,et al.  Fully automated stroke tissue estimation using random forest classifiers (FASTER) , 2017, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[37]  D. Engelbret Storyboarding — A better way of planning and writing proposals , 1972 .

[38]  Shiraz I Mishra,et al.  Health Extension and Clinical and Translational Science: An Innovative Strategy for Community Engagement , 2017, The Journal of the American Board of Family Medicine.

[39]  Krishan L. Khatri,et al.  Early Detection of Peak Demand Days of Chronic Respiratory Diseases Emergency Department Visits Using Artificial Neural Networks , 2018, IEEE Journal of Biomedical and Health Informatics.

[40]  Terry Anthony Byrd,et al.  Big data analytics: Understanding its capabilities and potential benefits for healthcare organizations , 2018 .

[41]  Saeed Ayat,et al.  A comparison of artificial neural networks learning algorithms in predicting tendency for suicide , 2012, Neural Computing and Applications.

[42]  N. Mann,et al.  Evaluation of Rural vs Urban Trauma Patients Served by 9-1-1 Emergency Medical Services , 2017, JAMA surgery.

[43]  J Bernadette Moore,et al.  Non-alcoholic fatty liver disease: the hepatic consequence of obesity and the metabolic syndrome , 2010, Proceedings of the Nutrition Society.

[44]  Kevin A. Henry,et al.  Rural vs urban residence affects risk-appropriate colorectal cancer screening. , 2013, Clinical gastroenterology and hepatology : the official clinical practice journal of the American Gastroenterological Association.

[45]  Reid Luhman,et al.  Appalachian English stereotypes: Language attitudes in Kentucky , 1990, Language in Society.

[46]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[47]  W. Morgan,et al.  The mortality of Appalachian coal miners, 1963 to 1971. , 1974, Archives of environmental health.

[49]  Alfred A Cecchetti Why Introduce Machine Learning To Rural Health Care , 2018 .

[50]  L. Shaw,et al.  Errors in Electronic Health Record–Based Data Query of Statin Prescriptions in Patients With Coronary Artery Disease in a Large, Academic, Multispecialty Clinic Practice , 2018, Journal of the American Heart Association.

[51]  Qiang Zhang,et al.  Risk prediction of type II diabetes based on random forest model , 2017, 2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB).

[52]  B. J. Mcculloch,et al.  The relationship of family proximity and social support to the mental health of older rural adults: The Appalachian context , 1995 .

[53]  Shridar Ganesan,et al.  Roadmap to a Comprehensive Clinical Data Warehouse for Precision Medicine Applications in Oncology , 2017, Cancer informatics.

[54]  Hua Xu,et al.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries , 2011, J. Am. Medical Informatics Assoc..