A fully integrated open-source toolkit for mining healthcare big-data: architecture and applications

We create an analytics toolkit based on open-source modules that facilitate the exploration of healthcare-related datasets. We illustrate our framework by providing a detailed analysis of physician and hospital ratings data. Our technique should prove valuable to software developers, big-data architects, hospital administrators, policy makers and patients. As an illustration of the capabilities of our toolkit, we examine a controversial issue in the medical field regarding the relationship between seniority of medical professionals and clinical outcomes. We use a publicly available dataset of national hospital ratings in the USA to suggest that there is no significant association between experience of medical professionals and hospital ratings as defined by the US government.

[1]  Joel M Kupfer The Graying of US Physicians: Implications for Quality and the Future Supply of Physicians. , 2016, JAMA.

[2]  Charles Safran,et al.  Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[3]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[4]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[5]  Jay Giri,et al.  Changes in Hospitalizations, Treatment Patterns, and Outcomes During Major Cardiovascular Meetings. , 2015, JAMA internal medicine.

[6]  Brian E. Granger,et al.  IPython: A System for Interactive Scientific Computing , 2007, Computing in Science & Engineering.

[7]  S. Miller,et al.  Systematic Review: The Relationship Between Clinical Experience and Quality of Health Care , 2006 .

[8]  Rajarshi Das,et al.  A framework for analyzing publicly available healthcare data , 2015, 2015 17th International Conference on E-health Networking, Application & Services (HealthCom).

[9]  Krysthel Engstrom,et al.  Changes in Hospitalizations, Treatment Patterns, and Outcomes During Major Cardiovascular Meetings. , 2015, JAMA internal medicine.

[10]  Wes McKinney,et al.  Python for Data Analysis , 2012 .

[11]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[12]  M. R. Martínez-Torres,et al.  Current issues and research trends on open-source software communities , 2014, Technol. Anal. Strateg. Manag..

[13]  F. McAlister,et al.  Physician experience and outcomes among patients admitted to general internal medicine teaching wards , 2015, Canadian Medical Association Journal.

[14]  B. Landon,et al.  Measuring low-value care in Medicare. , 2014, JAMA internal medicine.

[15]  Mark F Kurd,et al.  A Review and Analysis of the YODA Trials: What Can We Glean Clinically? , 2014, Current Reviews in Musculoskeletal Medicine.

[16]  S. Cemaj,et al.  The Aging Surgeon: Implications for the Workforce, the Surgeon, and the Patient. , 2016, The Surgical clinics of North America.

[17]  Susanne Boll,et al.  Digital Health Devices for Everyone! , 2014, IEEE Pervasive Computing.

[18]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[19]  A. Jena,et al.  Changes in Hospitalizations, Treatment Patterns, and Outcomes During Major Cardiovascular Meetings--Reply. , 2015, JAMA internal medicine.

[20]  Scott F. Midkiff Internet-Scale Sensor Systems: Design and Policy , 2003, IEEE Pervasive Comput..

[21]  S. Schneeweiss Learning from big health care data. , 2014, The New England journal of medicine.

[22]  H. Krumholz Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. , 2014, Health affairs.

[23]  I. Guyon,et al.  The Higgs Machine Learning Challenge , 2015 .

[24]  D. Estrin,et al.  Open mHealth Architecture: An Engine for Health Care Innovation , 2010, Science.

[25]  A. Bullinger,et al.  Open innovation in health care: analysis of an open health platform. , 2012, Health policy.

[26]  Robert Fletcher,et al.  Systematic Review: The Relationship between Clinical Experience and Quality of Health Care , 2005, Annals of Internal Medicine.

[27]  Mark Finkelstein,et al.  A Medical Application to Bridge the Gap between Clinicians and Clinical Data , 2015, MIE.