A Data analytical Framework for Improving Real-Time, Decision Support Systems in Healthcare

Title of dissertation: A DATA ANALYTICAL FRAMEWORK FOR IMPROVING REAL-TIME, DECISION SUPPORT SYSTEMS IN HEALTHCARE Inbal Yahav, PhD Candidate, 2010 Dissertation directed by: Associate Professor Galit Shmueli Department of Decisions, Operations and Information Technologies In this dissertation we develop a framework that combines data mining, statistics and operations research methods for improving real-time decision support systems in healthcare. Our approach consists of three main concepts: data gathering and preprocessing, modeling, and deployment. We introduce the notion of offline and semi-offline modeling to differentiate between models that are based on known baseline behavior and those based on a baseline with missing information. We apply and illustrate the framework in the context of two important healthcare contexts: biosurveillance and kidney allocation. In the biosurveillance context, we address the problem of early detection of disease outbreaks. We discuss integer programming-based univariate monitoring and statistical and operations research-based multivariate monitoring approaches. We assess method performance on authentic biosurveillance data. In the kidney allocation context, we present a two-phase model that combines an integer programming-based learning phase and a dataanalytical based real-time phase. We examine and evaluate our method on the current Organ Procurement and Transplantation Network (OPTN) waiting list. In both contexts, we show that our framework produces significant improvements over existing methods. A DATA ANALYTICAL FRAMEWORK FOR IMPROVING REAL-TIME, DECISION SUPPORT SYSTEMS IN HEALTHCARE

[1]  Bhavik R. Bakshi,et al.  Multiscale SPC using wavelets: Theoretical analysis and properties , 2003 .

[2]  Douglas C. Montgomery,et al.  Economic Design of T2 Control Charts to Maintain Current Control of a Process , 1972 .

[3]  Raghu Pasupathy,et al.  A method for fast generation of bivariate poisson random vectors , 2007, 2007 Winter Simulation Conference.

[4]  Andrew J. Schaefer,et al.  Determining the Acceptance of Cadaveric Livers Using an Implicit Model of the Waiting List , 2007, Oper. Res..

[5]  D. Howard,et al.  Why do transplant surgeons turn down organs? A model of the accept/reject decision. , 2002, Journal of health economics.

[6]  Stefanos A. Zenios Models for Kidney Allocation , 2005 .

[7]  Philippe Lambert,et al.  Parametric accelerated failure time models with random effects and an application to kidney transplant survival , 2004, Statistics in medicine.

[8]  L. Green Capacity Planning and Management in Hospitals , 2005 .

[9]  George C. Runger,et al.  Comparison of multivariate CUSUM charts , 1990 .

[10]  Thomas Porter,et al.  Identifying Diabetic Patients: A Data Mining Approach , 2009, AMCIS.

[11]  Andrew W. Moore,et al.  Algorithms for rapid outbreak detection: a research synthesis , 2005, J. Biomed. Informatics.

[12]  M. Brandeau,et al.  Operations research and health care : a handbook of methods and applications , 2004 .

[13]  Rizvan Erol,et al.  An Optimization Model for Locating and Sizing Emergency Medical Service Stations , 2008, Journal of Medical Systems.

[14]  H. Hotelling Multivariate Quality Control-illustrated by the air testing of sample bombsights , 1947 .

[15]  Fay Cobb Payton,et al.  Data mining in health care applications , 2003 .

[16]  Charles W. Champ,et al.  The Performance of Exponentially Weighted Moving Average Charts With Estimated Parameters , 2001, Technometrics.

[17]  Andrew J. Schaefer,et al.  Maximizing the Efficiency of the U.S. Liver Allocation System Through Region Design , 2010, Manag. Sci..

[18]  J. Kalbfleisch,et al.  Calculating Life Years from Transplant (LYFT): Methods for Kidney and Kidney‐Pancreas Candidates , 2008, American journal of transplantation : official journal of the American Society of Transplantation and the American Society of Transplant Surgeons.

[19]  R. Platt,et al.  A generalized linear mixed models approach for detecting incident clusters of disease in small areas, with an application to biological terrorism. , 2004, American journal of epidemiology.

[20]  Jie Chen,et al.  Using data mining to segment healthcare markets from patients' preference perspectives. , 2009, International journal of health care quality assurance.

[21]  Ronald D. Fricker,et al.  Comparing Directionally Sensitive MCUSUM and MEWMA Procedures with Application to Biosurveillance , 2008 .

[22]  Zachary G. Stoumbos,et al.  Robustness to Non-Normality of the Multivariate EWMA Control Chart , 2002 .

[23]  F. Krummenauer Limit theorems for multivariate discrete distributions , 1998 .

[24]  P. Eggers Racial Differences in Access to Kidney Transplantation , 1995, Health care financing review.

[25]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[26]  Oguzhan Alagoz,et al.  Optimizing Organ Allocation and Acceptance , 2009 .

[27]  Ronald D. Fricker,et al.  Directionally Sensitive Multivariate Statistical Process Control Procedures with Application to Syndromic Surveillance , 2007 .

[28]  Uri Yechiali,et al.  A Time-dependent Stopping Problem with Application to Live Organ Transplants , 1985, Oper. Res..

[29]  Galit Shmueli,et al.  Automated time series forecasting for biosurveillance , 2007, Statistics in medicine.

[30]  Richard A. Davis,et al.  Time Series: Theory and Methods , 2013 .

[31]  D. Karlis An EM algorithm for multivariate Poisson distribution and related models , 2003 .

[32]  W. R. Schucany,et al.  Simulating multivariate distributions with specific correlations , 2004 .

[33]  Sean Murphy,et al.  Preparing Biosurveillance Data for Classic Monitoring , 2007 .

[34]  R. Stoelting,et al.  Stoelting's Anesthesia and Co-Existing Disease , 2012 .

[35]  Charles W. Champ,et al.  Effects of Parameter Estimation on Control Chart Properties: A Literature Review , 2006 .

[36]  Uri Yechiali,et al.  One-Attribute Sequential Assignment Match Processes in Discrete Time , 1995, Oper. Res..

[37]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[38]  Tom Burr,et al.  Modeling emergency department visit patterns for infectious disease complaints: results and application to disease surveillance , 2005, BMC Medical Informatics Decis. Mak..

[39]  Bruce W. Schmeiser,et al.  Developing a national allocation model for cadaveric kidneys , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[40]  Fred Spiring,et al.  Introduction to Statistical Quality Control , 2007, Technometrics.

[41]  Huifen Chen,et al.  Initialization for NORTA: Generation of Random Vectors with Specified Marginals and Correlations , 2001, INFORMS J. Comput..

[42]  Dean Follmann,et al.  A Simple Multivariate Test for One-Sided Alternatives , 1996 .

[43]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[44]  Eric Rosow,et al.  Virtual Instrumentation and Real-Time Executive Dashboards: Solutions for Health Care Systems , 2003, Nursing administration quarterly.

[45]  R. Kay The Analysis of Survival Data , 2012 .

[46]  Alberto Luceño,et al.  Statistical Control by Monitoring and Adjustment , 2009 .

[47]  H. Burkom Development, adaptation, and assessment of alerting algorithms for biosurveillance , 2003 .

[48]  R. Nelsen An Introduction to Copulas , 1998 .

[49]  Mark S. Daskin,et al.  Location of Health Care Facilities , 2005 .

[50]  R. Burton,et al.  A Role for Operational Research in Health Care Planning and Management Teams , 1978, The Journal of the Operational Research Society.

[51]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[52]  Z. Zafirova,et al.  Stoeltingʼs Anesthesia and Co-Existing Disease , 2010 .

[53]  Inbal Yahav,et al.  ALGORITHM COMBINATION FOR IMPROVED , 2011 .

[54]  Oguzhan Alagoz,et al.  A Clinically Based Discrete-Event Simulation of End-Stage Liver Disease and the Organ Allocation Process , 2005, Medical decision making : an international journal of the Society for Medical Decision Making.

[55]  Galit Shmueli,et al.  Simulating Multivariate Syndromic Time Series and Outbreak Signatures , 2007 .

[56]  Pierre L'Ecuyer,et al.  Efficient Correlation Matching for Fitting Discrete Multivariate Distributions with Arbitrary Marginals and Normal-Copula Dependence , 2009, INFORMS J. Comput..

[57]  H. Edwin Romeijn,et al.  Introduction to the Special Issue on Operations Research in Health Care , 2008, Oper. Res..

[58]  Rhonda Righter,et al.  A Resource Allocation Problem in a Random Environment , 1989, Oper. Res..

[59]  Geoffrey Land,et al.  Improved Definition of Human Leukocyte Antigen Frequencies Among Minorities and Applicability to Estimates of Transplant Compatibility , 2007, Transplantation.

[60]  John Hornberger,et al.  Involving Patients in the Cadaveric Kidney Transplant Allocation Process: A Decision-Theoretic Perspective , 1996 .

[61]  C. Derman,et al.  A Sequential Stochastic Assignment Problem , 1972 .

[62]  William H. Woodall,et al.  A one‐sided MEWMA chart for health surveillance , 2008, Qual. Reliab. Eng. Int..

[63]  R. Crosier Multivariate generalizations of cumulative sum quality-control schemes , 1988 .

[64]  Lawrence M. Wein,et al.  Dynamic Allocation of Kidneys to Candidates on the Transplant Waiting List , 2000, Oper. Res..

[65]  George C. Runger,et al.  Multivariate one-sided control charts , 2006 .

[66]  S. Love,et al.  Survival Analysis Part II: Multivariate data analysis – an introduction to concepts and methods , 2003, British Journal of Cancer.

[67]  Nan Kong,et al.  OPTIMIZING THE EFFICIENCY OF THE UNITED STATES ORGAN ALLOCATION SYSTEM THROUGH REGION REORGANIZATION , 2006 .

[68]  Kenneth D. Mandl,et al.  Time series modeling for syndromic surveillance , 2003, BMC Medical Informatics Decis. Mak..

[69]  Kanti V. Mardia,et al.  Families of Bivariate Distributions. , 1971 .

[70]  Karolina J. Glowacka,et al.  A hybrid data mining/simulation approach for modelling outpatient no-shows in clinic scheduling , 2009, J. Oper. Res. Soc..

[71]  Xuanming Su,et al.  Recipient Choice Can Address the Efficiency-Equity Trade-off in Kidney Transplantation: A Mechanism Design Model , 2006, Manag. Sci..

[72]  Howard S. Burkom,et al.  Statistical Challenges Facing Early Outbreak Detection in Biosurveillance , 2010, Technometrics.

[73]  Stephen E. Fienberg,et al.  Current and Potential Statistical Methods for Monitoring Multiple Data Streams for Biosurveillance , 2006 .

[74]  J. Rice Mathematical Statistics and Data Analysis , 1988 .

[75]  Peter E. Nuesch,et al.  ON THE PROBLEM OF TESTING LOCATION IN MULTIVARIATE POPULATIONS FOR RESTRICTED ALTERNATIVES , 1966 .

[76]  W. Whitt Bivariate Distributions with Given Marginals , 1976 .

[77]  Robert M Merion,et al.  Simulating the Allocation of Organs for Transplantation , 2004, Health care management science.

[78]  Chris Chatfield,et al.  The Holt-Winters Forecasting Procedure , 1978 .

[79]  Bruce L. Golden,et al.  Maximizing cardiac surgery throughput at a major hospital , 2008, SpringSim '08.

[80]  J. Edmonds Paths, Trees, and Flowers , 1965, Canadian Journal of Mathematics.

[81]  Xuanming Su,et al.  Patient Choice in Kidney Allocation: The Role of the Queueing Discipline , 2004, Manuf. Serv. Oper. Manag..

[82]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[83]  Joseph J. Pignatiello,et al.  On constructing T2 control charts for on-line process monitoring , 1999 .

[84]  Geoff Royston,et al.  One hundred years of Operational Research in Health—UK 1948–2048 , 2009, J. Oper. Res. Soc..

[85]  Douglas C. Montgomery,et al.  A review of multivariate control charts , 1995 .

[86]  Ronald D. Fricker,et al.  Directionally Sensitive Multivariate Statistical Process Control Methods with Application to Syndromic Surveillance , 2007 .

[87]  Frank Krummenauer Efficient Simulation of Multivariate Binomial and Poisson Distributions , 1998 .

[88]  Xuanming Su,et al.  Patient Choice in Kidney Allocation: A Sequential Stochastic Assignment Model , 2005, Oper. Res..

[89]  Manuel A. Nunez,et al.  OR Practice - Efficient Short-Term Allocation and Reallocation of Patients to Floors of a Hospital During Demand Surges , 2009, Oper. Res..

[90]  Willem Albers,et al.  Data-Driven Rank Tests for Classes of Tail Alternatives , 1999 .

[91]  Galit Shmueli,et al.  Algorithm Combination for Improved Performance in Biosurveillance Systems , 2007, BioSurveillance.

[92]  H. Edwin Romeijn,et al.  An Exact Method for Balancing Efficiency and Equity in the Liver Allocation Hierarchy , 2012, INFORMS J. Comput..

[93]  H. Scheld,et al.  The heart-allocation simulation model: a tool for comparison of transplantation allocation policies1 , 2003, Transplantation.

[94]  Charles W. Champ,et al.  A multivariate exponentially weighted moving average control chart , 1992 .