Using Machine Learning to Plan Rehabilitation for Home Care Clients: Beyond "Black-Box" Predictions

Resistance to adopting machine-learning algorithms in clinical practice may be due to a perception that these are “black-box” techniques and incompatible with decision-making based on evidence and clinical experience. We believe this resistance is unfortunate, given the increasing availability of large databases containing assessment information that could benefit from machine-learning and data-mining techniques, thereby providing a new and important source of evidence upon which to base clinical decisions. We have focused our investigation on the clinical applications of machine-learning algorithms on older persons in a home care rehabilitation setting. Data for this research were obtained from standardized client assessments using the comprehensive RAI-Home Care (RAI-HC) assessment instrument. Our work has shown that machine-learning algorithms can produce better decisions than standard clinical protocols. More importantly, we have shown that machine-learning algorithms can do much more than make “black-box” predictions; they can generate important new clinical and scientific insights. These insights can be used to make better decisions about treatment plans for patients and about resource allocation for healthcare services, resulting in better outcomes for patients, and in a more efficient and effective healthcare system.

[1]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[2]  Kerry Kuluski,et al.  Aging at home: integrating community-based care for older persons. , 2009, HealthcarePapers.

[3]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[4]  Mu Zhu,et al.  Darwinian Evolution in Parallel Universes: A Parallel Genetic Algorithm for Variable Selection , 2006, Technometrics.

[5]  N. Ikegami,et al.  Comprehensive Clinical Assessment in Community Setting: Applicability of the MDS‐HC , 1997, Journal of the American Geriatrics Society.

[6]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[7]  Gary Naglie,et al.  Interdisciplinary inpatient care for elderly people with hip fracture: a randomized controlled trial. , 2002, CMAJ : Canadian Medical Association journal = journal de l'Association medicale canadienne.

[8]  Naoki Ikegami,et al.  Home care quality indicators (HCQIs) based on the MDS-HC. , 2004, The Gerontologist.

[9]  R. Hofmann-Wellenhof,et al.  A support vector machine for decision support in melanoma recognition , 2010, Experimental dermatology.

[10]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[11]  Brant E. Fries,et al.  Problem Identification and Care Plan Responses in a Home and Community-Based Services Program , 2004 .

[12]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[13]  Arnold B Mitnitski,et al.  Frailty, fitness and late-life mortality in relation to chronological and biological age , 2002, BMC geriatrics.

[14]  R. L. Kennedy,et al.  Artificial neural network models for prediction of acute coronary syndromes using clinical data from the time of presentation. , 2005, Annals of emergency medicine.

[15]  Philip E. Gill,et al.  Practical optimization , 1981 .

[16]  A. Mitnitski,et al.  Frailty in relation to the accumulation of deficits. , 2007, The journals of gerontology. Series A, Biological sciences and medical sciences.

[17]  A. Hershkovitz,et al.  Factors affecting short-term rehabilitation outcomes of disabled elderly patients with proximal hip fracture. , 2007, Archives of physical medicine and rehabilitation.

[18]  B E Fries,et al.  Integrated Health Information Systems Based on the RAI/MDS Series of Instruments , 1999, Healthcare management forum.

[19]  L. Rapport,et al.  Measures of executive functioning as predictors of functional ability and social integration in a rehabilitation sample. , 1999, Archives of physical medicine and rehabilitation.

[20]  A. Fugl-Meyer,et al.  On prediction of vocational rehabilitation outcome at a Swedish employability institute. , 2003, Journal of rehabilitation medicine.

[21]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[22]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[23]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[24]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[25]  R Begg,et al.  A machine learning approach for automated recognition of movement patterns using basic, kinetic and kinematic gait data. , 2005, Journal of biomechanics.

[26]  Miguel Ángel Guevara-López,et al.  Discovering Mammography-based Machine Learning Classifiers for Breast Cancer Diagnosis , 2012, Journal of Medical Systems.

[27]  T. Gill,et al.  A standard procedure for creating a frailty index , 2008, BMC geriatrics.

[28]  Mu Zhu,et al.  The K-nearest neighbor algorithm predicted rehabilitation potential better than current Clinical Assessment Protocol. , 2007, Journal of clinical epidemiology.

[29]  Galit Shmueli,et al.  To Explain or To Predict? , 2010 .

[30]  Mu Zhu,et al.  Using machine learning algorithms to guide rehabilitation planning for home care clients , 2007, BMC Medical Informatics Decis. Mak..

[31]  Mu Zhu,et al.  Stochastic Stepwise Ensembles for Variable Selection , 2010, 1003.5930.

[32]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[33]  Alessandra Marengoni,et al.  Predictors of successful rehabilitation in geriatric patients: subgroup analysis of patients with cognitive impairment , 2007, Aging clinical and experimental research.

[34]  Rebecca Smith,et al.  A comparative analysis of multi-level computer-assisted decision making systems for traumatic injuries , 2009, BMC Medical Informatics Decis. Mak..

[35]  Frank Knoefel,et al.  State of the art in geriatric rehabilitation. Part I: review of frailty and comprehensive geriatric assessment. , 2003, Archives of physical medicine and rehabilitation.

[36]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[37]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[38]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[39]  Elizaveta Levina,et al.  Discussion of "Stability selection" by N. Meinshausen and P. Buhlmann , 2010 .

[40]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[41]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001, Statistical Science.

[42]  B E Fries,et al.  Scaling ADLs within the MDS. , 1999, The journals of gerontology. Series A, Biological sciences and medical sciences.

[43]  Peter J.F. Lucas Bayesian analysis, pattern analysis, and data mining in health care , 2004, Current opinion in critical care.

[44]  Hailong Zhu,et al.  Support vector machine for classification of walking conditions of persons after stroke with dropped foot. , 2009, Human movement science.

[45]  M Colombo,et al.  The impact of cognitive impairment on the rehabilitation process in geriatrics. , 2004, Archives of gerontology and geriatrics. Supplement.

[46]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[47]  Konstantin G Arbeev,et al.  Cumulative Deficits Better Characterize Susceptibility to Death in Elderly People than Phenotypic Frailty: Lessons from the Cardiovascular Health Study , 2008, Journal of the American Geriatrics Society.

[48]  G. Onder,et al.  Minimum Data Set for Home Care: A Valid Instrument to Assess Frail Older People Living in the Community , 2000, Medical care.

[49]  Paul Stolee,et al.  Examining three frailty conceptualizations in their ability to predict negative outcomes for home-care clients. , 2010, Age and ageing.

[50]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[51]  P. Bühlmann,et al.  The group lasso for logistic regression , 2008 .

[52]  Frank Knoefel,et al.  State of the art in geriatric rehabilitation. Part II: clinical challenges. , 2003, Archives of physical medicine and rehabilitation.

[53]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[54]  J. Hirdes,et al.  The Method for Assigning Priority Levels (MAPLe): A new decision-support system for allocating home care resources , 2008, BMC medicine.

[55]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[56]  Gareth M. James,et al.  Variable Inclusion and Shrinkage Algorithms , 2008 .

[57]  A. Atkinson Subset Selection in Regression , 1992 .

[58]  Tso-Jung Yen,et al.  Discussion on "Stability Selection" by Meinshausen and Buhlmann , 2010 .

[59]  Sing-Fai Tam,et al.  Predicting osteoarthritic knee rehabilitation outcome by using a prediction model developed by data mining techniques , 2004, International journal of rehabilitation research. Internationale Zeitschrift fur Rehabilitationsforschung. Revue internationale de recherches de readaptation.

[60]  Kenneth J Ottenbacher,et al.  Comparison of logistic regression and neural network analysis applied to predicting living setting after hip fracture. , 2004, Annals of epidemiology.

[61]  D. Mehr,et al.  MDS Cognitive Performance Scale. , 1994, Journal of gerontology.

[62]  S. Fosså,et al.  Cancer patients’ needs for rehabilitation services , 2011, Acta oncologica.

[63]  John P Hirdes,et al.  The MDS‐CHESS Scale: A New Measure to Predict Mortality in Institutionalized Older People , 2003, Journal of the American Geriatrics Society.

[64]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[65]  R K Price,et al.  Applying artificial neural network models to clinical decision making. , 2000, Psychological assessment.

[66]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[67]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[68]  Arnold B Mitnitski,et al.  Techniques for knowledge discovery in existing biomedical databases: estimation of individual aging effects in cognition in relation to dementia. , 2003, Journal of clinical epidemiology.

[69]  Mu Zhu,et al.  Kernels and Ensembles , 2007, 0712.1027.

[70]  H. Akaike A new look at the statistical model identification , 1974 .

[71]  S. Gunn,et al.  Machine Learning Can Improve Prediction of Severity in Acute Pancreatitis Using Admission Values of APACHE II Score and C-Reactive Protein , 2006, Pancreatology.