Causal Inference via Nonlinear Variable Decorrelation for Healthcare Applications

Causal inference and model interpretability are attracting increasing attention, especially in healthcare and bioinformatics. Despite recent progress, decorrelating features in nonlinear settings while retaining human-interpretable representations has not been adequately investigated. To address this gap, we introduce a novel method with a variable decorrelation regularizer that handles both linear and nonlinear confounding. We further use association rules, mined from the original features, as new representations that more closely approximate human decision patterns and thereby increase model interpretability. Extensive experiments are conducted on four healthcare datasets: one synthetically generated and three real-world collections covering different diseases. Quantitative comparisons with baseline approaches on parameter estimation and causality computation indicate that the proposed model performs better. Furthermore, an expert evaluation by healthcare professionals validates the effectiveness and interpretability of the proposed model.
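To make the idea of a decorrelation regularizer concrete, the following is a minimal sketch and not the regularizer proposed in this work. It assumes a sample-reweighting formulation in which a penalty measures the weighted covariance between different features after simple transforms; the function name `decorrelation_penalty`, the choice of transforms (identity and square), and the weight vector `w` are all illustrative assumptions. Including the squared terms lets the penalty react to some nonlinear dependence that plain covariance would miss.

```python
import numpy as np


def decorrelation_penalty(X, w, transforms=(lambda x: x, np.square)):
    """Hypothetical penalty: sum of squared weighted covariances between
    transformed columns that belong to *different* original features."""
    X = np.asarray(X, dtype=float)
    w = np.asarray(w, dtype=float)
    w = w / w.sum()                          # normalise sample weights

    # Apply every transform to every feature: shape (n, p * n_transforms)
    Z = np.hstack([f(X) for f in transforms])

    mean = w @ Z                             # weighted column means
    Zc = Z - mean                            # centred columns
    cov = Zc.T @ (Zc * w[:, None])           # weighted covariance matrix

    # Record which original feature each column of Z came from
    p = X.shape[1]
    owner = np.tile(np.arange(p), len(transforms))
    cross = owner[:, None] != owner[None, :]  # keep cross-feature pairs only

    # The mask is symmetric, so each unordered pair appears twice
    return float(np.sum(cov[cross] ** 2)) / 2.0


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.normal(size=2000)
    b = rng.normal(size=2000)
    uniform = np.ones(2000)

    X_independent = np.column_stack([a, b])
    # Second feature depends on the first only through a**2 (nonlinear)
    X_dependent = np.column_stack([a, a ** 2 + 0.1 * b])

    print("independent:", decorrelation_penalty(X_independent, uniform))
    print("dependent:  ", decorrelation_penalty(X_dependent, uniform))
```

In a training loop of this kind, the weights `w` would typically be optimised to shrink the penalty alongside the prediction loss; that optimisation step is omitted here.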
