Automated causal inference in application to randomized controlled clinical trials

Randomized controlled trials (RCTs) are considered as the gold standard for testing causal hypotheses in the clinical domain. However, the investigation of prognostic variables of patient outcome in a hypothesized cause-effect route is not feasible using standard statistical methods. Here, we propose a new automated causal inference method (AutoCI) built upon the invariant causal prediction (ICP) framework for the causal re-interpretation of clinical trial data. Compared to existing methods, we show that the proposed AutoCI allows to efficiently determine the causal variables with a clear differentiation on two real-world RCTs of endometrial cancer patients with mature outcome and extensive clinicopathological and molecular data. This is achieved via suppressing the causal probability of non-causal variables by a wide margin. In ablation studies, we further demonstrate that the assignment of causal probabilities by AutoCI remain consistent in the presence of confounders. In conclusion, these results confirm the robustness and feasibility of AutoCI for future applications in real-world clinical analysis.

[1]  A. Jemal,et al.  Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries , 2021, CA: a cancer journal for clinicians.

[2]  Nguyen H. Tran,et al.  Safety and efficacy of the ChAdOx1 nCoV-19 vaccine (AZD1222) against SARS-CoV-2: an interim analysis of four randomised controlled trials in Brazil, South Africa, and the UK , 2020, Lancet.

[3]  Jian Peng,et al.  When causal inference meets deep learning , 2020, Nature Machine Intelligence.

[4]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[5]  Hector Zenil,et al.  Causal deconvolution by algorithmic generative models , 2019, Nature Machine Intelligence.

[6]  Rémi,et al.  Endometrium FIFTEEN-YEAR RADIOTHERAPY OUTCOMES OF THE RANDOMIZED PORTEC-1 TRIAL FOR ENDOMETRIAL CARCINOMA , 2011 .

[7]  T. Bosse,et al.  Incorporation of molecular characteristics into endometrial cancer management , 2019, Histopathology.

[8]  P. Koper,et al.  Postoperative radiotherapy for Stage 1 endometrial carcinoma: long-term outcome of the randomized PORTEC trial with central pathology review. , 2005, International journal of radiation oncology, biology, physics.

[9]  P. Koper,et al.  Surgery and postoperative radiotherapy versus surgery alone for patients with stage-1 endometrial carcinoma: multicentre randomised trial , 2000, The Lancet.

[10]  P. Pollock,et al.  Refining prognosis and identifying targetable pathways for high-risk endometrial cancer; a TransPORTEC initiative , 2015, Modern Pathology.

[11]  T. Bosse,et al.  Adjuvant therapy for endometrial cancer in the era of molecular classification: radiotherapy, chemoradiation and novel targets for therapy , 2020, International Journal of Gynecological Cancer.

[12]  Swarat Chaudhuri,et al.  HOUDINI: Lifelong Learning as Program Synthesis , 2018, NeurIPS.

[13]  Mario Lucic,et al.  Are GANs Created Equal? A Large-Scale Study , 2017, NeurIPS.

[14]  Samuel Leung,et al.  Evaluation of endometrial carcinoma prognostic immunohistochemistry markers in the context of molecular classification , 2017, The journal of pathology. Clinical research.

[15]  Chris Hawblitzel,et al.  Safe to the last instruction: automated verification of a type-safe operating system , 2010, PLDI '10.

[16]  H. Putter,et al.  Ten-year results of the PORTEC-2 trial for high-intermediate risk endometrial carcinoma: improving patient selection for adjuvant therapy , 2018, British Journal of Cancer.

[17]  Bibb Allen The Role of the FDA in Ensuring the Safety and Efficacy of Artificial Intelligence Software and Devices. , 2019, Journal of the American College of Radiology : JACR.

[18]  Jonas Peters,et al.  Causal inference by using invariant prediction: identification and confidence intervals , 2015, 1501.01332.

[19]  Robert Platt,et al.  Specifying a target trial prevents immortal time bias and other self-inflicted injuries in observational analyses. , 2016, Journal of clinical epidemiology.

[20]  D. Lambrechts,et al.  Prognostic Significance of POLE Proofreading Mutations in Endometrial Cancer , 2014, Journal of the National Cancer Institute.

[21]  Chuang Gan,et al.  The Neuro-Symbolic Concept Learner: Interpreting Scenes Words and Sentences from Natural Supervision , 2019, ICLR.

[22]  Marcus Rohrbach,et al.  Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering , 2019, ICML.

[23]  Heather Dawson,et al.  Next-generation tissue microarray (ngTMA) increases the quality of biomarker studies: an example using CD3, CD8, and CD45RO in the tumor microenvironment of six different solid tumor types , 2013, Journal of Translational Medicine.

[24]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25]  T. Bosse,et al.  L1 cell adhesion molecule is a strong predictor for distant recurrence and overall survival in early stage endometrial cancer: pooled PORTEC trial results. , 2014, European journal of cancer.

[26]  M. Kloor,et al.  The Immune Biology of Microsatellite-Unstable Cancer. , 2016, Trends in cancer.

[27]  H. Putter,et al.  Vaginal brachytherapy versus pelvic external beam radiotherapy for patients with endometrial cancer of high-intermediate risk (PORTEC-2): an open-label, non-inferiority, randomised trial , 2010, The Lancet.

[28]  Issa J Dahabreh,et al.  Extending inferences from a randomized trial to a new target population , 2018, Statistics in Medicine.

[29]  M. Rugge,et al.  Mismatch Repair Deficiency, Microsatellite Instability, and Survival: An Exploratory Analysis of the Medical Research Council Adjuvant Gastric Infusional Chemotherapy (MAGIC) Trial , 2017, JAMA oncology.

[30]  T. Bosse,et al.  Substantial lymph-vascular space invasion (LVSI) is a significant risk factor for recurrence in endometrial cancer--A pooled analysis of PORTEC 1 and 2 trials. , 2015, European journal of cancer.

[31]  Christina Heinze-Deml,et al.  Invariant Causal Prediction for Nonlinear Models , 2017, Journal of Causal Inference.

[32]  Zhitang Chen,et al.  Causal Discovery with Reinforcement Learning , 2019, ICLR.

[33]  P. Lambin,et al.  Nomograms for prediction of outcome with or without adjuvant radiation therapy for patients with endometrial cancer: a pooled analysis of PORTEC-1 and PORTEC-2 trials. , 2015, International journal of radiation oncology, biology, physics.

[34]  J. Pearl Causal inference in statistics: An overview , 2009 .

[35]  C. Villani Optimal Transport: Old and New , 2008 .

[36]  H. Putter,et al.  Molecular Classification of the PORTEC-3 Trial for High-Risk Endometrial Cancer: Impact on Prognosis and Benefit From Adjuvant Therapy , 2020, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[37]  J. Locascio,et al.  Randomised controlled trials – the gold standard for effectiveness research , 2018, BJOG : an international journal of obstetrics and gynaecology.

[38]  M. Hernán,et al.  Emulating a target trial of antiretroviral therapy regimens started before conception and risk of adverse birth outcomes , 2018, AIDS.

[39]  A. Talhouk,et al.  Molecular Subtype Not Immune Response Drives Outcomes in Endometrial Carcinoma , 2018, Clinical Cancer Research.

[40]  H. Putter,et al.  Improved Risk Assessment by Integrating Molecular and Clinicopathological Factors in Early-stage Endometrial Cancer—Combined Analysis of the PORTEC Cohorts , 2016, Clinical Cancer Research.

[41]  Spiros C. Denaxas,et al.  Avoidable flaws in observational analyses: an application to statins and cancer , 2019, Nature Medicine.

[42]  M. Lipsitch,et al.  Negative Controls: A Tool for Detecting Confounding and Bias in Observational Studies , 2010, Epidemiology.

[43]  Armando Solar-Lezama,et al.  DreamCoder: growing generalizable, interpretable knowledge with wake–sleep Bayesian program learning , 2020, Philosophical Transactions of the Royal Society A.

[44]  Benjamin J. Raphael,et al.  Mutational landscape and significance across 12 major cancer types , 2013, Nature.

[45]  Marc Brockschmidt,et al.  Differentiable Programs with Neural Libraries , 2016, ICML.

[46]  K. Mertz,et al.  Prognostic Integrated Image-Based Immune and Molecular Profiling in Early-Stage Endometrial Cancer , 2020, Cancer Immunology Research.

[47]  Judea Pearl,et al.  Causal Inference in the Health Sciences: A Conceptual Introduction , 2001, Health Services and Outcomes Research Methodology.

[48]  Jens Rittscher,et al.  Precision immunoprofiling by image analysis and artificial intelligence , 2018, Virchows Archiv.

[49]  Jiang Bian,et al.  Causal inference and counterfactual prediction in machine learning for actionable healthcare , 2020, Nature Machine Intelligence.

[50]  J. Pfanzagl,et al.  Studies in the history of probability and statistics XLIV A forerunner of the t-distribution , 1996 .

[51]  Juan L Gamella,et al.  Active Invariant Causal Prediction: Experiment Selection through Stability , 2020, NeurIPS.

[52]  Chandler Squires,et al.  ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery , 2019, AISTATS.

[53]  Cyrus Chargari,et al.  ESGO/ESTRO/ESP guidelines for the management of patients with endometrial carcinoma , 2020, International Journal of Gynecological Cancer.

[54]  H. Mackay,et al.  Prognostic significance of L1CAM expression and its association with mutant p53 expression in high-risk endometrial cancer , 2016, Modern Pathology.

[55]  Sepp Hochreiter,et al.  GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[56]  H. Levene Robust tests for equality of variances , 1961 .

[57]  Ida Scheel,et al.  Time-to-Event Prediction with Neural Networks and Cox Regression , 2019, J. Mach. Learn. Res..