Simulating time to event prediction with spatiotemporal echocardiography deep learning

Integrating methods for time-to-event prediction with diagnostic imaging modalities is of considerable interest, as accurate estimates of survival requires accounting for censoring of individuals within the observation period. New methods for time-to-event prediction have been developed by extending the cox-proportional hazards model with neural networks. In this paper, to explore the feasibility of these methods when applied to deep learning with echocardiography videos, we utilize the Stanford EchoNet-Dynamic dataset with over 10,000 echocardiograms, and generate simulated survival datasets based on the expert annotated ejection fraction readings. By training on just the simulated survival outcomes, we show that spatiotemporal convolutional neural networks yield accurate survival estimates. Introduction Recent advances in diagnostic echocardiography AI systems have shown that not only is it possible to automate standard measures of cardiac function (ejection fraction), but even measure common biomarkers that are typically never measured via echocardiography scans (Hemoglobin, blood urea nitrogen).1–3 Our group has shown that predicting post-operative right ventricular failure in the setting of left ventricular assist device implantation is feasible by analyzing the full spatiotemporal density of information within each pre-operative echocardiogram.4 Others have now shown that predicting 1-year mortality is possible using just baseline echocardiography.5 Though this was developed using one of the largest echocardiographic datasets, the work was limited by the use of potentially under-parameterized neural networks, and treatment of right censored survival data as a binary event. Time-to-event predictions rather than simple binary predictions will find more relevance in the clinical setting when working with ‘prognostic’ rather than ‘diagnostic’ AI systems.6 Time-to-event analyses are characterized by the ability to predict event probabilities as a function of time, whereas binary classifiers can only provide predictions for one predetermined duration. One of the advantages of this is that it accounts for censoring of individuals within the observation period.7 This more accurately represents the clinical ground truth where individuals may be either lost to follow up, or have yet to develop an event of interest. Baseline Echocardiogram Numpy Pixel Array) VideoTensor Input (32,3,112,112) 0 5 10 1.905 1.910 1.915 1.920 0 5 10 1.905 1.910 1.915 1.920 0.1240 0.1245 0.1250 0.1255 0.1260 0.1265 mod el-1 mod elmod el-3 ++++++++++++++++++++++++++++ +++ +++++++ + + + 0.00 0.25 0.50 0.75 1.00 0 200 400 600 800 Time 7 x 7 x 7 2x c on v2 _x b lo ck s 2x c on v3 _x b lo ck s 2x c on v4 _x b lo ck s 2x c on v5 _x b lo ck s Av er ag e Po ol in g

[1]  Jonathan H. Chen,et al.  Deep Learning Prediction of Biomarkers from Echocardiogram Videos , 2021, medRxiv.

[2]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[3]  V. Roger Asymptomatic Left Ventricular Dysfunction: To Screen or Not to Screen? , 2016, JACC. Heart failure.

[4]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Jaakko Reinikainen,et al.  Time-varying covariates and coefficients in Cox regression models. , 2018, Annals of translational medicine.

[6]  M. Rubenfire,et al.  Echocardiography in pulmonary arterial hypertension: from diagnosis to prognosis. , 2013, Journal of the American Society of Echocardiography : official publication of the American Society of Echocardiography.

[7]  Brandon K. Fornwalt,et al.  Deep-learning-assisted analysis of echocardiographic videos improves predictions of all-cause mortality , 2021, Nature Biomedical Engineering.

[8]  Jessica L. Howard,et al.  Predicting right ventricular failure in the modern, continuous flow left ventricular assist device era. , 2013, The Annals of thoracic surgery.

[9]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[10]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[11]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[12]  John P. Cunningham,et al.  Predicting post-operative right ventricular failure using video-based deep learning , 2021, Nature Communications.

[13]  D.,et al.  Regression Models and Life-Tables , 2022 .

[14]  Martin Wattenberg,et al.  SmoothGrad: removing noise by adding noise , 2017, ArXiv.

[15]  D. Levy,et al.  Natural History of Asymptomatic Left Ventricular Systolic Dysfunction in the Community , 2003, Circulation.

[16]  Ida Scheel,et al.  Time-to-Event Prediction with Neural Networks and Cox Regression , 2019, J. Mach. Learn. Res..

[17]  Curtis P. Langlotz,et al.  Video-based AI for beat-to-beat assessment of cardiac function , 2020, Nature.

[18]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[19]  Jeffrey J. Harden,et al.  Simulating Duration Data for the Cox Model , 2018, Political Science Research and Methods.

[20]  Alexander Binder,et al.  On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation , 2015, PloS one.

[21]  Yutaka Satoh,et al.  Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[22]  C. Watson,et al.  Natriuretic peptide-based screening and collaborative care for heart failure: the STOP-HF randomized trial. , 2013, JAMA.

[23]  M. Wheeler,et al.  The Incremental Value of Right Ventricular Size and Strain in the Risk Assessment of Right Heart Failure Post - Left Ventricular Assist Device Implantation. , 2018, Journal of cardiac failure.

[24]  Jonathan H. Chen,et al.  Deep learning interpretation of echocardiograms , 2019, bioRxiv.

[25]  Yann LeCun,et al.  A Closer Look at Spatiotemporal Convolutions for Action Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26]  L. Shapley A Value for n-person Games , 1988 .

[27]  Hugh Tunstall-Pedoe,et al.  Symptomatic and asymptomatic left-ventricular systolic dysfunction in an urban population , 1997, The Lancet.

[28]  A. C. Kwan,et al.  High-throughput digitization of analog human echocardiography data , 2020, medRxiv.