Feature rearrangement based deep learning system for predicting heart failure mortality

BACKGROUND AND OBJECTIVE Heart Failure is a clinical syndrome commonly caused by any structural or functional impairment. Fast and accurate mortality prediction for Heart Failure is essential to improve the health care of patients and prevent them from death. However, due to the imbalance problem and poor feature representation in Heart Failure data, mortality prediction of Heart Failure is difficult with some simple models. To handle these problems, this study is focused on proposing a fast and accurate Heart Failure mortality prediction framework. METHODS This paper proposes a feature rearrangement based deep learning system for heart failure mortality prediction. The proposed framework improves the performance of predicting heart failure mortality by handling imbalance problem and achieving better feature representation. This paper also proposes a method named Feature rearrangement based convolutional layer, which demonstrates that the order of the input features is essential for the convolutional network. RESULTS The proposed system is experimentally evaluated on real-world Heart Failure data collected from the EHR system of Shanghai Shuguang Hospital, where 10,198 in-patients records are extracted between March 2009 and April 2016. Internal comparison results illustrate that the proposed framework achieves the best performance for Heart Failure mortality prediction. Extensive experimental results compared with other machine learning methods demonstrate that the proposed method has the highest average accuracy and area under the curve while predicting the three goals of in-hospital mortality, 30-day mortality, and 1-year mortality. Finally, top 12 essential clinical features are mined with their chi-square scores, which can help to assist clinicians in the treatment and research of heart failure. CONCLUSIONS The proposed method successfully predict different target in three observation windows. Feature rearrangement based convolutional layer and Focal loss are employed into the proposed framework, which helps promote the prediction accuracy of Heart Failure death. The proposed method is fast and accurate for predicting heart failure mortality, especially for imbalance situation. This paper also provide a reasonable pipeline to model EHRs data and handle imbalance problem in medical data.

[1]  Kwang-Hyun Cho,et al.  Encyclopedia of Systems Biology , 2013, Springer New York.

[2]  Daniela Fischer,et al.  Digital Design And Computer Architecture , 2016 .

[3]  Girish Chavan,et al.  NOBLE – Flexible concept recognition for large-scale biomedical natural language processing , 2016, BMC Bioinformatics.

[4]  Marleen de Bruijne,et al.  Machine learning approaches in medical image analysis: From detection to diagnosis , 2016, Medical Image Anal..

[5]  U. Rajendra Acharya,et al.  Application of new deep genetic cascade ensemble of SVM classifiers to predict the Australian credit scoring , 2019, Appl. Soft Comput..

[6]  Peter J. F. Lucas,et al.  Bayesian networks in biomedicine and health-care , 2004, Artif. Intell. Medicine.

[7]  Mehdi T. Harandi,et al.  Workshop on software specification and design , 1988, SOEN.

[8]  Gaetano D Gargiulo,et al.  Towards Real-Time Heartbeat Classification: Evaluation of Nonlinear Morphological Features and Voting Method , 2019, Sensors.

[9]  N. Azad,et al.  Management of chronic heart failure in the older population , 2014, Journal of geriatric cardiology : JGC.

[10]  Ioannis Stamos,et al.  CNN-Based Object Segmentation in Urban LIDAR with Missing Points , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[11]  Hayit Greenspan,et al.  A comparative study for chest radiograph image retrieval using binary texture and deep learning classification , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[12]  Juan Cui,et al.  Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity , 2006, Proteomics.

[13]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Zhe Wang,et al.  Cascade interpolation learning with double subspaces and confidence disturbance for imbalanced problems , 2019, Neural Networks.

[15]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[16]  Michael Fowler,et al.  An informatics-based approach to reducing heart failure all-cause readmissions: the Stanford heart failure dashboard , 2017, J. Am. Medical Informatics Assoc..

[17]  Adeeb Noor,et al.  An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure , 2019, IEEE Access.

[18]  Sengul Dogan,et al.  Automated arrhythmia detection using novel hexadecimal local pattern and multilevel wavelet transform with ECG signals , 2019, Knowl. Based Syst..

[19]  R. Modre-Osprian,et al.  HerzMobil, an Integrated and Collaborative Telemonitoring-Based Disease Management Program for Patients With Heart Failure: A Feasibility Study Paving the Way to Routine Care , 2018, JMIR cardio.

[20]  Cindy C. Parman ICD-10-CM. , 2004, The Journal of oncology management : the official journal of the American College of Oncology Administrators.

[21]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[22]  I. Piña,et al.  Forecasting the Impact of Heart Failure in the United States: A Policy Statement From the American Heart Association , 2013, Circulation. Heart failure.

[23]  Nassir Navab,et al.  AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images , 2016, IEEE Trans. Medical Imaging.

[24]  Juan Pablo Martínez,et al.  Automatic SVM classification of sudden cardiac death and pump failure death from autonomic and repolarization ECG markers. , 2015, Journal of electrocardiology.

[25]  Mark V. Williams,et al.  Interventions to Reduce 30-Day Rehospitalization: A Systematic Review , 2011, Annals of Internal Medicine.

[26]  Yang Wang,et al.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. , 2018, Cancer genomics & proteomics.

[27]  Hongyuan Zha,et al.  Boundary-Eliminated Pseudoinverse Linear Discriminant for Imbalanced Problems , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Rustam M. Vahidov,et al.  Application of machine learning techniques for supply chain demand forecasting , 2008, Eur. J. Oper. Res..

[29]  Marco Zaffalon,et al.  Time for a change: a tutorial for comparing multiple classifiers through Bayesian analysis , 2016, J. Mach. Learn. Res..

[30]  Juho Kannala,et al.  Cell Segmentation Proposal Network for Microscopy Image Analysis , 2016, LABELS/DLMIA@MICCAI.

[31]  U. Rajendra Acharya,et al.  A new machine learning technique for an accurate diagnosis of coronary artery disease , 2019, Comput. Methods Programs Biomed..

[32]  Zhe Wang,et al.  Geometric Structural Ensemble Learning for Imbalanced Problems , 2020, IEEE Transactions on Cybernetics.

[33]  Dongdong Li,et al.  Collaborative and geometric multi-kernel learning for multi-class classification , 2020, Pattern Recognit..

[34]  P E Leaverton,et al.  Prevalence and mortality rate of congestive heart failure in the United States. , 1992, Journal of the American College of Cardiology.

[35]  U. Rajendra Acharya,et al.  Novel deep genetic ensemble of classifiers for arrhythmia detection using ECG signals , 2019, Neural Computing and Applications.

[36]  M. Fornage,et al.  Heart Disease and Stroke Statistics—2017 Update: A Report From the American Heart Association , 2017, Circulation.

[37]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[38]  Shafiq R. Joty,et al.  Correction of: Sleep Quality Prediction From Wearable Data Using Deep Learning , 2016, JMIR mHealth and uHealth.

[39]  Shahram Ebadollahi,et al.  Early detection of heart failure with varying prediction windows by structured and unstructured data in electronic health records , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[40]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[41]  Alexander Gammerman,et al.  Machine learning classification with confidence: Application of transductive conformal predictors to MRI-based diagnostic and prognostic markers in depression , 2011, NeuroImage.

[42]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..