Correlation-driven framework based on graph convolutional network for clinical disease classification

With the increasing popularity of computer-aided technology applied in medicine, great achievements have been made in certain diseases. However, due to the similarity of clinical and histological features, the problem of disease classification has not been well resolved, especially the Crohn’s disease (CD) and intestinal tuberculosis (ITB). In this paper, a novel sample connection driven framework named RFG-GCN is presented to overcome this. Firstly, the RFG-GCN employs a random forest based graph generation algorithm (RFG) to convert structured data into graph data, which considers the correlation between samples. Under the training of two layer-wise graph convolution neural network, an efficient model is established. Our findings show that the classification performance has been significantly improved compared withother methods. Furthermore, to show the extensibility and consistency, we apply our framework to other four public medical datasets. The results indicate that the RFG-GCN framework can achieve competitive and better classification performance.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  V. Sreenivas,et al.  Accuracy of computed tomographic features in differentiating intestinal tuberculosis from Crohn's disease: a systematic review with meta-analysis , 2017, Intestinal research.

[3]  S. Rauschert,et al.  Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification , 2020, Clinical Epigenetics.

[4]  S. N. Deepa,et al.  Medical Dataset Classification: A Machine Learning Paradigm Integrating Particle Swarm Optimization with Extreme Learning Machine Classifier , 2015, TheScientificWorldJournal.

[5]  Shun-ichi Amari,et al.  Information-Geometric Measures as Robust Estimators of Connection Strengths and External Inputs , 2009, Neural Computation.

[6]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[7]  Hongzhe Li,et al.  Model selection and estimation in the matrix normal graphical model , 2012, J. Multivar. Anal..

[8]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  D. Cox The Regression Analysis of Binary Sequences , 1958 .

[10]  Syed Muhammad Anwar,et al.  Medical Image Analysis using Convolutional Neural Networks: A Review , 2017, Journal of Medical Systems.

[11]  Hongyuan Wang,et al.  New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification , 2014, TheScientificWorldJournal.

[12]  Ji Feng,et al.  Deep Forest: Towards An Alternative to Deep Neural Networks , 2017, IJCAI.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  P. Higgins,et al.  Meta-Analytic Bayesian Model For Differentiating Intestinal Tuberculosis from Crohn's Disease , 2017, The American Journal of Gastroenterology.

[15]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[16]  Bradley N. Miller,et al.  Problem solving with algorithms and data structures using Python , 2005 .

[17]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[18]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[19]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Sandeep Kumar Singh,et al.  Hybrid prediction model with missing value imputation for medical data , 2015, Expert Syst. Appl..

[21]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[22]  C. M. Pandey,et al.  Differentiation Between Crohn Disease and Intestinal Tuberculosis in Children , 2018, Journal of pediatric gastroenterology and nutrition.

[23]  Simon Tiberi,et al.  Intestinal tuberculosis: a diagnostic challenge , 2017, Tropical medicine & international health : TM & IH.

[24]  Santosh Kumar,et al.  Medical Dataset Classification Using k -NN and Genetic Algorithm , 2017 .

[25]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[26]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[27]  P. Hebert,et al.  Methods for using data abstracted from medical charts to impute longitudinal missing data in a clinical trial. , 2011, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[28]  Vladimir B. Bajic,et al.  DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches , 2017, Bioinform..

[29]  Shuheng Zhou Gemini: Graph estimation with matrix variate normal instances , 2012, 1209.5075.

[30]  Lior Rokach,et al.  AutoGRD: Model Recommendation Through Graphical Dataset Representation , 2019, CIKM.

[31]  G. Brohee [Crohn's disease and intestinal tuberculosis]. , 1950, Acta gastro-enterologica Belgica.

[32]  Davide Bacciu,et al.  Deep Tree Transductions - A Short Survey , 2019, INNSBDDL.

[33]  Xi Chen,et al.  Graph Estimation for Matrix-variate Gaussian Data , 2015, 1509.05453.

[34]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[35]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[36]  Ying Li,et al.  Analysis of Phenotypic Variables and Differentiation Between Untypical Crohn’s Disease and Untypical Intestinal Tuberculosis , 2019, Digestive Diseases and Sciences.

[37]  Rommel M. Barbosa,et al.  Medical data set classification using a new feature selection algorithm combined with twin-bounded support vector machine , 2020, Medical & Biological Engineering & Computing.

[38]  John M. Beggs,et al.  A Maximum Entropy Model Applied to Spatial and Temporal Correlations from Cortical Networks In Vitro , 2008, The Journal of Neuroscience.

[39]  Aimin Li,et al.  Clinical, Endoscopic and Histological Differentiations between Crohn’s Disease and Intestinal Tuberculosis , 2012, Digestion.