Prediction of progression in idiopathic pulmonary fibrosis using CT scans at baseline: A quantum particle swarm optimization - Random forest approach

Idiopathic pulmonary fibrosis (IPF) is a fatal lung disease characterized by an unpredictable progressive decline in lung function. Natural history of IPF is unknown and the prediction of disease progression at the time of diagnosis is notoriously difficult. High resolution computed tomography (HRCT) has been used for the diagnosis of IPF, but not generally for monitoring purpose. The objective of this work is to develop a novel predictive model for the radiological progression pattern at voxel-wise level using only baseline HRCT scans. Mainly, there are two challenges: (a) obtaining a data set of features for region of interest (ROI) on baseline HRCT scans and their follow-up status; and (b) simultaneously selecting important features from high-dimensional space, and optimizing the prediction performance. We resolved the first challenge by implementing a study design and having an expert radiologist contour ROIs at baseline scans, depending on its progression status in follow-up visits. For the second challenge, we integrated the feature selection with prediction by developing an algorithm using a wrapper method that combines quantum particle swarm optimization to select a small number of features with random forest to classify early patterns of progression. We applied our proposed algorithm to analyze anonymized HRCT images from 50 IPF subjects from a multi-center clinical trial. We showed that it yields a parsimonious model with 81.8% sensitivity, 82.2% specificity and an overall accuracy rate of 82.1% at the ROI level. These results are superior to other popular feature selections and classification methods, in that our method produces higher accuracy in prediction of progression and more balanced sensitivity and specificity with a smaller number of selected features. Our work is the first approach to show that it is possible to use only baseline HRCT scans to predict progressive ROIs at 6 months to 1year follow-ups using artificial intelligence.

[1]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[2]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[3]  A. Rahmim,et al.  Smoothly clipped absolute deviation (SCAD) regularization for compressed sensing MRI using an augmented Lagrangian scheme , 2012, 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC).

[4]  Joyce S Lee,et al.  A Multidimensional Index and Staging System for Idiopathic Pulmonary Fibrosis , 2012, Annals of Internal Medicine.

[5]  F. Martinez,et al.  Fibroblastic foci in usual interstitial pneumonia: idiopathic versus collagen vascular disease. , 2003, American journal of respiratory and critical care medicine.

[6]  R. Sussman,et al.  A phase 3 trial of pirfenidone in patients with idiopathic pulmonary fibrosis. , 2014, The New England journal of medicine.

[7]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[8]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[9]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[10]  Antonio Criminisi,et al.  Decision Forests for Computer Vision and Medical Image Analysis , 2013, Advances in Computer Vision and Pattern Recognition.

[11]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[12]  P. Mecocci,et al.  Random Forest ensembles for detection and prediction of Alzheimer's disease with a good between-cohort robustness , 2014, NeuroImage: Clinical.

[13]  Sumit K. Shah,et al.  Classification of parenchymal abnormality in scleroderma lung using a novel approach to denoise images collected via a multicenter study. , 2008, Academic radiology.

[14]  Abhyuday Mandal,et al.  d-QPSO: A Quantum-Behaved Particle Swarm Technique for Finding D-Optimal Designs With Discrete and Continuous Factors and a Binary Response , 2018, Technometrics.

[15]  E. Hoffman,et al.  Computer recognition of regional lung disease patterns. , 1999, American journal of respiratory and critical care medicine.

[16]  Christopher Bowd,et al.  Learning from healthy and stable eyes: A new approach for detection of glaucomatous progression , 2015, Artif. Intell. Medicine.

[17]  Xin Yao,et al.  A Survey on Evolutionary Computation Approaches to Feature Selection , 2016, IEEE Transactions on Evolutionary Computation.

[18]  H. K. Huang,et al.  Feature selection in the pattern classification problem of digital chest radiograph segmentation , 1995, IEEE Trans. Medical Imaging.

[19]  P. Suganthan Particle swarm optimiser with neighbourhood operator , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[20]  N. Sverzellati,et al.  Predicting survival in newly diagnosed idiopathic pulmonary fibrosis: a 3-year prospective study , 2012, European Respiratory Journal.

[21]  Percy Liang,et al.  Semi-Supervised Learning for Natural Language , 2005 .

[22]  R. Elashoff,et al.  Relationship between quantitative radiographic assessments of interstitial lung disease and physiological and clinical features of systemic sclerosis , 2014, Annals of the rheumatic diseases.

[23]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[25]  Yangyang Li,et al.  Dynamic-context cooperative quantum-behaved particle swarm optimization based on multilevel thresholding applied to medical image segmentation , 2015, Inf. Sci..

[26]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[27]  D. Hansell,et al.  Obstructive lung diseases: texture classification for differentiation at CT. , 2003, Radiology.

[28]  Yuichi Motai,et al.  Kernel Association for Classification and Prediction: A Survey , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.

[30]  G. Raghu,et al.  FG-3019 anti-connective tissue growth factor monoclonal antibody: results of an open-label clinical trial in idiopathic pulmonary fibrosis , 2016, European Respiratory Journal.

[31]  G. Moody,et al.  Predicting in-hospital mortality of ICU patients: The PhysioNet/Computing in cardiology challenge 2012 , 2012, 2012 Computing in Cardiology.

[32]  Jing Zhao,et al.  A Binary Quantum-behaved Particle Swarm Optimization Algorithm with Cooperative Approach , 2013 .

[33]  Wenbo Xu,et al.  Particle swarm optimization with particles having quantum behavior , 2004, Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753).

[34]  Jian Shen,et al.  Medical image classification based on multi-scale non-negative sparse coding , 2017, Artif. Intell. Medicine.

[35]  Ilias Maglogiannis,et al.  An intelligent system for automated breast cancer diagnosis and prognosis using SVM based classifiers , 2009, Applied Intelligence.

[36]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[37]  R.M. Haralick,et al.  Statistical and structural approaches to texture , 1979, Proceedings of the IEEE.

[38]  Kotagiri Ramamohanarao,et al.  Classification of healthy and diseased retina using SD-OCT imaging and Random Forest algorithm , 2018, PloS one.

[39]  Shih-Yin Chen,et al.  Idiopathic pulmonary fibrosis in US Medicare beneficiaries aged 65 years and older: incidence, prevalence, and survival, 2001-11. , 2014, The Lancet. Respiratory medicine.

[40]  Cong Jin,et al.  Prediction approach of software fault-proneness based on hybrid artificial neural network and quantum particle swarm optimization , 2015, Appl. Soft Comput..

[41]  O. Weck,et al.  A COMPARISON OF PARTICLE SWARM OPTIMIZATION AND THE GENETIC ALGORITHM , 2005 .

[42]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[43]  Takeshi Johkoh,et al.  American Thoracic Society Documents An Official ATS / ERS / JRS / ALAT Statement : Idiopathic Pulmonary Fibrosis : Evidence-based Guidelines for Diagnosis and Management , 2011 .

[44]  Wenbo Xu,et al.  Quantum-Behaved Particle Swarm Optimization with Binary Encoding , 2007, ICANNGA.

[45]  Benson Mwangi,et al.  A Review of Feature Reduction Techniques in Neuroimaging , 2013, Neuroinformatics.

[46]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[47]  Antonio Bolufé Röhler,et al.  An Analysis of Sub-swarms in Multi-swarm Systems , 2011, Australasian Conference on Artificial Intelligence.

[48]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[49]  Mengjie Zhang,et al.  Particle Swarm Optimisation with genetic operators for feature selection , 2017, 2017 IEEE Congress on Evolutionary Computation (CEC).

[50]  Jin Mo Goo,et al.  Computer-aided diagnosis of localized ground-glass opacity in the lung at CT: initial experience. , 2005, Radiology.

[51]  Caroline Petitjean,et al.  A Random Forest Based Approach for One Class Classification in Medical Imaging , 2012, MLMI.

[52]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[53]  Anand Devaraj,et al.  Evaluating disease severity in idiopathic pulmonary fibrosis , 2017, European Respiratory Review.

[54]  Wenbo Xu,et al.  An improved quantum-behaved particle swarm optimization algorithm with weighted mean best position , 2008, Appl. Math. Comput..

[55]  Choi-Hong Lai,et al.  Particle Swarm Optimisation: Classical and Quantum Perspectives , 2011 .

[56]  Michael S. Lew,et al.  Deep learning for visual understanding: A review , 2016, Neurocomputing.

[57]  J. Goldin,et al.  Comparison of the quantitative CT imaging biomarkers of idiopathic pulmonary fibrosis at baseline and early change with an interval of 7 months. , 2015, Academic radiology.

[58]  Su Ruan,et al.  Robust feature selection to predict tumor treatment outcome , 2014, Artif. Intell. Medicine.

[59]  Sun I. Kim,et al.  Application of irregular and unbalanced data to predict diabetic nephropathy using visualization and feature selection methods , 2008, Artif. Intell. Medicine.

[60]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.