Topological data analysis identifies molecular phenotypes of idiopathic pulmonary fibrosis

Background Idiopathic pulmonary fibrosis (IPF) is a debilitating, progressive disease with a median survival time of 3–5 years. Diagnosis remains challenging and disease progression varies greatly, suggesting the possibility of distinct subphenotypes. Methods and results We analysed publicly available peripheral blood mononuclear cell expression datasets for 219 IPF, 411 asthma, 362 tuberculosis, 151 healthy, 92 HIV and 83 other disease samples, totalling 1318 patients. We integrated the datasets and split them into train (n=871) and test (n=477) cohorts to investigate the utility of a machine learning model (support vector machine) for predicting IPF. A panel of 44 genes predicted IPF in a background of healthy, tuberculosis, HIV and asthma with an area under the curve of 0.9464, corresponding to a sensitivity of 0.865 and a specificity of 0.89. We then applied topological data analysis to investigate the possibility of subphenotypes within IPF. We identified five molecular subphenotypes of IPF, one of which corresponded to a phenotype enriched for death/transplant. The subphenotypes were molecularly characterised using bioinformatic and pathway analysis tools identifying distinct subphenotype features including one which suggests an extrapulmonary or systemic fibrotic disease. Conclusions Integration of multiple datasets, from the same tissue, enabled the development of a model to accurately predict IPF using a panel of 44 genes. Furthermore, topological data analysis identified distinct subphenotypes of patients with IPF which were defined by differences in molecular pathobiology and clinical characteristics.

[1]  Ivana V. Yang,et al.  Cluster analysis of transcriptomic datasets to identify endotypes of idiopathic pulmonary fibrosis , 2021, Thorax.

[2]  P. Skipp,et al.  Identification of proteins associated with development of metastasis from cutaneous squamous cell carcinomas (cSCCs) via proteomic analysis of primary cSCCs * , 2020, The British journal of dermatology.

[3]  P. Spagnolo,et al.  Subclinical liver fibrosis in patients with idiopathic pulmonary fibrosis , 2020, Internal and Emergency Medicine.

[4]  T. Greulich,et al.  Transcriptional analysis identifies potential biomarkers and molecular regulators in pneumonia and COPD exacerbation , 2020, Pneumologie.

[5]  C. Auffray,et al.  IL-17-high asthma with features of a psoriasis immunophenotype. , 2019, The Journal of allergy and clinical immunology.

[6]  Hyun J. Kim,et al.  Peripheral blood proteomic profiling of idiopathic pulmonary fibrosis biomarkers in the multicentre IPF-PRO Registry , 2019, Respiratory Research.

[7]  C. Auffray,et al.  Stratification of asthma phenotypes by airway proteomic signatures. , 2019, The Journal of allergy and clinical immunology.

[8]  C. Auffray,et al.  Sputum proteomic signature of gastro-oesophageal reflux in patients with severe asthma. , 2019, Respiratory medicine.

[9]  Takeshi Johkoh,et al.  Diagnosis of Idiopathic Pulmonary Fibrosis. An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline , 2018, American journal of respiratory and critical care medicine.

[10]  Anthony Rowe,et al.  A computational framework for complex disease stratification from multiple large-scale datasets , 2018, BMC Systems Biology.

[11]  C. Auffray,et al.  A Severe Asthma Disease Signature from Gene Expression Profiling of Peripheral Blood from U‐BIOPRED Cohorts , 2017, American journal of respiratory and critical care medicine.

[12]  C. Greenwood,et al.  Novel insights into systemic autoimmune rheumatic diseases using shared molecular signatures and an integrative analysis , 2017, Epigenetics.

[13]  Xinxin Zhang,et al.  Characterization of gene expression profiles in HBV-related liver fibrosis patients and identification of ITGBL1 as a key regulator of fibrogenesis , 2017, Scientific Reports.

[14]  Anthony Atala,et al.  Common pathway signature in lung and liver fibrosis , 2016, Cell cycle.

[15]  G. Song,et al.  Direct Reprogramming of Hepatic Myofibroblasts into Hepatocytes In Vivo Attenuates Liver Fibrosis. , 2016, Cell stem cell.

[16]  N. Burke,et al.  Epidemiology and survival of idiopathic pulmonary fibrosis from national data in Canada , 2016, European Respiratory Journal.

[17]  P. Boor,et al.  The role of PDGF-D in healthy and fibrotic kidneys. , 2016, Kidney international.

[18]  Xiaobing Fu,et al.  Epithelial-mesenchymal transition: An emerging target in tissue fibrosis , 2016, Experimental biology and medicine.

[19]  R. Hubbard,et al.  Global incidence and mortality of idiopathic pulmonary fibrosis: a systematic review , 2015, European Respiratory Journal.

[20]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[21]  I. Noth,et al.  The mitochondrial cardiolipin remodeling enzyme lysocardiolipin acyltransferase is a novel target in pulmonary fibrosis. , 2014, American journal of respiratory and critical care medicine.

[22]  T. Bein,et al.  Medical nanoparticles for next generation drug delivery to the lungs , 2014, European Respiratory Journal.

[23]  A. Suzuki,et al.  Gender and menopause impact severity of fibrosis among patients with nonalcoholic steatohepatitis , 2014, Hepatology.

[24]  George C Tseng,et al.  Peripheral Blood Mononuclear Cell Gene Expression Profiles Predict Poor Outcome in Idiopathic Pulmonary Fibrosis , 2013, Science Translational Medicine.

[25]  Andreas Heider,et al.  virtualArray: a R/bioconductor package to merge raw data from different microarray platforms , 2013, BMC Bioinformatics.

[26]  Joyce S Lee,et al.  A Multidimensional Index and Staging System for Idiopathic Pulmonary Fibrosis , 2012, Annals of Internal Medicine.

[27]  I. Noth,et al.  Profiling Of Gene Expression In Idiopathic Pulmonary Fibrosis , 2012, Asian Test Symposium.

[28]  S. Sahn,et al.  Pirfenidone in patients with idiopathic pulmonary fibrosis (CAPACITY): two randomised trials , 2011, The Lancet.

[29]  H. Collard,et al.  Clinical course and prediction of survival in idiopathic pulmonary fibrosis. , 2011, American journal of respiratory and critical care medicine.

[30]  Kun Wu,et al.  Hepatocyte nuclear factor 4 alpha suppresses the development of hepatocellular carcinoma. , 2010, Cancer research.

[31]  Bei-fang Ning,et al.  Hepatocyte nuclear factor 4α attenuates hepatic fibrosis in rats , 2009, Gut.

[32]  K. Korach,et al.  Male sex hormones exacerbate lung function impairment after bleomycin-induced pulmonary fibrosis. , 2008, American journal of respiratory cell and molecular biology.

[33]  A. Gressner,et al.  Pro-fibrogenic potential of PDGF-D in liver fibrosis. , 2007, Journal of hepatology.

[34]  F. Martinez,et al.  The Clinical Course of Patients with Idiopathic Pulmonary Fibrosis , 2005, Annals of Internal Medicine.

[35]  A. Pardo,et al.  Idiopathic Pulmonary Fibrosis: Prevailing and Evolving Hypotheses about Its Pathogenesis and Implications for Therapy , 2001, Annals of Internal Medicine.

[36]  N. Kohno,et al.  Circulating KL-6 predicts the outcome of rapidly progressive idiopathic pulmonary fibrosis. , 1998, American journal of respiratory and critical care medicine.

[37]  Efficacy and Safety of Nintedanib in Idiopathic Pulmonary Fibrosis. , 2015, The New England journal of medicine.

[38]  A. Azuma,et al.  A Phase 3 Trial of Pirfenidone in Patients with Idiopathic Pulmonary Fibrosis , 2015 .

[39]  S. Rosselot Idiopathic pulmonary fibrosis. , 2014, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[40]  D. Schroeder,et al.  Incidence, prevalence, and clinical course of idiopathic pulmonary fibrosis: a population-based study. , 2010, Chest.