Analysis of defective pathways and drug repositioning in Multiple Sclerosis via machine learning approaches

BACKGROUND Although some studies show that there could be a genetic predisposition to develop Multiple Sclerosis (MS), attempts to find genetic signatures related to MS diagnosis and development are extremely rare. METHOD We carried out a retrospective analysis of two different microarray datasets, using machine learning techniques to understand the defective pathways involved in this disease. We have modeled two data sets that are publicly accessible. The first was used to establish the list of most discriminatory genes; whereas, the second one was utilized for validation purposes. RESULTS The analysis provided a list of high discriminatory genes with predictive cross-validation accuracy higher than 95%, both in learning and in blind validation. The results were confirmed via the holdout sampler. The most discriminatory genes were related to the production of Hemoglobin. The biological processes involved were related to T-cell Receptor Signaling and co-stimulation, Interferon-Gamma Signaling and Antigen Processing and Presentation. Drug repositioning via CMAP methodologies highlighted the importance of Trichostatin A and other HDAC inhibitors. CONCLUSIONS The defective pathways suggest viral or bacterial infections as plausible mechanisms involved in MS development. The pathway analysis also confirmed coincidences with Epstein-Barr virus, Influenza A, Toxoplasmosis, Tuberculosis and Staphylococcus Aureus infections. Th17 Cell differentiation, and CD28 co-stimulation seemed to be crucial in the development of this disease. Furthermore, the additional knowledge provided by this analysis helps to identify new therapeutic targets.

[1]  K. Pearson VII. Note on regression and inheritance in the case of two parents , 1895, Proceedings of the Royal Society of London.

[2]  S. Burrows,et al.  Epstein–Barr virus and multiple sclerosis: potential opportunities for immunotherapy , 2014, Clinical & translational immunology.

[3]  Christopher Chang Managing Autoimmune Disorders through Personalized Epigenetic Approaches , 2015 .

[4]  R. Hintzen,et al.  Genetics of multiple sclerosis. , 2011, Biochimica et biophysica acta.

[5]  Enrique J. deAndrés-Galiana,et al.  mGluR5 mediates post-radiotherapy fatigue development in cancer patients , 2018, Translational Psychiatry.

[6]  Intawat Nookaew,et al.  MicroRNA regulate immune pathways in T-cells in multiple sclerosis (MS) , 2013, BMC Immunology.

[7]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[8]  Enrique J. deAndrés-Galiana,et al.  Genomic data integration in chronic lymphocytic leukemia , 2017, The journal of gene medicine.

[9]  S. Möller,et al.  Intergenomic consensus in multifactorial inheritance loci: the case of multiple sclerosis , 2004, Genes and Immunity.

[10]  J. Maranchie,et al.  Hsa-miRNA-765 as a Key Mediator for Inhibiting Growth, Migration and Invasion in Fulvestrant-Treated Prostate Cancer , 2014, PloS one.

[11]  Y. Guan,et al.  Microarray gene expression profiling analysis combined with bioinformatics in multiple sclerosis , 2013, Molecular Biology Reports.

[12]  Juan Luis Fernández-Martínez,et al.  From Bayes to Tarantola: New insights to understand uncertainty in inverse problems☆ , 2013 .

[13]  C. Tench,et al.  Tobacco smoking and disability progression in multiple sclerosis: United Kingdom cohort study , 2013, Brain : a journal of neurology.

[14]  Ramnik J. Xavier,et al.  Gene enrichment profiles reveal T-cell development, differentiation, and lineage-specific transcription factors including ZBTB25 as a novel NF-AT repressor. , 2010, Blood.

[15]  S. Hauser,et al.  The Neurobiology of Multiple Sclerosis: Genes, Inflammation, and Neurodegeneration , 2006, Neuron.

[16]  R. Lucas,et al.  Environmental Risk Factors for Multiple Sclerosis: A Review with a Focus on Molecular Mechanisms , 2012, International journal of molecular sciences.

[17]  Angela N. Brooks,et al.  A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles , 2017, Cell.

[18]  Breysse Denys,et al.  The uncertainty analysis in linear and nonlinear regression revisited: application to concrete strength estimation , 2018, Inverse Problems in Science and Engineering.

[19]  María M. Abad-Grau,et al.  Multiple Sclerosis Risk Variant HLA-DRB1*1501 Associates with High Expression of DRB1 Gene in Different Human Populations , 2012, PloS one.

[20]  Xuan Wang,et al.  Breaking the paradigm: Dr Insight empowers signature-free, enhanced drug repurposing , 2019, Bioinform..

[21]  Enrique J. deAndrés-Galiana,et al.  A predictive algorithm to identify genes that discriminate individuals with fibromyalgia syndrome diagnosis from healthy controls , 2018, Journal of pain research.

[22]  H. Hanwell,et al.  Assessment of evidence for a protective role of vitamin D in multiple sclerosis. , 2011, Biochimica et biophysica acta.

[23]  François Berger,et al.  New clues about vitamin D functions in the nervous system , 2002, Trends in Endocrinology & Metabolism.

[24]  Yudong Zhang,et al.  Multiple sclerosis identification by convolutional neural network with dropout and parametric ReLU , 2018, J. Comput. Sci..

[25]  H. DeLuca,et al.  Vitamin D and Multiple Sclerosis , 1997, Proceedings of the Society for Experimental Biology and Medicine. Society for Experimental Biology and Medicine.

[26]  C. Polman,et al.  Higher levels of 25-hydroxyvitamin D are associated with a lower incidence of multiple sclerosis only in women , 2009, Multiple sclerosis.

[27]  N. Risch,et al.  Twin concordance and sibling recurrence rates in multiple sclerosis , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  G. Rosati,et al.  The prevalence of multiple sclerosis in the world: an update , 2001, Neurological Sciences.

[29]  R C Coombes,et al.  Trichostatin A is a histone deacetylase inhibitor with potent antitumor activity against breast cancer in vivo. , 2001, Clinical cancer research : an official journal of the American Association for Cancer Research.

[30]  H. Weiner Multiple sclerosis is an inflammatory T-cell-mediated autoimmune disease. , 2004, Archives of neurology.

[31]  Stephen L. Hauser,et al.  Abrogation of T cell quiescence characterizes patients at high risk for multiple sclerosis after the initial neurological event , 2008, Proceedings of the National Academy of Sciences.

[32]  S. Guloksuz,et al.  Hemoglobins as new players in multiple sclerosis: metabolic and immune aspects , 2016, Metabolic Brain Disease.

[33]  C. Brodley,et al.  Exploration of machine learning techniques in predicting multiple sclerosis disease course , 2017, PloS one.

[34]  José Ramón Quevedo,et al.  Graphical Feature Selection for Multilabel Classification Tasks , 2011, IDA.

[35]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Silke Schmidt,et al.  Interleukin 7 receptor α chain ( IL7R ) shows allelic and functional association with multiple sclerosis , 2007, Nature Genetics.

[37]  H. Koeffler,et al.  Longitudinal inhibition of PI3K/Akt/mTOR signaling by LY294002 and rapamycin induces growth arrest of adult T-cell leukemia cells. , 2007, Leukemia research.

[38]  C. Ross,et al.  Nuclear Receptor NR1H3 in Familial Multiple Sclerosis , 2016, Neuron.

[39]  B. Trapp,et al.  Multiple sclerosis: an immune or neurodegenerative disorder? , 2008, Annual review of neuroscience.

[40]  H. Harbo,et al.  Variation in interleukin 7 receptor α chain (IL7R) influences risk of multiple sclerosis , 2007, Nature Genetics.

[41]  Guohui Li,et al.  Long Non-coding RNAs: Regulators of Viral Infection and the Interferon Antiviral Response , 2018, Front. Microbiol..

[42]  Enrique J. deAndrés-Galiana,et al.  Sampling Defective Pathways in Phenotype Prediction Problems via the Holdout Sampler , 2018, IWBBIO.

[43]  Enrique J. deAndrés-Galiana,et al.  Design of Biomedical Robots for Phenotype Prediction Problems , 2016, J. Comput. Biol..

[44]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[45]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[46]  Manuel A. Friese,et al.  Immunopathology of multiple sclerosis , 2015, Nature Reviews Immunology.

[47]  Enrique J. deAndrés-Galiana,et al.  Supervised Classification by Filter Methods and Recursive Feature Elimination Predicts Risk of Radiotherapy-Related Fatigue in Patients with Prostate Cancer , 2014, Cancer informatics.

[48]  M. Alter,et al.  Multiple sclerosis and nutrition. , 1974, Archives of neurology.

[49]  Tsviya Olender,et al.  GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation. , 2009, Omics : a journal of integrative biology.

[50]  A. Rao,et al.  Jarid2 is induced by TCR signalling and controls iNKT cell maturation , 2014, Nature Communications.