A Machine Learning Approach to Parkinson’s Disease Blood Transcriptomics

The increased incidence and the significant health burden associated with Parkinson’s disease (PD) have stimulated substantial research efforts towards the identification of effective treatments and diagnostic procedures. Despite technological advancements, a cure is still not available and PD is often diagnosed a long time after onset when irreversible damage has already occurred. Blood transcriptomics represents a potentially disruptive technology for the early diagnosis of PD. We used transcriptome data from the PPMI study, a large cohort study with early PD subjects and age matched controls (HC), to perform the classification of PD vs. HC in around 550 samples. Using a nested feature selection procedure based on Random Forests and XGBoost we reached an AUC of 72% and found 493 candidate genes. We further discussed the importance of the selected genes through a functional analysis based on GOs and KEGG pathways.

[1]  S. Papageorgiou,et al.  Environmental Impact on the Epigenetic Mechanisms Underlying Parkinson’s Disease Pathogenesis: A Narrative Review , 2022, Brain sciences.

[2]  G. Pesole,et al.  A primer on machine learning techniques for genomic applications , 2021, Computational and structural biotechnology journal.

[3]  T. Sobrino,et al.  Ceramide Metabolism and Parkinson’s Disease—Therapeutic Targets , 2021, Biomolecules.

[4]  A. Adarmes-Gómez,et al.  Peripheral Immune Profile and Neutrophil‐to‐Lymphocyte Ratio in Parkinson's Disease , 2021, Movement disorders : official journal of the Movement Disorder Society.

[5]  Sayed Hadi Hashemi,et al.  Multi-modality machine learning predicting Parkinson’s disease , 2021, npj Parkinson's Disease.

[6]  A. Toga,et al.  Quality Control Metrics for Whole Blood Transcriptome Analysis in the Parkinson's Progression Markers Initiative (PPMI) , 2021, medRxiv.

[7]  Derek P. Narendra,et al.  Mitochondrial Dysfunction and Mitophagy in Parkinson's Disease: From Mechanism to Therapy. , 2020, Trends in biochemical sciences.

[8]  O. Rötzschke,et al.  New Insights into Immune-Mediated Mechanisms in Parkinson’s Disease , 2020, International journal of molecular sciences.

[9]  Ester Pantaleo,et al.  Identifying potential gene biomarkers for Parkinson’s disease through an information entropy based approach , 2020, Physical biology.

[10]  I. Huijbers,et al.  Dysregulation of metabolic pathways by carnitine palmitoyl-transferase 1 plays a key role in central nervous system disorders: experimental evidence based on animal models , 2020, Scientific Reports.

[11]  Fei Wang,et al.  Mining genetic and transcriptomic data using machine learning approaches in Parkinson’s disease , 2020, npj Parkinson's Disease.

[12]  A. Sehirli,et al.  The Role of the SLC Transporters Protein in the Neurodegenerative Disorders , 2020, Clinical psychopharmacology and neuroscience : the official scientific journal of the Korean College of Neuropsychopharmacology.

[13]  P. Lewis,et al.  Vesicular Dysfunction and the Pathogenesis of Parkinson’s Disease: Clues From Genetic Studies , 2020, Frontiers in Neuroscience.

[14]  I. Jonassen,et al.  Common gene expression signatures in Parkinson’s disease are driven by changes in cell composition , 2019, bioRxiv.

[15]  Feng Jiang,et al.  Identification of potential diagnostic biomarkers for Parkinson's disease , 2019, FEBS open bio.

[16]  N. Erekat Apoptosis and its Role in Parkinson’s Disease , 2018, Parkinson’s Disease: Pathogenesis and Clinical Aspects.

[17]  Haniye Sadat Sajadi,et al.  Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017 , 2018, Lancet.

[18]  Norbert Schuff,et al.  The Parkinson's progression markers initiative (PPMI) – establishing a PD biomarker cohort , 2018, Annals of clinical and translational neurology.

[19]  Sabina Sonia Tangaro,et al.  Complex networks reveal early MRI markers of Parkinson's disease , 2018, Medical Image Anal..

[20]  Yong Cheng,et al.  Oxidative Stress in Parkinson's Disease: A Systematic Review and Meta-Analysis , 2018, Front. Mol. Neurosci..

[21]  B. Pakkenberg,et al.  Inflammatory bowel disease increases the risk of Parkinson’s disease: a Danish nationwide cohort study 1977–2014 , 2018, Gut.

[22]  Claire Duvallet,et al.  Correcting for batch effects in case-control microbiome studies , 2018, bioRxiv.

[23]  G. Deuschl,et al.  Analysis of blood-based gene expression in idiopathic Parkinson disease , 2017, Neurology.

[24]  M. Nalls,et al.  A meta-analysis of genome-wide association studies identifies 17 new Parkinson's disease risk loci , 2017, Nature Genetics.

[25]  K. Chaudhuri,et al.  Non-motor features of Parkinson disease , 2017, Nature Reviews Neuroscience.

[26]  S. Mallal,et al.  T cells of Parkinson’s disease patients recognize α–synuclein peptides , 2017, Nature.

[27]  S. Orozco-Suárez,et al.  Transient glutathione depletion in the substantia nigra compacta is associated with neuroinflammation in rats , 2016, Neuroscience.

[28]  Andrew D. Rouillard,et al.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update , 2016, Nucleic Acids Res..

[29]  Xuejing Yue,et al.  Risk of Parkinson Disease in Diabetes Mellitus: An Updated Meta-Analysis of Population-Based Cohort Studies , 2016, Medicine.

[30]  Nicola Pavese,et al.  Age at onset and Parkinson disease phenotype , 2016, Neurology.

[31]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[32]  D. di Bernardo,et al.  Blood transcriptomics of drug-naïve sporadic Parkinson’s disease patients , 2015, BMC Genomics.

[33]  Margaret Sutherland,et al.  Diagnosis of Parkinson's disease on the basis of clinical and genetic classification: a population-based modelling study , 2015, The Lancet Neurology.

[34]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[35]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[36]  Hagai Bergman,et al.  Long Non-Coding RNA and Alternative Splicing Modulations in Parkinson's Leukocytes Identified by RNA Sequencing , 2014, PLoS Comput. Biol..

[37]  A. Papavassiliou,et al.  Oxidative Stress-Induced Signaling Pathways Implicated in the Pathogenesis of Parkinson’s Disease , 2014, NeuroMolecular Medicine.

[38]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[39]  David S. Greenberg,et al.  Small RNA sequencing-microarray analyses in Parkinson leukocytes reveal deep brain stimulation-induced splicing changes that classify brain region transcriptomes , 2013, Front. Mol. Neurosci..

[40]  Sundaram Suresh,et al.  Parkinson's disease prediction using gene expression - A projection based learning meta-cognitive neural classifier approach , 2013, Expert Syst. Appl..

[41]  Jun Chen,et al.  The integral nuclear membrane protein nurim plays a role in the suppression of apoptosis. , 2012, Current molecular medicine.

[42]  S. Mandel,et al.  A molecular signature in blood identifies early Parkinson’s disease , 2012, Molecular Neurodegeneration.

[43]  G. Paratcha,et al.  The GTPase-activating protein Rap1GAP: A new player to modulate Ret signaling , 2011, Cell Research.

[44]  P. Riederer,et al.  Pilot study: peripheral biomarkers for diagnosing sporadic Parkinson’s disease , 2010, Journal of Neural Transmission.

[45]  J. Vance,et al.  SRRM2, a Potential Blood Biomarker Revealing High Alternative Splicing in Parkinson's Disease , 2010, PloS one.

[46]  J. Growdon,et al.  Molecular markers of early Parkinson's disease based on gene expression in blood , 2007, Proceedings of the National Academy of Sciences.

[47]  Charles M Perou,et al.  Evaluating the comparability of gene expression in blood and brain , 2006, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[48]  S. Mandel,et al.  Gene expression profiling of parkinsonian substantia nigra pars compacta; alterations in ubiquitin-proteasome, heat shock protein, iron and oxidative stress regulated proteins, cell adhesion/cellular matrix and vesicle trafficking genes , 2004, Journal of Neural Transmission.

[49]  Makoto Sawada,et al.  Distribution of major histocompatibility complex class II-positive microglia and cytokine profile of Parkinson's disease brains , 2003, Acta Neuropathologica.

[50]  L. Breiman Random Forests , 2001, Machine Learning.

[51]  G. Lenzi,et al.  Increased expression of dopamine receptors on lymphocytes in Parkinson's disease , 1999, Movement disorders : official journal of the Movement Disorder Society.

[52]  Robert Hill,et al.  Low platelet mitochondrial complex I and complex II/III activity in early untreated parkinson's disease , 1995, Annals of neurology.

[53]  A. Chen-Plotkin,et al.  Parkinson disease: Blood transcriptomics for Parkinson disease? , 2018, Nature Reviews Neurology.

[54]  M. Toft,et al.  Found in transcription: accurate Parkinson's disease classification in peripheral blood. , 2013, Journal of Parkinson's disease.

[55]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[56]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[57]  Ashley Harms Inflammation in Parkinson's disease. , 2001, Advances in neurology.