Stratification of candidate genes for Parkinson’s disease using weighted protein-protein interaction network analysis

Background: Genome wide association studies (GWAS) have helped identify large numbers of genetic loci that significantly associate with increased risk of developing diseases. However, translating genetic knowledge into understanding of the molecular mechanisms underpinning disease (i.e. disease-specific impacted biological processes) has to date proved to be a major challenge. This is primarily due to difficulties in confidently defining candidate genes at GWAS-risk loci. The goal of this study was to better characterize candidate genes within GWAS loci using a protein interactome based approach and with Parkinson’s disease (PD) data as a test case. Results: We applied a recently developed Weighted Protein-Protein Interaction Network Analysis (WPPINA) pipeline as a means to define impacted biological processes, risk pathways and therein key functional players. We used previously established Mendelian forms of PD to identify seed proteins, and to construct a protein network for genetic Parkinson’s and carried out functional enrichment analyses. We isolated PD-specific processes indicating ‘mitochondria stressors mediated cell death’, ‘immune response and signaling’, and ‘waste disposal’ mediated through ‘autophagy’. Merging the resulting protein network with data from Parkinson’s GWAS we confirmed 10 candidate genes previously selected by pure proximity and were able to nominate 17 novel candidate genes for sporadic PD. Conclusions: With this study, we were able to better characterize the underlying genetic and functional architecture of idiopathic PD, thus validating WPPINA as a robust pipeline for the in silico genetic and functional dissection of complex disorders.

[1]  John Hardy,et al.  Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences , 2016, Briefings Bioinform..

[2]  Judy H. Cho,et al.  Transcriptional Risk Scores link GWAS to eQTL and Predict Complications in Crohn's Disease , 2017, Nature Genetics.

[3]  P. Visscher,et al.  10 Years of GWAS Discovery: Biology, Function, and Translation. , 2017, American journal of human genetics.

[4]  A. Lusis,et al.  Considerations for the design of omics studies , 2017 .

[5]  Dawn M. Toolan,et al.  TMEM175 deficiency impairs lysosomal and mitochondrial function and increases α-synuclein aggregation , 2017, Proceedings of the National Academy of Sciences.

[6]  J. Hardy,et al.  Weighted Protein Interaction Network Analysis of Frontotemporal Dementia , 2016, Journal of proteome research.

[7]  Kara Dolinski,et al.  The BioGRID interaction database: 2017 update , 2016, Nucleic Acids Res..

[8]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[9]  Hedi Peterson,et al.  g:Profiler—a web server for functional interpretation of gene lists (2016 update) , 2016, Nucleic Acids Res..

[10]  Andrew B Singleton,et al.  Genetics in Parkinson disease: Mendelian versus non‐Mendelian inheritance , 2016, Journal of neurochemistry.

[11]  E. Chang,et al.  Purification and Characterization of Progenitor and Mature Human Astrocytes Reveals Transcriptional and Functional Differences with Mouse , 2016, Neuron.

[12]  Daniel Marbach,et al.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics , 2016, PLoS Comput. Biol..

[13]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[14]  R. F. Hashimoto,et al.  NERI: network-medicine based integrative approach for disease gene prioritization by relative importance , 2015, BMC Bioinformatics.

[15]  A. Singleton,et al.  Parkinson’s disease: From human genetics to clinical trials , 2015, Science Translational Medicine.

[16]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[17]  M. Vidal,et al.  Selecting causal genes from genome-wide association studies via functionally coherent subnetworks , 2014, Nature Methods.

[18]  Lili Wang,et al.  PINBPA: Cytoscape app for network analysis of GWAS data , 2015, Bioinform..

[19]  Joris M. Mooij,et al.  MAGMA: Generalized Gene-Set Analysis of GWAS Data , 2015, PLoS Comput. Biol..

[20]  Sebo Withoff,et al.  Genetic variation in the non-coding genome: Involvement of micro-RNAs and long non-coding RNAs in disease. , 2014, Biochimica et biophysica acta.

[21]  Eleazar Eskin,et al.  Identifying Causal Variants at Loci with Multiple Signals of Association , 2014, Genetics.

[22]  Chuong B. Do,et al.  Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson’s disease , 2014, Nature Genetics.

[23]  Suneil K. Kalia,et al.  Unbiased screen for interactors of leucine-rich repeat kinase 2 supports a common pathway for sporadic and familial Parkinson disease , 2014, Proceedings of the National Academy of Sciences.

[24]  Rafael C. Jimenez,et al.  The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases , 2013, Nucleic Acids Res..

[25]  A. Dunning,et al.  Beyond GWASs: illuminating the dark road from association to function. , 2013, American journal of human genetics.

[26]  L. Furlong Human diseases through the lens of network biology. , 2013, Trends in genetics : TIG.

[27]  Karin Breuer,et al.  InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation , 2012, Nucleic Acids Res..

[28]  Manolis Kellis,et al.  Interpreting non-coding variation in complex disease genetics , 2012, Nature Biotechnology.

[29]  Johannes Goll,et al.  Protein interaction data curation: the International Molecular Exchange (IMEx) consortium , 2012, Nature Methods.

[30]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[31]  Maria Victoria Schneider,et al.  MINT: a Molecular INTeraction database. , 2002, FEBS letters.

[32]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[33]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.