Gene Prioritization for Imaging Genetics Studies Using Gene Ontology and a Stratified False Discovery Rate Approach

Imaging genetics is an emerging field in which the association between genes and neuroimaging-based quantitative phenotypes are used to explore the functional role of genes in neuroanatomy and neurophysiology in the context of healthy function and neuropsychiatric disorders. The main obstacle for researchers in the field is the high dimensionality of the data in both the imaging phenotypes and the genetic variants commonly typed. In this article, we develop a novel method that utilizes Gene Ontology, an online database, to select and prioritize certain genes, employing a stratified false discovery rate (sFDR) approach to investigate their associations with imaging phenotypes. sFDR has the potential to increase power in genome wide association studies (GWAS), and is quickly gaining traction as a method for multiple testing correction. Our novel approach addresses both the pressing need in genetic research to move beyond candidate gene studies, while not being overburdened with a loss of power due to multiple testing. As an example of our methodology, we perform a GWAS of hippocampal volume using both the Enhancing NeuroImaging Genetics through Meta-Analysis (ENIGMA2) and the Alzheimer's Disease Neuroimaging Initiative datasets. The analysis of ENIGMA2 data yielded a set of SNPs with sFDR values between 10 and 20%. Our approach demonstrates a potential method to prioritize genes based on biological systems impaired in a disease.

[1]  F. Dudbridge,et al.  Estimation of significance thresholds for genomewide association scans , 2008, Genetic epidemiology.

[2]  Thomas E. Nichols,et al.  Common genetic variants influence human subcortical brain structures , 2015, Nature.

[3]  Manuel A. R. Ferreira,et al.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. , 2009, American journal of human genetics.

[4]  Jing Chen,et al.  Improved human disease candidate gene prioritization using mouse phenotype , 2007, BMC Bioinformatics.

[5]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[6]  Michael Gill,et al.  Gene-ontology enrichment analysis in two independent family-based samples highlights biologically plausible processes for autism spectrum disorders , 2011, European Journal of Human Genetics.

[7]  M. Owen,et al.  Distribution and Expression of Picalm in Alzheimer Disease , 2010, Journal of neuropathology and experimental neurology.

[8]  Naoyuki Katayama,et al.  Gene expression profiling of peripheral T-cell lymphoma including gammadelta T-cell lymphoma. , 2009, Blood.

[9]  Gary D Bader,et al.  A travel guide to Cytoscape plugins , 2012, Nature Methods.

[10]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[11]  Arthur W Toga,et al.  Recent Advances in Imaging Alzheimer's Disease 2 3 , 2022 .

[12]  Robert W. Williams,et al.  Neuroinformatic analyses of common and distinct genetic components associated with major neuropsychiatric disorders , 2014, Front. Neurosci..

[13]  Dajiang Wang,et al.  Hypothesis on the Relationship Between the Change in Intracellular pH and Incidence of Sporadic Alzheimer's Disease or Vascular Dementia , 2010, The International journal of neuroscience.

[14]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[15]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[16]  Seth Love,et al.  Genetic Evidence Implicates the Immune System and Cholesterol Metabolism in the Aetiology of Alzheimer's Disease , 2010, PloS one.

[17]  A Hofman,et al.  Estimation of the genetic contribution of presenilin-1 and -2 mutations in a population-based study of presenile Alzheimer disease. , 1998, Human molecular genetics.

[18]  Colm O'Dushlaine,et al.  INRICH: interval-based enrichment analysis for genome-wide association studies , 2012, Bioinform..

[19]  Marisa O. Hollinshead,et al.  Identification of common variants associated with human hippocampal and intracranial volumes , 2012, Nature Genetics.

[20]  Michael R Knowles,et al.  Multiple apical plasma membrane constituents are associated with susceptibility to meconium ileus in individuals with cystic fibrosis , 2012, Nature Genetics.

[21]  Kimberly Van Auken,et al.  A method for increasing expressivity of Gene Ontology annotations using a compositional approach , 2014, BMC Bioinformatics.

[22]  Jae-Young Koh,et al.  Contribution by synaptic zinc to the gender-disparate plaque formation in human Swedish mutant APP transgenic mice , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  C. Lendon,et al.  A common biological mechanism in cancer and Alzheimer's disease? , 2009, Current Alzheimer research.

[24]  L. Hersh,et al.  Substrate Activation of Insulin-degrading Enzyme (Insulysin) , 2003, Journal of Biological Chemistry.

[25]  M. Pericak-Vance,et al.  Segregation of a missense mutation in the amyloid precursor protein gene with familial Alzheimer's disease , 1991, Nature.

[26]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[27]  E. Snitkin,et al.  Genome-wide prioritization of disease genes and identification of disease-disease associations from an integrated human functional linkage network , 2009, Genome Biology.

[28]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[29]  Junzhou Huang,et al.  IMAGING GENOMICS. , 2018, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[30]  R. Eeles,et al.  Genome-wide association studies in cancer. , 2008, Human molecular genetics.

[31]  Paul M. Thompson,et al.  Scalar connectivity measures from fast-marching tractography reveal heritability of white matter architecture , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[32]  Y. Benjamini,et al.  Quantitative Trait Loci Analysis Using the False Discovery Rate , 2005, Genetics.

[33]  Jennifer Williamson,et al.  Rare Variants in APP, PSEN1 and PSEN2 Increase Risk for AD in Late-Onset Alzheimer's Disease Families , 2012, PloS one.

[34]  Thomas E. Nichols,et al.  The ENIGMA Consortium: large-scale collaborative analyses of neuroimaging and genetic data , 2014, Brain Imaging and Behavior.

[35]  J. Jia,et al.  Hypoxia and reoxygenation increased BACE1 mRNA and protein levels in human neuroblastoma SH-SY5Y cells , 2006, Neuroscience Letters.

[36]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[37]  Hans Lehrach,et al.  The role of clusterin, complement receptor 1, and phosphatidylinositol binding clathrin assembly protein in Alzheimer disease risk and cerebrospinal fluid biomarker levels. , 2011, Archives of general psychiatry.

[38]  R. Myers,et al.  Candidate-gene approaches for studying complex genetic traits: practical considerations , 2002, Nature Reviews Genetics.

[39]  Ching-Hsing Yu,et al.  SciNet: Lessons Learned from Building a Power-efficient Top-20 System and Data Centre , 2010 .

[40]  Jun Fan,et al.  Low Micromolar Zinc Accelerates the Fibrillization of Human Tau via Bridging of Cys-291 and Cys-322* , 2009, The Journal of Biological Chemistry.

[41]  Radu V. Craiu,et al.  Stratified false discovery control for large‐scale hypothesis testing with application to genome‐wide association studies , 2006, Genetic epidemiology.