Integrating omics and MRI data with kernel-based tests and CNNs to identify rare genetic markers for Alzheimer's disease

Precision medicine and personalized treatment require predictive markers of disease. We focus on Alzheimer's disease (AD), for which magnetic resonance imaging (MRI) scans provide information about disease status. By combining imaging with genome sequencing, we aim to identify rare genetic markers associated with quantitative traits predicted by convolutional neural networks (CNNs); such traits have traditionally been derived manually by experts. Kernel-based tests are a powerful tool for testing the association of sets of genetic variants with a trait, but how to optimally model rare genetic variants remains an open research question. We propose a generalized set of kernels that incorporate prior information from various annotations and multi-omics data. In an analysis of data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), we evaluate whether (i) CNNs yield precise and reliable brain traits, and (ii) the novel kernel-based tests can help to identify loci associated with AD. The results indicate that CNNs provide a fast, scalable, and precise tool for deriving quantitative AD traits, and that new kernels integrating domain knowledge can yield higher power in association tests of very rare variants.
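To make the idea of a kernel-based set test with annotation-informed weights concrete, the following is a minimal sketch, not the generalized kernels proposed here. It assumes a weighted linear kernel K = G W Gᵀ in the style of the sequence kernel association test, an intercept-only null model, and a permutation p-value in place of an analytic null distribution. The genotypes, the per-variant `annotation` scores, and the trait `y` are simulated placeholders, and all function names are hypothetical.

```python
import numpy as np


def weighted_linear_kernel(G, weights):
    """Return K = G diag(weights) G^T for an (n samples x m variants) genotype matrix."""
    G = np.asarray(G, dtype=float)
    w = np.asarray(weights, dtype=float)
    return (G * w) @ G.T  # broadcasting scales column j of G by weights[j]


def score_statistic(y, K):
    """Variance-component score statistic Q = r^T K r under an intercept-only null."""
    y = np.asarray(y, dtype=float)
    r = y - y.mean()  # residuals of the null model (intercept only)
    return float(r @ K @ r)


def permutation_pvalue(y, K, n_perm=199, seed=0):
    """Empirical p-value obtained by permuting the trait across samples."""
    rng = np.random.default_rng(seed)
    q_obs = score_statistic(y, K)
    exceed = sum(
        score_statistic(rng.permutation(y), K) >= q_obs for _ in range(n_perm)
    )
    return (exceed + 1) / (n_perm + 1)


# Simulated placeholder data (assumption: not ADNI data).
rng = np.random.default_rng(42)
n, m = 200, 10
maf = rng.uniform(0.005, 0.02, size=m)            # rare-variant allele frequencies
G = rng.binomial(2, maf, size=(n, m)).astype(float)
annotation = rng.uniform(0.0, 1.0, size=m)        # hypothetical functional scores in [0, 1]
weights = annotation                              # prior information enters via the weights
y = rng.normal(size=n)                            # stand-in for a CNN-derived brain trait

K = weighted_linear_kernel(G, weights)
q = score_statistic(y, K)
p = permutation_pvalue(y, K)
print(f"Q = {q:.3f}, permutation p-value = {p:.3f}")
```

With non-negative weights, Q equals the weighted sum of squared per-variant score contributions, so variants with higher prior scores contribute more to the test statistic. Permuting the trait is only valid in this sketch because the null model has no covariates; with confounders, a different null calibration would be needed.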
