Genome-wide Association Study of Dimensional Psychopathology Using Electronic Health Records

BACKGROUND Genetic studies of neuropsychiatric disease strongly suggest an overlap in liability. There are growing efforts to characterize these diseases dimensionally rather than categorically, but the extent to which such dimensional models correspond to biology is unknown. METHODS We applied a newly developed natural language processing method to extract five symptom dimensions based on the National Institute of Mental Health Research Domain Criteria definitions from narrative hospital discharge notes in a large biobank. We conducted a genome-wide association study to examine whether common variants were associated with each of these dimensions as quantitative traits. RESULTS Among 4687 individuals, loci in three of five domains exceeded a genome-wide threshold for statistical significance. These included a locus spanning the neocortical development genes RFPL3 and RFPL3S for arousal (p = 2.29 × 10-8) and one spanning the FPR3 gene for cognition (p = 3.22 × 10-8). CONCLUSIONS Natural language processing identifies dimensional phenotypes that may facilitate the discovery of common genetic variation that is relevant to psychopathology.

[1]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[2]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[3]  I. Kohane,et al.  Instrumenting the health care enterprise for discovery research in the genomic era. , 2009, Genome research.

[4]  M. Fava,et al.  Rare Copy Number Variation in Treatment-Resistant Major Depressive Disorder , 2014, Biological Psychiatry.

[5]  J. Cohn,et al.  Comparison of the effects of bupropion and amitriptyline on cardiac conduction in depressed patients. , 1983, The Journal of clinical psychiatry.

[6]  Tianxi Cai,et al.  High Throughput Phenotyping for Dimensional Psychopathology in Electronic Health Records , 2018, Biological Psychiatry.

[7]  Jianxin Shi,et al.  Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs , 2013, Nature Genetics.

[8]  P. Brambilla,et al.  Electrophysiological changes of cardiac function during antidepressant treatment , 2009, Therapeutic advances in cardiovascular disease.

[9]  J. Smoller,et al.  Contributions of the social environment to first-onset and recurrent mania , 2014, Molecular Psychiatry.

[10]  Itsik Pe'er,et al.  Cryptic Distant Relatives Are Common in Both Isolated and Cosmopolitan Genetic Samples , 2012, PloS one.

[11]  Ji Ming Wang,et al.  Role of formyl peptide receptor-like 1 (FPRL1/FPR2) in mononuclear phagocyte responses in alzheimer disease , 2005, Immunologic research.

[12]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[13]  M. Daly,et al.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies , 2014, Nature Genetics.

[14]  Isaac S. Kohane,et al.  Architecture of the Open-source Clinical Research Chart from Informatics for Integrating Biology and the Bedside , 2007, AMIA.

[15]  Gonçalo R. Abecasis,et al.  Minimac2: Faster Genotype Imputation , 2015, Bioinform..

[16]  I. Kohane,et al.  A clinical perspective on the relevance of research domain criteria in electronic health records. , 2015, The American journal of psychiatry.

[17]  Alisha R Pollastri,et al.  Validation of electronic health record phenotyping of bipolar disorder cases and controls. , 2015, The American journal of psychiatry.

[18]  A. Dolphin,et al.  alpha2delta expression sets presynaptic calcium channel abundance and release probability , 2012, Nature.

[19]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[20]  T. Insel,et al.  Wesleyan University From the SelectedWorks of Charles A . Sanislow , Ph . D . 2010 Research Domain Criteria ( RDoC ) : Toward a New Classification Framework for Research on Mental Disorders , 2018 .

[21]  Sheng Ding,et al.  Scalable Production of iPSC-Derived Human Neurons to Identify Tau-Lowering Compounds by High-Content Screening , 2017, Stem cell reports.

[22]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[23]  M. Daly,et al.  An Atlas of Genetic Correlations across Human Diseases and Traits , 2015, Nature Genetics.

[24]  Nich Wattanasin,et al.  The Biobank Portal for Partners Personalized Medicine: A Query Tool for Working with Consented Biobank Samples, Genotypes, and Phenotypes Using i2b2 , 2016, Journal of personalized medicine.

[25]  M. Kozak,et al.  Developing constructs for psychopathology research: research domain criteria. , 2010, Journal of abnormal psychology.

[26]  Andreas Hess,et al.  A Genome-wide Drosophila Screen for Heat Nociception Identifies α2δ3 as an Evolutionarily Conserved Pain Gene , 2010, Cell.

[27]  Michael Boehnke,et al.  LocusZoom: regional visualization of genome-wide association scan results , 2010, Bioinform..

[28]  P. Khaitovich,et al.  Evolutionary forces shape the human RFPL1,2,3 genes toward a role in neocortex development. , 2008, American journal of human genetics.