Retrospective Association Analysis of Longitudinal Binary Traits Identifies Important Loci and Pathways in Cocaine Use

Longitudinal phenotypes have been increasingly available in genome-wide association studies (GWAS) and electronic health record-based studies for identification of genetic variants that influence complex traits over time. For longitudinal binary data, there remain significant challenges in gene mapping, including misspecification of the model for the phenotype distribution due to ascertainment. Here, we propose L-BRAT, a retrospective, generalized estimating equations-based method for genetic association analysis of longitudinal binary outcomes. We also develop RGMMAT, a retrospective, generalized linear mixed model-based association test. Both tests are retrospective score approaches in which genotypes are treated as random conditional on phenotype and covariates. They allow both static and time-varying covariates to be included in the analysis. Through simulations, we illustrated that retrospective association tests are robust to ascertainment and other types of phenotype model misspecification, and gain power over previous association methods. We applied L-BRAT and RGMMAT to a genome-wide association analysis of repeated measures of cocaine use in a longitudinal cohort. Pathway analysis implicated association with opioid signaling and axonal guidance signaling pathways. Lastly, we replicated important pathways in an independent cocaine dependence case-control GWAS. Our results illustrate that L-BRAT is able to detect important loci and pathways in a genome scan and to provide insights into genetic architecture of cocaine use.

[1]  Kesheng Wang,et al.  A genome-wide meta-analysis identifies novel loci associated with schizophrenia and bipolar disorder , 2010, Schizophrenia Research.

[2]  Ke Xu,et al.  Longitudinal SNP‐set association analysis of quantitative phenotypes , 2017, Genetic epidemiology.

[3]  Kuo-mei Chen,et al.  A novel method for analyzing genetic association with longitudinal phenotypes , 2013, Statistical applications in genetics and molecular biology.

[4]  Patrick J Heagerty,et al.  On outcome-dependent sampling designs for longitudinal binary response data with time-varying covariates. , 2008, Biostatistics.

[5]  David Kaplan,et al.  The Sage handbook of quantitative methodology for the social sciences , 2004 .

[6]  Runze Li,et al.  A dynamic model for genome-wide association studies , 2011, Human Genetics.

[7]  A. Bahi,et al.  Cocaine-induced expression changes of axon guidance molecules in the adult rat brain , 2005, Molecular and Cellular Neuroscience.

[8]  Mary Sara McPeek,et al.  MASTOR: mixed-model association mapping of quantitative traits in samples with related individuals. , 2013, American journal of human genetics.

[9]  Jianxin Shi,et al.  Optimal methods for meta‐analysis of genome‐wide association studies , 2011, Genetic epidemiology.

[10]  Albert Hofman,et al.  Fast linear mixed model computations for genome‐wide association studies with longitudinal data , 2013, Statistics in medicine.

[11]  Luigi Ferrucci,et al.  SHAVE: shrinkage estimator measured for multiple visits increases power in GWAS of quantitative traits , 2012, European Journal of Human Genetics.

[12]  Thomas Lumley,et al.  Generalized estimating equations for genome‐wide association studies using longitudinal phenotype data , 2015, Statistics in medicine.

[13]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[14]  Hongyu Zhao,et al.  Genomewide association study of cocaine dependence and related traits: FAM53B identified as a risk gene , 2013, Molecular Psychiatry.

[15]  Sheldon Brown,et al.  Veterans Aging Cohort Study (VACS): Overview and Description , 2006, Medical care.

[16]  N. Patterson,et al.  Mixed Model Association with Family-Biased Case-Control Ascertainment , 2016, bioRxiv.

[17]  K. Berridge,et al.  Optogenetic Central Amygdala Stimulation Intensifies and Narrows Motivation for Cocaine , 2017, The Journal of Neuroscience.

[18]  Alisha R Pollastri,et al.  Diagnostic reliability of the Semi-structured Assessment for Drug Dependence and Alcoholism (SSADDA). , 2005, Drug and alcohol dependence.

[19]  E. Unterwald,et al.  Cocaine-induced mu opioid receptor occupancy within the striatum is mediated by dopamine D2 receptors , 2009, Brain Research.

[20]  S. Redline,et al.  Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models. , 2016, American journal of human genetics.

[21]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[22]  S. Hyman,et al.  Addiction and the brain: The neurobiology of compulsion and its persistence , 2001, Nature Reviews Neuroscience.

[23]  Joseph R. Scarpa,et al.  Gene Network Dysregulation in Dorsolateral Prefrontal Cortex Neurons of Humans with Cocaine Use Disorder , 2017, Scientific Reports.

[24]  Jonathan S Schildcrout,et al.  Outcome-related, Auxiliary Variable Sampling Designs for Longitudinal Binary Data , 2018, Epidemiology.

[25]  Hong Yang,et al.  MOL 21998 1 RECEPTOR REGULATION OF AXON GUIDANCE MOLECULE GENE EXPRESSION , 2006 .

[26]  Vladimir Vacic,et al.  Genome‐wide association study of schizophrenia in Ashkenazi Jews , 2015, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[27]  K. Befort,et al.  Reward processing by the opioid system in the brain. , 2009, Physiological reviews.

[28]  Patrick J Heagerty,et al.  Extending the Case–Control Design to Longitudinal Data: Stratified Sampling Based on Repeated Binary Outcomes , 2018, Epidemiology.

[29]  P. Visscher,et al.  Mixed model with correction for case-control ascertainment increases association power. , 2015, American journal of human genetics.

[30]  K. Rickels,et al.  Genome-wide association study of treatment response to venlafaxine XR in generalized anxiety disorder , 2017, Psychiatry Research.

[31]  Mary Sara McPeek,et al.  Retrospective Binary-Trait Association Test Elucidates Genetic Architecture of Crohn Disease. , 2016, American journal of human genetics.

[32]  M. Wong,et al.  The PHF21B gene is associated with major depression and modulates the stress response , 2016, Molecular Psychiatry.

[33]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[34]  Mary Sara McPeek,et al.  CERAMIC: Case-Control Association Testing in Samples with Related Individuals, Based on Retrospective Mixed Model Analysis with Adjustment for Covariates , 2016, PLoS genetics.

[35]  Eleazar Eskin,et al.  Genome‐Wide Association Mapping With Longitudinal Data , 2012, Genetic epidemiology.

[36]  M. O’Donovan,et al.  Genome-wide association analysis identifies 30 new susceptibility loci for schizophrenia , 2017, Nature Genetics.

[37]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[38]  N. Schork,et al.  Genome-wide association study of paliperidone efficacy , 2016, Pharmacogenetics and genomics.

[39]  M. McPeek,et al.  L-GATOR: Genetic Association Testing for a Longitudinally Measured Quantitative Trait in Samples with Related Individuals. , 2018, American journal of human genetics.

[40]  Lars G Fritsche,et al.  Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies , 2017, Nature Genetics.

[41]  M. McPeek,et al.  Retrospective Association Analysis of Binary Traits: Overcoming Some Limitations of the Additive Polygenic Model , 2016, Human Heredity.

[42]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[43]  Bengt Muthén,et al.  Latent Variable Analysis: Growth Mixture Modeling and Related Techniques for Longitudinal Data , 2004 .