On the interpretation of transcriptome-wide association studies

Transcriptome-wide association studies (TWAS) aim to detect relationships between gene expression and a phenotype, and are commonly used for secondary analysis of genome-wide association study (GWAS) results. Results from TWAS analyses are often interpreted as indicating a geneticrelationship between gene expression and a phenotype, but this interpretation is not consistent with the null hypothesis that is evaluated in the traditional TWAS framework. In this study we provide a mathematical outline of this TWAS framework, and elucidate what interpretations are warrantedgiven the null hypothesis it actually tests. We then use both simulations and real data analysis to assess the implications of misinterpreting TWAS results as indicative of a genetic relationship between gene expression and the phenotype. Our simulation results show considerably inflated type 1 error rates for TWAS when interpreted this way, with 41% of significant TWAS associations detected in the real data analysis found to have insufficient statistical evidence to infer such a relationship. This demonstrates that in current implementations, TWAS cannot reliably be used to investigate genetic relationships between gene expression and a phenotype, but that local genetic correlation analysis can serve as a potential alternative.

[1]  S. Djurovic,et al.  Shared genetic architecture between schizophrenia and subcortical brain volumes implicates early neurodevelopmental processes and brain development in childhood , 2022, Molecular Psychiatry.

[2]  H. Snieder,et al.  Bioinformatic Prioritization and Functional Annotation of GWAS-Based Candidate Genes for Primary Open-Angle Glaucoma , 2022, Genes.

[3]  Qianqian Zhu,et al.  UACA locus is associated with breast cancer chemoresistance and survival , 2022, NPJ breast cancer.

[4]  Shanyawen Li,et al.  A novel genetic variant potentially altering the expression of MANBA in the cerebellum associated with attention deficit hyperactivity disorder in Han Chinese children , 2021, The world journal of biological psychiatry : the official journal of the World Federation of Societies of Biological Psychiatry.

[5]  K. Hao,et al.  Transcriptome wide association study of coronary artery disease identifies novel susceptibility genes , 2021, bioRxiv.

[6]  P. Sullivan,et al.  Transcriptome-wide association analysis of brain structures yields insights into pleiotropy with complex neuropsychiatric traits , 2021, Nature Communications.

[7]  D. Posthuma,et al.  LAVA: An integrated framework for local genetic correlation analysis , 2021, bioRxiv.

[8]  S. Ripke,et al.  Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia , 2020, medRxiv.

[9]  Xiang Zhou,et al.  Transcriptome-wide association studies: a view from Mendelian randomization , 2020, Quantitative Biology.

[10]  D. Bennett,et al.  Novel Variance-Component TWAS method for studying complex human diseases with applications to Alzheimer’s dementia , 2021, PLoS genetics.

[11]  G. Trynka,et al.  From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases , 2020, Frontiers in Genetics.

[12]  Jianguo Liu,et al.  A fast and powerful eQTL weighted method to detect genes associated with complex trait using GWAS summary data. , 2020, Genetic epidemiology.

[13]  Yun Li,et al.  MOSTWAS: Multi-Omic Strategies for Transcriptome-Wide Association Studies , 2020, bioRxiv.

[14]  Justin M. Luningham,et al.  Bayesian Genome-wide TWAS method to leverage both cis- and trans- eQTL information through summary statistics , 2020, bioRxiv.

[15]  E. Gamazon,et al.  Transcriptome‐wide association analysis offers novel opportunities for clinical translation of genetic discoveries on mental disorders , 2020, World psychiatry : official journal of the World Psychiatric Association.

[16]  Gabriëlle H S Buitendijk,et al.  A transcriptome-wide association study based on 27 tissues identifies 106 genes potentially relevant for disease pathology in age-related macular degeneration , 2020, Scientific Reports.

[17]  W. Pan,et al.  Some statistical consideration in transcriptome‐wide association studies , 2019, Genetic epidemiology.

[18]  Xiang Zhou,et al.  Testing and controlling for horizontal pleiotropy with probabilistic Mendelian randomization in transcriptome-wide association studies , 2019, Nature Communications.

[19]  David A. Knowles,et al.  Opportunities and challenges for transcriptome-wide association studies , 2019, Nature Genetics.

[20]  Hongyu Zhao,et al.  Leveraging functional annotation to identify genes associated with complex diseases , 2019, bioRxiv.

[21]  Jingjing Yang,et al.  TIGAR: An Improved Bayesian Tool for Transcriptomic Data Imputation Enhances Gene Mapping of Complex Traits , 2018, bioRxiv.

[22]  O. Andreassen,et al.  A global overview of pleiotropy and genetic architecture in complex traits , 2019, Nature Genetics.

[23]  Jin Liu,et al.  CoMM: a collaborative mixed model to dissecting genetic contributions to complex traits by leveraging regulatory information , 2018, Bioinform..

[24]  P. Donnelly,et al.  The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.

[25]  Jing Zhao,et al.  Interpretation of differential gene expression results of RNA-seq data: review and integration , 2018, Briefings Bioinform..

[26]  Zoltán Kutalik,et al.  Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits , 2019, Nature Communications.

[27]  Jonathan P. Beauchamp,et al.  Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals , 2018, Nature Genetics.

[28]  D. Lawlor,et al.  Improving the accuracy of two-sample summary-data Mendelian randomization: moving beyond the NOME assumption , 2018, bioRxiv.

[29]  Xinyuan Dong,et al.  A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. , 2018, American journal of human genetics.

[30]  Samuel E. Jones,et al.  Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry , 2018, bioRxiv.

[31]  Hongyu Zhao,et al.  A statistical framework for cross-tissue transcriptome-wide association analysis , 2018, bioRxiv.

[32]  A. Gusev,et al.  Probabilistic fine-mapping of transcriptome-wide association studies , 2017, bioRxiv.

[33]  Bogdan Pasaniuc,et al.  Local genetic correlation gives insights into the shared genetic architecture of complex traits , 2016, bioRxiv.

[34]  Mary Goldman,et al.  Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics , 2016, Nature Communications.

[35]  Wei Pan,et al.  A Powerful Framework for Integrating eQTL and GWAS Summary Data , 2017, Genetics.

[36]  Alexander Gusev,et al.  Integrating Gene Expression with Summary Association Statistics to Identify Genes Associated with 30 Complex Traits. , 2017, American journal of human genetics.

[37]  Luke R. Lloyd-Jones,et al.  The Genetic Architecture of Gene Expression in Peripheral Blood. , 2017, American journal of human genetics.

[38]  Ayellet V. Segrè,et al.  Colocalization of GWAS and eQTL Signals Detects Target Genes , 2016, bioRxiv.

[39]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[40]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[41]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[42]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[43]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[44]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[45]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.