Integrating germline and somatic genetics to identify genes associated with lung cancer

Genome‐wide association studies (GWAS) have successfully identified many genetic variants associated with complex traits. However, GWAS experience power issues, resulting in the failure to detect certain associated variants. Additionally, GWAS are often unable to parse the biological mechanisms of driving associations. An existing gene‐based association test framework, Transcriptome‐Wide Association Studies (TWAS), leverages expression quantitative trait loci data to increase the power of association tests and illuminate the biological mechanisms by which genetic variants modulate complex traits. We extend the TWAS methodology to incorporate somatic information from tumors. By integrating germline and somatic data we are able to leverage information from the nuanced somatic landscape of tumors. Thus we can augment the power of TWAS‐type tests to detect germline genetic variants associated with cancer phenotypes. We use somatic and germline data on lung adenocarcinomas from The Cancer Genome Atlas in conjunction with a meta‐analyzed lung cancer GWAS to identify novel genes associated with lung cancer.

[1]  Michael Wainberg,et al.  Opportunities and challenges for transcriptome-wide association studies , 2019, Nature Genetics.

[2]  Alexander Gusev,et al.  A transcriptome-wide association study of high grade serous epithelial ovarian cancer identifies novel susceptibility genes and splice variants , 2019, Nature Genetics.

[3]  Hongyu Zhao,et al.  A statistical framework for cross-tissue transcriptome-wide association analysis , 2018, Nature Genetics.

[4]  R. Grossman,et al.  Cancer expression quantitative trait loci (eQTLs) can be determined from heterogeneous tumor gene expression data by modeling variation in tumor purity , 2018, Genome Biology.

[5]  Julio Saez-Rodriguez,et al.  The germline genetic component of drug sensitivity in cancer cell lines , 2018, Nature Communications.

[6]  Marylyn D. Ritchie,et al.  Large-Scale Analysis of Genetic and Clinical Patient Data , 2018, Annual Review of Biomedical Data Science.

[7]  Yuntao Guo,et al.  Oncogenic TRIM31 confers gemcitabine resistance in pancreatic cancer via activating the NF-κB signaling pathway , 2018, Theranostics.

[8]  Xinyuan Dong,et al.  A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics. , 2018, American journal of human genetics.

[9]  Paul Ellis,et al.  Carboplatin in BRCA1/2-mutated and triple-negative breast cancer BRCAness subgroups: the TNT Trial , 2018, Nature Medicine.

[10]  Li Ding,et al.  Perspective on Oncogenic Processes at the End of the Beginning of Cancer Genomics , 2018, Cell.

[11]  Lingyun Zhang,et al.  Pyridoxine 5′-phosphate oxidase is a novel therapeutic target and regulated by the TGF-β signalling pathway in epithelial ovarian cancer , 2017, Cell Death & Disease.

[12]  Zhiyuan Xu,et al.  Imaging-wide association study: Integrating imaging endophenotypes in GWAS , 2017, NeuroImage.

[13]  Wei Pan,et al.  A Powerful Framework for Integrating eQTL and GWAS Summary Data , 2017, Genetics.

[14]  T. Cai,et al.  Genetic predisposition to lung cancer: comprehensive literature integration, meta-analysis, and multiple evidence assessment of candidate-gene association studies , 2017, Scientific Reports.

[15]  Y. Bossé,et al.  A Decade of GWAS Results in Lung Cancer , 2017, Cancer Epidemiology, Biomarkers & Prevention.

[16]  Trey Ideker,et al.  Interaction Landscape of Inherited Polymorphisms with Somatic Events in Cancer. , 2017, Cancer discovery.

[17]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[18]  J. Taipale,et al.  The role of enhancers in cancer , 2016, Nature Reviews Cancer.

[19]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[20]  A. Butte,et al.  Systematic pan-cancer analysis of tumour purity , 2015, Nature Communications.

[21]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[22]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[23]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[24]  Yaroslav A. Kainov,et al.  Expression and clinical significance of CRABP1 and CRABP2 in non-small cell lung cancer , 2014, Tumor Biology.

[25]  Xiaotong Shen,et al.  A Powerful and Adaptive Association Test for Rare Variants , 2014, Genetics.

[26]  P. He,et al.  TRIM31 is downregulated in non-small cell lung cancer and serves as a potential tumor suppressor , 2014, Tumor Biology.

[27]  Gaurav Bhatia,et al.  Fast and accurate imputation of summary statistics enhances evidence of functional enrichment , 2013, Bioinform..

[28]  Tanya M. Teslovich,et al.  Discovery and refinement of loci associated with lipid levels , 2013, Nature Genetics.

[29]  Xia Yang,et al.  Sherlock: detecting gene-disease associations by matching patterns of expression QTL and GWAS. , 2013, American journal of human genetics.

[30]  A. McKenna,et al.  Integrative eQTL-Based Analyses Reveal the Biology of Breast Cancer Risk Loci , 2013, Cell.

[31]  Xiang Zhou,et al.  Polygenic Modeling with Bayesian Sparse Linear Mixed Models , 2012, PLoS genetics.

[32]  W. Hahn,et al.  Genetic and functional analyses implicate the NUDT11, HNF1B, and SLC22A3 genes in prostate cancer pathogenesis , 2012, Proceedings of the National Academy of Sciences.

[33]  F. Markowetz,et al.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups , 2012, Nature.

[34]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.

[35]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[36]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[37]  Joseph K. Pickrell,et al.  DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines , 2011, Genome Biology.

[38]  M. Esteller,et al.  Epigenetic modifications and human disease , 2010, Nature Biotechnology.

[39]  Yun Li,et al.  METAL: fast and efficient meta-analysis of genomewide association scans , 2010, Bioinform..

[40]  Leopold Parts,et al.  A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies , 2010, PLoS Comput. Biol..

[41]  Wei Pan,et al.  Test Selection with Application to Detecting Disease Association with Multiple SNPs , 2009, Human Heredity.

[42]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[43]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[44]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[45]  W. Thilly,et al.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). , 2007, Mutation research.

[46]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .