CGene: an R package for implementation of causal genetic analyses

The excitement over findings from Genome-Wide Association Studies (GWASs) has been tempered by the difficulty in finding the location of the true causal disease susceptibility loci (DSLs), rather than markers that are correlated with the causal variants. In addition, many recent GWASs have studied multiple phenotypes – often highly correlated – making it difficult to understand which associations are causal and which are seemingly causal, induced by phenotypic correlations. In order to identify DSLs, which are required to understand the genetic etiology of the observed associations, statistical methodology has been proposed that distinguishes between a direct effect of a genetic locus on the primary phenotype and an indirect effect induced by the association with the intermediate phenotype that is also correlated with the primary phenotype. However, so far, the application of this important methodology has been challenging, as no user-friendly software implementation exists. The lack of software implementation of this sophisticated methodology has prevented its large-scale use in the genetic community. We have now implemented this statistical approach in a user-friendly and robust R package that has been thoroughly tested. The R package ‘CGene’ is available for download at http://cran.r-project.org/. The R code is also available at http://people.hsph.harvard.edu/~plipman.

[1]  Ming D. Li,et al.  Genome-wide meta-analyses identify multiple loci associated with smoking behavior , 2010, Nature Genetics.

[2]  Teri A Manolio,et al.  Genomewide association studies and assessment of the risk of disease. , 2010, The New England journal of medicine.

[3]  Tariq Ahmad,et al.  Meta-analysis and imputation refines the association of 15q25 with smoking quantity , 2010, Nature Genetics.

[4]  H. Boezen,et al.  Genome-wide association studies: what do they teach us about asthma and chronic obstructive pulmonary disease? , 2009, Proceedings of the American Thoracic Society.

[5]  G. Gamble,et al.  Lung cancer gene associated with COPD: triple whammy or possible confounding effect? , 2008, European Respiratory Journal.

[6]  S. Vansteelandt,et al.  On the adjustment for covariates in genetic association analysis: a novel, simple principle to infer direct causal effects , 2009, Genetic epidemiology.

[7]  Christoph Lange,et al.  Inferring genetic causal effects on survival data with associated endo‐phenotypes , 2011, Genetic epidemiology.

[8]  Inês Barroso,et al.  Meta-analysis and imputation refines the association of 15q25 with smoking quantity , 2010, Nature Genetics.

[9]  K. Shianna,et al.  A Genome-Wide Association Study in Chronic Obstructive Pulmonary Disease (COPD): Identification of Two Major Susceptibility Loci , 2009, PLoS genetics.

[10]  C. Gieger,et al.  Sequence variants at CHRNB3–CHRNA6 and CYP2A6 affect smoking behavior , 2010, Nature Genetics.

[11]  Christopher I. Amos,et al.  Mediating effects of smoking and chronic obstructive pulmonary disease on the relation between the CHRNA5‐A3 genetic locus and lung cancer risk , 2010, Cancer.

[12]  C. Gieger,et al.  Sequence variants at CHRNB 3 – CHRNA 6 and CYP 2 A 6 affect smoking behavior , 2010 .