An Expanded Association Approach for Rare Germline Variants with Copy-Number Alternation

Tumorigenesis is considered as a complex process that is often driven by close interactions between germline variants and accumulated somatic mutational events. Recent studies report that some somatic copy-number alternations show such interactions by harboring germline susceptibility variants under potential selection in clonal expansions. Incorporating these interactions into genetic association approach could be valuable in not only discovering novel susceptibility variants, but providing insight into tumor heterogeneity and clinical implications. To address this need, in this article, we propose RareProb-G, an expanded version of a computational method, which is designed for identifying rare germline susceptibility variants located in the somatic allelic amplification or loss of heterozygosity regions. RareProb-G is based on a hidden Markov random field model. The interactions among germline variants and somatic events are modeled by a neighborhood system, which is bounded by a t-test on variant allelic frequencies. Each variant is assigned four hidden states, which represent the regional status and causal/neutral status, respectively. A hidden Markov model is also introduced to estimate the initial values of the hidden states and unknown model parameters. To verify this approach, we conduct a series of simulation experiments under different configurations, and RareProb-G outperforms than RareProb on both sensitivity and specificity.

[1]  Joshua F. McMichael,et al.  Systematic Discovery of Complex Indels in Human Cancers , 2015, Nature medicine.

[2]  Jin Zhang,et al.  A probabilistic method for identifying rare variants underlying complex traits , 2013, BMC Genomics.

[3]  Gaurav Bhatia,et al.  A Covering Method for Detecting Genetic Associations between Rare Variants and Common Phenotypes , 2010, PLoS Comput. Biol..

[4]  Nazneen Rahman,et al.  Realizing the promise of cancer predisposition genes , 2014, Nature.

[5]  Li Ding,et al.  Patterns and functional implications of rare germline variants across 12 cancer types , 2015, Nature Communications.

[6]  Ying Zhang,et al.  Computational pan-genomics: status, promises and challenges , 2016, bioRxiv.

[7]  Benjamin J. Raphael,et al.  Advances for studying clonal evolution in cancer. , 2013, Cancer letters.

[8]  Eleazar Eskin,et al.  An Optimal Weighted Aggregated Association Test for Identification of Rare Variants Involved in Common Diseases , 2011, Genetics.

[9]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[10]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[11]  Xiuqing Zhang,et al.  Double restriction-enzyme digestion improves the coverage and accuracy of genome-wide CpG methylation profiling by reduced representation bisulfite sequencing , 2013, BMC Genomics.

[12]  Benjamin J. Raphael,et al.  Mutational landscape and significance across 12 major cancer types , 2013, Nature.

[13]  X. Estivill,et al.  Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability , 2007, Nature Reviews Genetics.