Rigorously determining whether a gene, or a set of genes, carries alterations involved in carcinogenesis requires comparing the prevalence of the identified changes with a control mutation frequency present in tumor DNA. To facilitate this task, we develop a testing approach and an associated R library, called TRAB, that evaluates whether the frequency of somatic mutation in a given gene is higher than that observed in a control group of genes. Specifically, we test the null hypothesis that the gene's frequency belongs to a control population of frequencies against the alternative hypothesis that it is higher. Mutation frequencies in the control group are themselves allowed to vary. TRAB computes the posterior probability and the Bayes factor for the hypothesis using a hierarchical Bayesian approach.
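To make the shape of such a test concrete, the following is a minimal Python sketch of a Bayes factor comparing "the gene's frequency comes from the control population" with "the gene's frequency is higher". It is not TRAB's implementation or interface. Everything specific in it is an assumption made for illustration: the control population is summarized by a Beta distribution fitted by the method of moments (a plug-in shortcut rather than a full hierarchical model), the alternative places a uniform prior on frequencies above the control mean, the prior odds are taken to be equal, and all function names and data values are hypothetical.

```python
import math


def fit_beta_by_moments(rates):
    """Method-of-moments Beta(a, b) fit to control frequencies (plug-in assumption)."""
    k = len(rates)
    mean = sum(rates) / k
    var = sum((r - mean) ** 2 for r in rates) / (k - 1)
    if not 0.0 < var < mean * (1.0 - mean):
        raise ValueError("control frequencies are not compatible with a Beta fit")
    scale = mean * (1.0 - mean) / var - 1.0
    return mean * scale, (1.0 - mean) * scale


def log_beta_fn(p, q):
    """Logarithm of the Beta function B(p, q)."""
    return math.lgamma(p) + math.lgamma(q) - math.lgamma(p + q)


def marginal_null(x, n, a, b):
    """P(x | H0): beta-binomial probability when the frequency is Beta(a, b)."""
    log_choose = math.lgamma(n + 1) - math.lgamma(x + 1) - math.lgamma(n - x + 1)
    return math.exp(log_choose + log_beta_fn(x + a, n - x + b) - log_beta_fn(a, b))


def marginal_alternative(x, n, floor):
    """P(x | H1) when the frequency is Uniform(floor, 1) (illustrative prior)."""
    # The integral of Binomial(x | n, p) over (floor, 1) equals
    # P(Binomial(n + 1, floor) <= x) / (n + 1).
    tail = sum(
        math.comb(n + 1, k) * floor ** k * (1.0 - floor) ** (n + 1 - k)
        for k in range(x + 1)
    )
    return tail / ((n + 1) * (1.0 - floor))


def bayes_factor(x, n, control_rates):
    """Bayes factor for H1 (higher frequency) against H0 (control population)."""
    a, b = fit_beta_by_moments(control_rates)
    floor = a / (a + b)  # control mean, used as the lower bound under H1
    return marginal_alternative(x, n, floor) / marginal_null(x, n, a, b)


def posterior_prob_alternative(bf, prior_prob=0.5):
    """Posterior probability of H1 given the Bayes factor and a prior probability."""
    odds = bf * prior_prob / (1.0 - prior_prob)
    return odds / (1.0 + odds)


# Hypothetical data: mutation frequencies observed in eight control genes,
# and a candidate gene mutated in 12 of 50 screened tumors.
control = [0.02, 0.03, 0.05, 0.04, 0.03, 0.06, 0.02, 0.04]
bf = bayes_factor(12, 50, control)
print("Bayes factor (H1 vs H0):", bf)
print("Posterior probability of H1:", posterior_prob_alternative(bf))
```

In this sketch a Bayes factor above 1 favors the alternative that the candidate gene is mutated more often than the control genes. A fully hierarchical treatment would also carry forward the uncertainty in the control-population parameters instead of fixing them at point estimates.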