Paper information - A web application for sample size and power calculation in case-control microbiome studies

When designing a case-control study to investigate differences in microbial composition, it is fundamental to assess the sample sizes needed to detect a hypothesized difference with sufficient statistical power. Our application includes power calculation for (i) a recoded version of the two-sample generalized Wald test of the 'HMP' R-package for comparing community composition, and (ii) optionally, the Wilcoxon-Mann-Whitney test for comparing operational taxonomic unit (OTU)-specific abundances between two samples. The simulation-based power calculations use the Dirichlet-Multinomial model to describe and generate abundances. The web interface allows for easy specification of sample and effect sizes. As an illustration of our application, we compared the statistical power of the two tests, with and without stratification of samples. We observed that statistical power increases considerably when stratification is employed, meaning that fewer samples are needed to detect the same effect size with the same power.

Availability and implementation: The web interface is written in R using Shiny (RStudio Inc., 2016) and is available at https://fedematt.shinyapps.io/shinyMB. The R code for the recoded generalized Wald test can be found at https://github.com/mafed/msWaldHMP.

Contact: Federico.Mattiello@UGent.be
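To make the idea of a simulation-based power calculation concrete, the following is a minimal sketch in Python, not the application's R code. It illustrates only test (ii): counts are generated from a Dirichlet-Multinomial model for cases and controls, one OTU is compared with a Wilcoxon-Mann-Whitney test, and power is estimated as the fraction of simulated studies that reject. All function names, the Dirichlet parameters, the sequencing depth and the number of simulations are hypothetical choices made for illustration. The p-value uses the tie-corrected normal approximation without continuity correction, which may differ from what the application computes.

```python
import math
import random


def sample_dm_counts(alpha, depth, rng):
    """Draw one sample of OTU counts from a Dirichlet-Multinomial model."""
    # Dirichlet draw via normalised gamma variates.
    gammas = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(gammas)
    probs = [g / total for g in gammas]
    # Multinomial draw: assign each of `depth` reads to an OTU.
    counts = [0] * len(alpha)
    for otu in rng.choices(range(len(alpha)), weights=probs, k=depth):
        counts[otu] += 1
    return counts


def wmw_p_value(x, y):
    """Two-sided Wilcoxon-Mann-Whitney p-value (normal approximation, tie-corrected)."""
    n1, n2 = len(x), len(y)
    n = n1 + n2
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    rank_sum_x, tie_term, i = 0.0, 0.0, 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2.0  # average of ranks i+1 .. j
        size = j - i
        tie_term += size ** 3 - size
        rank_sum_x += midrank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u <= 0:  # all observations identical: no evidence of a difference
        return 1.0
    z = (u - n1 * n2 / 2.0) / math.sqrt(var_u)
    return math.erfc(abs(z) / math.sqrt(2.0))


def estimate_power(alpha_control, alpha_case, n_per_group, otu=0,
                   depth=200, n_sim=200, level=0.05, seed=1):
    """Fraction of simulated case-control studies in which the test rejects."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sim):
        controls = [sample_dm_counts(alpha_control, depth, rng)[otu]
                    for _ in range(n_per_group)]
        cases = [sample_dm_counts(alpha_case, depth, rng)[otu]
                 for _ in range(n_per_group)]
        if wmw_p_value(controls, cases) < level:
            rejections += 1
    return rejections / n_sim


if __name__ == "__main__":
    # Hypothetical parameters: OTU 0 has mean proportion 0.20 in controls, 0.30 in cases.
    alpha_control = [20.0, 30.0, 50.0]
    alpha_case = [30.0, 25.0, 45.0]
    for n in (5, 10, 20):
        print(n, estimate_power(alpha_control, alpha_case, n_per_group=n))
```

Running the script prints an estimated power for each candidate group size, so the smallest size that reaches a target power can be read off. Increasing `n_sim` reduces the Monte Carlo error of each estimate at the cost of run time.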
