The Power Decoder Simulator for the Evaluation of Pooled shRNA Screen Performance

RNA interference screening using pooled, short hairpin RNA (shRNA) is a powerful, high-throughput tool for determining the biological relevance of genes for a phenotype. Assessing an shRNA pooled screen’s performance is difficult in practice; one can estimate the performance only by using reproducibility as a proxy for power or by employing a large number of validated positive and negative controls. Here, we develop an open-source software tool, the Power Decoder simulator, for generating shRNA pooled screening experiments in silico that can be used to estimate a screen’s statistical power. Using the negative binomial distribution, it models both the relative abundance of multiple shRNAs within a single screening replicate and the biological noise between replicates for each individual shRNA. We demonstrate that this simulator can successfully model the data from an actual laboratory experiment. We then use it to evaluate the effects of biological replicates and sequencing counts on the performance of a pooled screen, without the necessity of gathering additional data. The Power Decoder simulator is written in R and Python and is available for download under the GNU General Public License v3.0.

[1]  Abel Licon,et al.  Optimized PCR Conditions and Increased shRNA Fold Representation Improve Reproducibility of Pooled shRNA Screens , 2012, PloS one.

[2]  Neville E. Sanjana,et al.  Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells , 2014, Science.

[3]  Qisheng Li,et al.  A genome-wide genetic screen for host factors required for hepatitis C virus propagation , 2009, Proceedings of the National Academy of Sciences.

[4]  K. Kok,et al.  siRNA and shRNA screens advance key understanding of host factors required for HIV-1 replication , 2009, Retrovirology.

[5]  Q. Boese,et al.  Insulin signaling through Akt/protein kinase B analyzed by small interfering RNA-mediated gene silencing , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Michael Peyton,et al.  Synthetic lethal screen identification of chemosensitizer loci in cancer cells , 2007, Nature.

[7]  M. Manns,et al.  Caspase 8 small interfering RNA prevents acute liver failure in mice , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  J. Bernstein,et al.  Sample size and statistical power of randomised, controlled trials in orthopaedics. , 2001, The Journal of bone and joint surgery. British volume.

[9]  栗崎 惠子 Nuclear factor YY1 inhibits transforming growth factor β- and bone morphogenetic protein-induced cell differentiation , 2005 .

[10]  P. D. Finch Substantive difference and the analysis of histograms from very large samples. , 1979, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[11]  Benjamin M. Bolker,et al.  Ecological Models and Data in R , 2008 .

[12]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[13]  M. Boutros,et al.  A Genome-Wide RNA Interference Screen Identifies Caspase 4 as a Factor Required for Tumor Necrosis Factor Alpha Signaling , 2012, Molecular and Cellular Biology.

[14]  J. Manola,et al.  A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens. , 2004, Nucleic acids research.

[15]  Marketa Zvelebil,et al.  High-throughput RNA interference screening using pooled shRNA libraries and next generation sequencing , 2011, Genome Biology.

[16]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[17]  A. Lipovsky,et al.  Genome-wide siRNA screen identifies the retromer as a cellular entry factor for human papillomavirus , 2013, Proceedings of the National Academy of Sciences.

[18]  John Quackenbush,et al.  Genome-wide siRNA screen for mediators of NF-κB activation , 2012, Proceedings of the National Academy of Sciences.

[19]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[20]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[21]  High-throughput RNAi screening for N-glycosylation dependent loci in Caenorhabditis elegans. , 2010, Methods in enzymology.

[22]  Amanda Birmingham,et al.  NoiseMaker: simulated screens for statistical assessment , 2010, Bioinform..

[23]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[24]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[25]  E. Lander,et al.  Genetic Screens in Human Cells Using the CRISPR-Cas9 System , 2013, Science.

[26]  Yilong Li,et al.  Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR-guide RNA library , 2013, Nature Biotechnology.

[27]  Susan R. Wilson,et al.  Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing , 2012, BMC Genomics.