Using BEAN-counter to quantify genetic interactions from multiplexed barcode sequencing experiments

The construction of genome-wide mutant collections has enabled high-throughput, high-dimensional quantitative characterization of gene and chemical function, particularly via genetic and chemical–genetic interaction experiments. As the throughput of such experiments increases with improvements in sequencing technology and sample multiplexing, appropriate tools must be developed to handle the large volume of data produced. Here, we describe how to apply our approach to high-throughput, fitness-based profiling of pooled mutant yeast collections using the BEAN-counter software pipeline (Barcoded Experiment Analysis for Next-generation sequencing) for analysis. The software has also successfully processed data from Schizosaccharomyces pombe, Escherichia coli, and Zymomonas mobilis mutant collections. We provide general recommendations for the design of large-scale, multiplexed barcode sequencing experiments. The procedure outlined here was used to score interactions for ~4 million chemical-by-mutant combinations in our recently published chemical–genetic interaction screen of nearly 14,000 chemical compounds across seven diverse compound collections. Here we selected a representative subset of these data on which to demonstrate our analysis pipeline. BEAN-counter is open source, written in Python, and freely available for academic use. Users should be proficient at the command line; advanced users who wish to analyze larger datasets with hundreds or more conditions should also be familiar with concepts in analysis of high-throughput biological data. BEAN-counter encapsulates the knowledge we have accumulated from, and successfully applied to, our multiplexed, pooled barcode sequencing experiments. This protocol will be useful to those interested in generating their own high-dimensional, quantitative characterizations of gene or chemical function in a high-throughput manner.Multiplexed sequencing of barcoded mutant collections enables high-throughput, fitness-based condition profiling. BEAN-counter is a computational pipeline for quantifying mutant sensitivity or resistance for a few to thousands of conditions.

[1]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[2]  Gavin Sherlock,et al.  Quantitative evolutionary dynamics using high-resolution lineage tracking , 2015, Nature.

[3]  Gary D Bader,et al.  Quantitative analysis of fitness and genetic interactions in yeast on a genome scale , 2010, Nature Methods.

[4]  Sean R. Collins,et al.  Hierarchical modularity and the evolution of genetic interactomes across species. , 2012, Molecular cell.

[5]  Gary D Bader,et al.  The Genetic Landscape of a Cell , 2010, Science.

[6]  Gregory McAllister,et al.  Identification of a novel NAMPT inhibitor by CRISPR/Cas9 chemogenomic profiling in mammalian cells , 2017, Scientific Reports.

[7]  Dong-Uk Kim,et al.  Genome-wide functional analysis using the barcode sequence alignment and statistical analysis (Barcas) tool , 2016, BMC Bioinformatics.

[8]  Robert P. St.Onge,et al.  The Chemical Genomic Portrait of Yeast: Uncovering a Phenotype for All Genes , 2008, Science.

[9]  David G. Robinson,et al.  Design and Analysis of Bar-seq Experiments , 2013, G3: Genes, Genomes, Genetics.

[10]  John A. Tallarico,et al.  High-resolution chemical dissection of a model eukaryote reveals targets, pathways and gene functions. , 2014, Microbiological research.

[11]  Kerry Andrusiak,et al.  Adapting S. cerevisiae Chemical Genomics for Identifying the Modes of Action of Natural Compounds , 2012 .

[12]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[13]  With contributions from , 2007 .

[14]  Grant W. Brown,et al.  Integration of chemical-genetic and genetic interaction data links bioactive compounds to cellular target pathways , 2004, Nature Biotechnology.

[15]  Elizabeth A. Winzeler,et al.  Genomic profiling of drug sensitivities via induced haploinsufficiency , 1999, Nature Genetics.

[16]  Kathryn A. O’Donnell,et al.  Toward a comprehensive temperature-sensitive mutant repository of the essential genes of Saccharomyces cerevisiae. , 2008, Molecular cell.

[17]  Corey Nislow,et al.  Genome-wide analysis of barcoded Saccharomyces cerevisiae gene-deletion mutants in pooled cultures , 2007, Nature Protocols.

[18]  Petr Baldrian,et al.  SEED 2: a user-friendly platform for amplicon high-throughput sequencing data analyses , 2018, Bioinform..

[19]  J. Hayles,et al.  S. pombe genome deletion project: An update , 2010, Cell cycle.

[20]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[21]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[22]  Roland Arnold,et al.  A negative genetic interaction map in isogenic cancer cell lines reveals cancer cell vulnerabilities , 2013, Molecular systems biology.

[23]  Adam P. Rosebrock,et al.  A global genetic interaction network maps a wiring diagram of cellular function , 2016, Science.

[24]  Tao Jiang,et al.  SEED: efficient clustering of next-generation sequences , 2011, Bioinform..

[25]  Minoru Yoshida,et al.  Functional Annotation of Chemical Libraries across Diverse Biological Processes , 2017, bioRxiv.

[26]  Philip M. Kim,et al.  Quantitative Genome-Wide Genetic Interaction Screens Reveal Global Epistatic Relationships of Protein Complexes in Escherichia coli , 2014, PLoS genetics.

[27]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[28]  Irene M Ong,et al.  Plant-derived antifungal agent poacic acid targets β-1,3-glucan , 2015, Proceedings of the National Academy of Sciences.

[29]  Inmar E. Givoni,et al.  Exploring the Mode-of-Action of Bioactive Compounds by Chemical-Genetic Profiling in Yeast , 2006, Cell.

[30]  G. Giaever,et al.  Quantitative Phenotyping via Deep Barcode Sequencing , 2022 .

[31]  Frédéric Mahé,et al.  Swarm: robust and fast clustering method for amplicon-based studies , 2014, PeerJ.

[32]  Sean R. Collins,et al.  Conservation and Rewiring of Functional Modules Revealed by an Epistasis Map in Fission Yeast , 2008, Science.

[33]  Paul J. McMurdie,et al.  DADA2: High resolution sample inference from Illumina amplicon data , 2016, Nature Methods.

[34]  Andrew E. Jaffe,et al.  Bioinformatics Applications Note Gene Expression the Sva Package for Removing Batch Effects and Other Unwanted Variation in High-throughput Experiments , 2022 .

[35]  Robert P. St.Onge,et al.  Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples , 2010, Nucleic acids research.

[36]  Lu Zhao,et al.  Bartender: a fast and accurate clustering algorithm to count barcode reads , 2018, Bioinform..

[37]  Mike Tyers,et al.  Prediction of Synergism from Chemical-Genetic Interactions by Machine Learning. , 2015, Cell systems.

[38]  Julie M Sheridan,et al.  edgeR: a versatile tool for the analysis of shRNA-seq and CRISPR-Cas9 genetic screens , 2014, F1000Research.

[39]  Gary D. Bader,et al.  Mapping the Cellular Response to Small Molecules Using Chemogenomic Fitness Signatures , 2014, Science.

[40]  W. Cleveland LOWESS: A Program for Smoothing Scatterplots by Robust Locally Weighted Regression , 1981 .

[41]  Sheena C. Li,et al.  Chemical genomic profiling via barcode sequencing to predict compound mode of action. , 2015, Methods in molecular biology.

[42]  Corey Nislow,et al.  The Yeast Deletion Collection: A Decade of Functional Genomics , 2014, Genetics.

[43]  Guillaume J. Filion,et al.  Starcode: sequence clustering based on all-pairs search , 2015, Bioinform..

[44]  D. Botstein,et al.  A molecular barcoded yeast ORF library enables mode-of-action analysis of bioactive compounds , 2009, Nature Biotechnology.

[45]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[46]  Scott W. Simpkins,et al.  Accumulation of heme biosynthetic intermediates contributes to the antibacterial action of the metalloid tellurite , 2017, Nature Communications.

[47]  Adam Frost,et al.  Functional Repurposing Revealed by Comparing S. pombe and S. cerevisiae Genetic Interactions , 2012, Cell.

[48]  D. Durocher,et al.  High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities , 2015, Cell.

[49]  Kana Shimizu,et al.  SlideSort: all pairs similarity search for short reads , 2010, Bioinform..