poolMC: Smart pooling of mRNA samples in microarray experiments

BackgroundTypically, pooling of mRNA samples in microarray experiments implies mixing mRNA from several biological-replicate samples before hybridization onto a microarray chip. Here we describe an alternative smart pooling strategy in which different samples, not necessarily biological replicates, are pooled in an information theoretic efficient way. Further, each sample is tested on multiple chips, but always in pools made up of different samples. The end goal is to exploit the compressibility of microarray data to reduce the number of chips used and increase the robustness to noise in measurements.ResultsA theoretical framework to perform smart pooling of mRNA samples in microarray experiments was established and the software implementation of the pooling and decoding algorithms was developed in MATLAB. A proof-of-concept smart pooled experiment was performed using validated biological samples on commercially available gene chips. Differential-expression analysis of the smart pooled data was performed and compared against the unpooled control experiment.ConclusionsThe theoretical developments and experimental demonstration in this paper provide a useful starting point to investigate smart pooling of mRNA samples in microarray experiments. Although the smart pooled experiment did not compare favorably with the control, the experiment highlighted important conditions for the successful implementation of smart pooling - linearity of measurements, sparsity in data, and large experiment size.

[1]  E. Candes,et al.  11-magic : Recovery of sparse signals via convex programming , 2005 .

[2]  Piotr Indyk,et al.  Sparse Recovery Using Sparse Random Matrices , 2010, LATIN.

[3]  R A Irizarry,et al.  On the utility of pooling biological samples in microarray experiments. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[4]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[6]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[7]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[8]  Christopher D. Pilcher,et al.  Optimizing Screening for Acute Human Immunodeficiency Virus Infection with Pooled Nucleic Acid Amplification Tests , 2008, Journal of Clinical Microbiology.

[9]  Snehit Prabhu,et al.  Overlapping Pools for High Throughput Targeted Resequencing , 2009, RECOMB.

[10]  D. Du,et al.  Combinatorial Group Testing and Its Applications , 1993 .

[11]  Ronald A. DeVore,et al.  Deterministic constructions of compressed sensing matrices , 2007, J. Complex..

[12]  Daniel L. Mace,et al.  Cell Identity Mediates the Response of Arabidopsis Roots to Abiotic Stress , 2008, Science.

[13]  Shu-Dong Zhang,et al.  Bioinformatics Original Paper Effect of Pooling Samples on the Efficiency of Comparative Studies Using Microarrays , 2022 .

[14]  Dan Nettleton,et al.  Pooling mRNA in microarray experiments and its effect on power , 2007, Bioinform..

[15]  Raghunandan M Kainkaryam,et al.  Pooling in high-throughput drug screening. , 2009, Current opinion in drug discovery & development.

[16]  E. Candès,et al.  Stable signal recovery from incomplete and inaccurate measurements , 2005, math/0503066.

[17]  Fulai Jin,et al.  A yeast two-hybrid smart-pool-array system for protein-interaction mapping , 2007, Nature Methods.

[18]  M. Vidal,et al.  Shifted Transversal Design smart-pooling for high coverage interactome mapping. , 2009, Genome research.

[19]  G. Hannon,et al.  DNA Sudoku--harnessing high-throughput sequencing for multiplexed specimen analysis. , 2009, Genome research.

[20]  D. Du,et al.  Pooling Designs And Nonadaptive Group Testing: Important Tools For Dna Sequencing , 2006 .

[21]  Arnold J. Stromberg,et al.  Statistical implications of pooling RNA samples for microarray experiments , 2003, BMC Bioinform..

[22]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[23]  Peter J. Woolf,et al.  poolHiTS: A Shifted Transversal Design based pooling strategy for high-throughput drug screening , 2008, BMC Bioinformatics.

[24]  Jean-Jacques Daudin,et al.  Biases induced by pooling samples in microarray experiments , 2007, ISMB/ECCB.

[25]  Richard G. Baraniuk,et al.  Compressive Sensing DNA Microarrays , 2008, EURASIP J. Bioinform. Syst. Biol..

[26]  Piotr Indyk,et al.  Combining geometry and combinatorics: A unified approach to sparse signal recovery , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[27]  Ann M. Hess,et al.  Filtering for increased power for microarray data analysis , 2009, BMC Bioinformatics.

[28]  Kevin Dobbin,et al.  Effects of pooling mRNA in microarray class comparisons , 2004, Bioinform..