Background: The recent development of semi-automated techniques for staining and analyzing flow cytometry samples has presented new challenges. Quality control and quality assessment are critical when developing new high throughput technologies and their associated information services. Our experience suggests that significant bottlenecks remain in the development of high throughput flow cytometry methods for data analysis and display. Especially, data quality control and quality assessment are crucial steps in processing and analyzing high throughput flow cytometry data. Methods: We propose a variety of graphical exploratory data analytic tools for exploring ungated flow cytometry data. We have implemented a number of specialized functions and methods in the Bioconductor package rflowcyt. We demonstrate the use of these approaches by investigating two independent sets of high throughput flow cytometry data. Results: We found that graphical representations can reveal substantial non-biological differences in samples. Empirical Cumulative Distribution Function and summary scatterplots were especially useful in the rapid identification of problems not identified by manual review. Conclusions: Graphical exploratory data analytic tools are quick and useful means of assessing data quality. We propose that the described visualizations should be used as quality assessment tools and where possible, be used for quality control. Data Quality Assessment of Ungated Flow Cytometry Data in High Throughput Experiments Nolwenn Le Meur a∗, Anthony Rossini b, Maura Gasparetto c, Clay Smith c, Ryan R. Brinkman c and Robert Gentlemana a Fred Hutchinson Cancer Research Center, Seattle, Washington, USA; b Novartis Pharma AG, Basel, Switzerland; c Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, BC, Canada ∗Correspondence to Nolwenn Le Meur, Fred Hutchinson Cancer Research Center, Computational Biology, Division of Public Health Science, 1100 Fairview Ave. N., M2-B876, Seattle, Washington 98109-1024 Phone: 206-667-5434 Fax: 206-667-1319 Email: nlemeur@fhcrc.org Funded by: NIH-NIBIB
[1]
Mark S. Boguski,et al.
Biomedical informatics for proteomics
,
2003,
Nature.
[2]
Perry Haaland,et al.
Identification of compounds that enhance the anti-lymphoma activity of rituximab using flow cytometric high-content screening.
,
2004,
Journal of immunological methods.
[3]
D Barnett,et al.
Impact of standardization on clinical cell analysis by flow cytometry.
,
2004,
Journal of biological regulators and homeostatic agents.
[4]
Rafael A. Irizarry,et al.
Bioinformatics and Computational Biology Solutions using R and Bioconductor
,
2005
.
[5]
F. E. Grubbs.
Sample Criteria for Testing Outlying Observations
,
1950
.
[6]
Tudor I. Oprea,et al.
Flow cytometry for high-throughput, high-content screening.
,
2004,
Current opinion in chemical biology.
[7]
W. J. Conover,et al.
Practical Nonparametric Statistics
,
1972
.
[8]
L A Sklar,et al.
High throughput screening of G-protein coupled receptors via flow cytometry.
,
2003,
Combinatorial chemistry & high throughput screening.
[9]
Alvis Brazma,et al.
On the Importance of Standardisation in Life Sciences
,
2001,
Bioinform..
[10]
B S Edwards,et al.
Comprehensive quality assessment approach for flow cytometric immunophenotyping of human lymphocytes.
,
1989,
Cytometry.
[11]
L A Sklar,et al.
High throughput flow cytometry.
,
2001,
Cytometry.