Statistics and decision making in high-throughput screening.

Screening is about making decisions on the modulating activity of one particular compound on a biological system. When a compound testing experiment is repeated under the same conditions or as close to the same conditions as possible, the observed results are never exactly the same, and there is an apparent random and uncontrolled source of variability in the system under study. Nevertheless, randomness is not haphazard. In this context, we can see statistics as the science of decision making under uncertainty. Thus, the usage of statistical tools in the analysis of screening experiments is the right approach to the interpretation of screening data, with the aim of making them meaningful and converting them into valuable information that supports sound decision making.In the HTS workflow, there are at least three key stages where key decisions have to be made based on experimental data: (1) assay development (i.e. how to assess whether our assay is good enough to be put into screening production for the identification of modulators of the target of interest), (2) HTS campaign process (i.e. monitoring that screening process is performing at the expected quality and assessing possible patterned signs of experimental response that may adversely bias and mislead hit identification) and (3) data analysis of primary HTS data (i.e. flagging which compounds are giving a positive response in the assay, namely hit identification).In this chapter we will focus on how some statistical tools can help to cope with these three aspects. Assessment of assay quality is reviewed in other chapters, so in Section 1 we will briefly make some further considerations. Section 2 will review statistical process control, Section 3 will cover methodologies for detecting and dealing with HTS patterns and Section 4 will describe approaches for statistically guided selection of hits in HTS.

[1]  Robert Nadon,et al.  An efficient method for the detection and elimination of systematic error in high-throughput screening , 2007, Bioinform..

[2]  Ji-Hu Zhang,et al.  Further Comparison of Primary Hit Identification by Different Assay Technologies and Effects of Assay Measurement Variability , 2005, Journal of biomolecular screening.

[3]  Bert Gunter,et al.  Improved Statistical Methods for Hit Selection in High-Throughput Screening , 2003, Journal of biomolecular screening.

[4]  Dongmei Liu,et al.  Quantitative assessment of hit detection and confirmation in single and duplicate high-throughput screenings. , 2008, Journal of biomolecular screening.

[5]  Gavin Harper,et al.  Process Validation and Screen Reproducibility in High-Throughput Screening , 2009, Journal of biomolecular screening.

[6]  Thomas D. Y. Chung,et al.  A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays , 1999, Journal of biomolecular screening.

[7]  Yunxia Sui,et al.  Alternative Statistical Parameter for High-Throughput Screening Assay Quality Assessment , 2007, Journal of biomolecular screening.

[8]  Bert Gunter,et al.  Statistical and Graphical Methods for Quality Control Determination of High-Throughput Screening Data , 2003, Journal of biomolecular screening.

[9]  Tina Garyantes,et al.  The Confirmation Rate of Primary Hits: A Predictive Model , 2002, Journal of biomolecular screening.

[10]  J H Zhang,et al.  Confirmation of primary active substances from high throughput screening of chemical and biological populations: a statistical approach and practical considerations. , 2000, Journal of combinatorial chemistry.

[11]  Brian J Eastwood,et al.  A Comparison of Assay Performance Measures in Screening Assays: Signal Window, Z' Factor, and Assay Variability Ratio , 2006, Journal of biomolecular screening.

[12]  David E Root,et al.  Detecting Spatial Patterns in Biological Array Experiments , 2003, Journal of biomolecular screening.

[13]  Vladimir Makarenkov,et al.  Using Clustering Techniques to Improve Hit Selection in High-Throughput Screening , 2006, Journal of biomolecular screening.

[14]  Frederick Mosteller,et al.  Understanding robust and exploratory data analysis , 1983 .

[15]  Z Li,et al.  An Ultra-High Throughput Screening Approach for an Adenine Transferase Using Fluorescence Polarization , 2000, Journal of biomolecular screening.

[16]  C Nicholas Hodge,et al.  Streamlined system for purifying and quantifying a diverse library of compounds and the effect of compound concentration measurements on the accurate interpretation of biological assay results. , 2004, Analytical chemistry.

[17]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.

[18]  Adam Yasgar,et al.  Quantitative high-throughput screening: a titration-based approach that efficiently identifies biological activities in large chemical libraries. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Yue Yuan,et al.  Determining Appropriate Substrate Conversion for Enzymatic Assays in High-Throughput Screening , 2003, Journal of biomolecular screening.

[20]  Norbert Lloyd Enrick,et al.  Quality, reliability, and process improvement , 1985 .

[21]  Robert Nadon,et al.  Statistical practice in high-throughput screening data analysis , 2006, Nature Biotechnology.

[22]  G. Sitta Sittampalam,et al.  Design of Signal Windows in High Throughput Screening Assays for Drug Discovery , 1997 .

[23]  Ramesh Padmanabha,et al.  HTS quality control and data analysis: a process to maximize information from a high-throughput screen. , 2005, Combinatorial chemistry & high throughput screening.

[24]  Sean T. Quinn,et al.  Automated Assay Optimization with Integrated Statistics and Smart Robotics , 2000, Journal of biomolecular screening.

[25]  Kim E. Garbison,et al.  The Minimum Significant Ratio: A Statistical Parameter to Characterize the Reproducibility of Potency Estimates from Concentration-Response Assays and Estimation by Replicate-Experiment Studies , 2006, Journal of biomolecular screening.

[26]  Xiaohua Douglas Zhang,et al.  A New Method with Flexible and Balanced Control of False Negatives and False Positives for Hit Selection in RNA Interference High-Throughput Screening Assays , 2007, Journal of biomolecular screening.

[27]  Andreas Sewing,et al.  Evaluating Real-Life High-Throughput Screening Data , 2005, Journal of biomolecular screening.

[28]  V. Makarenkov,et al.  Statistical Analysis of Systematic Errors in High-Throughput Screening , 2005, Journal of biomolecular screening.

[29]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[30]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .