An efficient method for the detection and elimination of systematic error in high-throughput screening

MOTIVATION High-throughput screening (HTS) is an early-stage process in drug discovery which allows thousands of chemical compounds to be tested in a single study. We report a method for correcting HTS data prior to the hit selection process (i.e. selection of active compounds). The proposed correction minimizes the impact of systematic errors which may affect the hit selection in HTS. The introduced method, called a well correction, proceeds by correcting the distribution of measurements within wells of a given HTS assay. We use simulated and experimental data to illustrate the advantages of the new method compared to other widely-used methods of data correction and hit selection in HTS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Thomas D. Y. Chung,et al.  A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays , 1999, Journal of biomolecular screening.

[2]  Adesh. kaul,et al.  The Impact of Sophisticated Data Analysis on the Drug Discovery Process 2 Heading Sub Head , 2004 .

[3]  Andrei V. Gagarin,et al.  Comparison of Two Methods for Detecting and Correcting Systematic Error in High-throughput Screening Data , 2006, Data Science and Classification.

[4]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[5]  Vladimir Makarenkov,et al.  Using Clustering Techniques to Improve Hit Selection in High-Throughput Screening , 2006, Journal of biomolecular screening.

[6]  Vladimir Batagelj,et al.  Data Science and Classification , 2006, Studies in Classification, Data Analysis, and Knowledge Organization.

[7]  Bert Gunter,et al.  Improved Statistical Methods for Hit Selection in High-Throughput Screening , 2003, Journal of biomolecular screening.

[8]  Nadine H. Elowe,et al.  Experimental Screening of Dihydrofolate Reductase Yields a “Test Set” of 50,000 Small Molecules for a Computational Data-Mining and Docking Competition , 2005, Journal of biomolecular screening.

[9]  V. Makarenkov,et al.  Statistical Analysis of Systematic Errors in High-Throughput Screening , 2005, Journal of biomolecular screening.

[10]  Robert Nadon,et al.  Statistical practice in high-throughput screening data analysis , 2006, Nature Biotechnology.

[11]  E. Brown,et al.  High throughput screening identifies novel inhibitors of Escherichia coli dihydrofolate reductase that are competitive with dihydrofolate. , 2003, Bioorganic & medicinal chemistry letters.

[12]  P Willett,et al.  Visual and computational analysis of structure--activity relationships in high-throughput screening data. , 2001, Current opinion in chemical biology.

[13]  Robert Nadon,et al.  HTS-Corrector: software for the statistical analysis and correction of experimental high-throughput screening data , 2006, Bioinform..

[14]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[15]  Bert Gunter,et al.  Statistical and Graphical Methods for Quality Control Determination of High-Throughput Screening Data , 2003, Journal of biomolecular screening.

[16]  J H Zhang,et al.  Confirmation of primary active substances from high throughput screening of chemical and biological populations: a statistical approach and practical considerations. , 2000, Journal of combinatorial chemistry.

[17]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[18]  Stephan Heyse,et al.  Comprehensive analysis of high-throughput screening data , 2002, SPIE BiOS.