Modification of Kolmogorov-Smirnov test for DNA content data analysis through distribution alignment.

The Kolmogorov-Smirnov (K-S) test is a statistical method often used for comparing two distributions. In high-throughput screening (HTS) studies, such distributions usually arise from the phenotype of independent cell populations. However, the K-S test has been criticized for being overly sensitive in applications, and it often detects a statistically significant difference that is not biologically meaningful. One major reason is that there is a common phenomenon in HTS studies that systematic drifting exists among the distributions due to reasons such as instrument variation, plate edge effect, accidental difference in sample handling, etc. In particular, in high-content cellular imaging experiments, the location shift could be dramatic since some compounds themselves are fluorescent. This oversensitivity of the K-S test is particularly overpowered in cellular assays where the sample sizes are very big (usually several thousands). In this paper, a modified K-S test is proposed to deal with the nonspecific location-shift problem in HTS studies. Specifically, we propose that the distributions are "normalized" by density curve alignment before the K-S test is conducted. In applications to simulation data and real experimental data, the results show that the proposed method has improved specificity.

[1]  T. Lynn Eudey,et al.  Statistical considerations in DNA flow cytometry , 1996 .

[2]  Paolo Cappella,et al.  Multiparametric Cell Cycle Analysis by Automated Microscopy , 2006, Journal of biomolecular screening.

[3]  D. L. Taylor,et al.  Multiplexed high content screening assays create a systems cell biology approach to drug discovery. , 2005, Drug discovery today. Technologies.

[4]  F. Lampariello,et al.  On the use of the Kolmogorov-Smirnov statistical test for immunofluorescence histogram comparison. , 2000, Cytometry.

[5]  U. Holst,et al.  Statistical evaluation of cell kinetic data from DNA flow cytometry (FCM) by the EM algorithm. , 1989, Cytometry.

[6]  I. T. Young Proof without prejudice: use of the Kolmogorov-Smirnov test for the analysis of histograms from flow systems and other sources. , 1977, The journal of histochemistry and cytochemistry : official journal of the Histochemistry Society.

[7]  James V. Watson Proof without prejudice revisited: immunofluorescence histogram analysis using cumulative frequency subtraction plus ratio analysis of means. , 2001, Cytometry.

[8]  L L Vindeløv,et al.  A review of techniques and results obtained in one laboratory by an integrated system of methods designed for routine clinical flow cytometric DNA analysis. , 1990, Cytometry.

[9]  R E Hand,et al.  Parametric analysis of histograms measured in flow cytometry. , 1983, Cytometry.

[10]  D L Taylor,et al.  Real-time molecular and cellular analysis: the new frontier of drug discovery. , 2001, Current opinion in biotechnology.

[11]  D. A. Sprott,et al.  Statistical analysis of nuclear genome size of plants with flow cytometer data. , 2001, Cytometry.