Ranking of multidimensional drug profiling data by fractional-adjusted bi-partitional scores

Motivation: The recent development of high-throughput drug profiling (high content screening or HCS) provides a large amount of quantitative multidimensional data. Despite its potentials, it poses several challenges for academia and industry analysts alike. This is especially true for ranking the effectiveness of several drugs from many thousands of images directly. This paper introduces, for the first time, a new framework for automatically ordering the performance of drugs, called fractional adjusted bi-partitional score (FABS). This general strategy takes advantage of graph-based formulations and solutions and avoids many shortfalls of traditionally used methods in practice. We experimented with FABS framework by implementing it with a specific algorithm, a variant of normalized cut—normalized cut prime (FABS-NC′), producing a ranking of drugs. This algorithm is known to run in polynomial time and therefore can scale well in high-throughput applications. Results: We compare the performance of FABS-NC′ to other methods that could be used for drugs ranking. We devise two variants of the FABS algorithm: FABS-SVM that utilizes support vector machine (SVM) as black box, and FABS-Spectral that utilizes the eigenvector technique (spectral) as black box. We compare the performance of FABS-NC′ also to three other methods that have been previously considered: center ranking (Center), PCA ranking (PCA), and graph transition energy method (GTEM). The conclusion is encouraging: FABS-NC′ consistently outperforms all these five alternatives. FABS-SVM has the second best performance among these six methods, but is far behind FABS-NC′: In some cases FABS-NC′ produces over half correctly predicted ranking experiment trials than FABS-SVM. Availability: The system and data for the evaluation reported here will be made available upon request to the authors after this manuscript is accepted for publication. Contact: yxy128@berkeley.edu

[1]  Thomas D. Y. Chung,et al.  A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays , 1999, Journal of biomolecular screening.

[2]  T. Mitchison,et al.  Phenotypic screening of small molecule libraries by high throughput cell imaging. , 2003, Combinatorial chemistry & high throughput screening.

[3]  Kai Huang,et al.  Boosting accuracy of automated classification of fluorescence microscope images for location proteomics , 2004, BMC Bioinformatics.

[4]  K. Yeow,et al.  Cellular imaging in drug discovery , 2006, Nature Reviews Drug Discovery.

[5]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[6]  Yi-Hung Huang,et al.  Automatic Morphological Subtyping Reveals New Roles of Caspases in Mitochondrial Dynamics , 2011, PLoS Comput. Biol..

[7]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[8]  John A. Tallarico,et al.  Multi-parameter phenotypic profiling: using cellular effects to characterize small-molecule compounds , 2009, Nature Reviews Drug Discovery.

[9]  Anthony Nichols,et al.  High content screening as a screening tool in drug discovery. , 2007, Methods in molecular biology.

[10]  Dorit S. Hochbaum,et al.  An efficient algorithm for image segmentation, Markov random fields and related problems , 2001, JACM.

[11]  Dorit S. Hochbaum Polynomial Time Algorithms for Ratio Regions and a Variant of Normalized Cut , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Yi-Hung Huang,et al.  A spectral graph theoretic approach to quantification and calibration of collective morphological differences in cell images , 2010, Bioinform..

[13]  Lani F. Wu,et al.  Cellular Heterogeneity: Do Differences Make a Difference? , 2010, Cell.

[14]  Xiaobo Zhou,et al.  Informatics challenges of high-throughput microscopy , 2006, IEEE Signal Processing Magazine.

[15]  C. Conrad,et al.  Automated microscopy for high-content RNAi screening , 2010, The Journal of cell biology.

[16]  L. Williams,et al.  Contents , 2020, Ophthalmology (Rochester, Minn.).

[17]  Lani F. Wu,et al.  Multidimensional Drug Profiling By Automated Microscopy , 2004, Science.

[18]  Philip Denner,et al.  High-content analysis in preclinical drug discovery. , 2008, Combinatorial chemistry & high throughput screening.

[19]  Jeffrey H Price,et al.  Statistics of assay validation in high throughput cell imaging of nuclear factor kappaB nuclear translocation. , 2005, Assay and drug development technologies.

[20]  Chun-Nan Hsu,et al.  Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization , 2007, Bioinform..

[21]  Timothy J Mitchison,et al.  Small‐Molecule Screening and Profiling by Using Automated Microscopy , 2005, Chembiochem : a European journal of chemical biology.

[22]  Toshihiko Oka,et al.  Mitotic Phosphorylation of Dynamin-related GTPase Drp1 Participates in Mitochondrial Fission* , 2007, Journal of Biological Chemistry.

[23]  Jitendra Malik,et al.  Normalized Cut and Image Segmentation , 1997 .

[24]  Dorit S. Hochbaum,et al.  A Computational Study of the Pseudoflow and Push-Relabel Algorithms for the Maximum Flow Problem , 2009, Oper. Res..

[25]  Dorit S. Hochbaum,et al.  The Pseudoflow Algorithm: A New Algorithm for the Maximum-Flow Problem , 2008, Oper. Res..

[26]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[28]  Anil K. Bera,et al.  A test for normality of observations and regression residuals , 1987 .

[29]  K D Paull,et al.  Identification of novel antimitotic agents acting at the tubulin level by computer-assisted evaluation of differential cytotoxicity data. , 1992, Cancer research.