Informatics challenges of high-throughput microscopy

In this article, we discussed the emerging informatics issues of high-throughput screening (HTS) using automated fluorescence microscopy technology, otherwise known as high-content screening (HCS) in the pharmaceutical industry. Optimal methods of scoring biomarkers and identifying candidate hits have been actively studied in academia and industry, with the exception of data modeling topics. To find candidate hits, we need to score the images associated with different compound interventions. In the application example of RNAi genome-wide screening, we aim to find the candidate effectors or genes which correspond to the images acquired using the three channels. Scoring the effectors is equivalent to scoring the images based on the number of phenotypes existing in those images. Our ultimate objective of studying HTS is to model the relationship between gene networks and cellular phenotypes, investigate cellular communication via protein interaction, and study the disease mechanism beyond the prediction based on the molecular structure of the compound. Finally, computational image analysis has become a powerful tool in cellular and molecular biology studies. Signal processing and modeling for high-throughput image screening is an emerging filed that requires novel algorithms for dynamical system analysis, image processing, and statistical modeling. We hope that this article will motivate the signal processing communities to address challenging data modeling and other informatics issues of HTS

[1]  Xiaobo Zhou,et al.  Dynamic sub-cellular behavior study in high content imaging using a multi-threshold approach , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[2]  Douglas A. Creager,et al.  The Open Microscopy Environment (OME) Data Model and XML file: open tools for informatics and quantitative analysis in biological imaging , 2005, Genome Biology.

[3]  Kai Huang,et al.  Boosting accuracy of automated classification of fluorescence microscope images for location proteomics , 2004, BMC Bioinformatics.

[4]  N. Perrimon,et al.  Genome-Wide RNAi Analysis of Growth and Viability in Drosophila Cells , 2004, Science.

[5]  Xiaobo Zhou,et al.  High content cellular imaging for drug development , 2006 .

[6]  Kai Huang,et al.  Image content-based retrieval and automated interpretation of fluorescence microscope images via the protein subcellular location image database , 2002, Proceedings IEEE International Symposium on Biomedical Imaging.

[7]  Kartik Chandran,et al.  Endocytosis by Random Initiation and Stabilization of Clathrin-Coated Pits , 2004, Cell.

[8]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[9]  Vannary Meas-Yedid,et al.  Segmentation and tracking of migrating cells in videomicroscopy with parametric active contours: a tool for cell-based drug testing , 2002, IEEE Transactions on Medical Imaging.

[10]  Stephen T. C. Wong,et al.  TIME-LAPSE CELL CYCLE QUANTITATIVE DATA ANALYSIS USING GAUSSIAN MIXTURE MODELS , 2006 .

[11]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  B. Ripley The Second-Order Analysis of Stationary Point Processes , 1976 .

[14]  R. Eils,et al.  Computational imaging in cell biology , 2003, The Journal of cell biology.

[15]  Steen Knudsen,et al.  Guide to analysis of DNA microarray data , 2004 .

[16]  Luc Vincent,et al.  Morphological grayscale reconstruction in image analysis: applications and efficient algorithms , 1993, IEEE Trans. Image Process..

[17]  Lani F. Wu,et al.  Multidimensional Drug Profiling By Automated Microscopy , 2004, Science.

[18]  Xiaobo Zhou,et al.  Towards Automated Cellular Image Segmentation for RNAi Genome-Wide Screening , 2005, MICCAI.

[19]  T. Mitchison,et al.  Phenotypic screening of small molecule libraries by high throughput cell imaging. , 2003, Combinatorial chemistry & high throughput screening.

[20]  M K Cheezum,et al.  Quantitative comparison of algorithms for tracking single fluorescent particles. , 2001, Biophysical journal.

[21]  V. Makarenkov,et al.  Statistical Analysis of Systematic Errors in High-Throughput Screening , 2005, Journal of biomolecular screening.

[22]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[23]  R. Murphy,et al.  Objective Clustering of Proteins Based on Subcellular Location Patterns , 2005, Journal of biomedicine & biotechnology.

[24]  Milan Sonka,et al.  Mitotic cell recognition with hidden Markov models , 2004, Medical Imaging: Image-Guided Procedures.

[25]  Xiaobo Zhou,et al.  Automated segmentation, classification, and tracking of cancer cell nuclei in time-lapse microscopy , 2006, IEEE Transactions on Biomedical Engineering.

[26]  Robert F. Murphy,et al.  A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells , 2001, Bioinform..

[27]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[28]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.