An image score inference system for RNAi genome-wide screening based on fuzzy mixture regression modeling

With recent advances in fluorescence microscopy imaging techniques and methods of gene knock down by RNA interference (RNAi), genome-scale high-content screening (HCS) has emerged as a powerful approach to systematically identify all parts of complex biological processes. However, a critical barrier preventing fulfillment of the success is the lack of efficient and robust methods for automating RNAi image analysis and quantitative evaluation of the gene knock down effects on huge volume of HCS data. Facing such opportunities and challenges, we have started investigation of automatic methods towards the development of a fully automatic RNAi-HCS system. Particularly important are reliable approaches to cellular phenotype classification and image-based gene function estimation. We have developed a HCS analysis platform that consists of two main components: fluorescence image analysis and image scoring. For image analysis, we used a two-step enhanced watershed method to extract cellular boundaries from HCS images. Segmented cells were classified into several predefined phenotypes based on morphological and appearance features. Using statistical characteristics of the identified phenotypes as a quantitative description of the image, a score is generated that reflects gene function. Our scoring model integrates fuzzy gene class estimation and single regression models. The final functional score of an image was derived using the weighted combination of the inference from several support vector-based regression models. We validated our phenotype classification method and scoring system on our cellular phenotype and gene database with expert ground truth labeling. We built a database of high-content, 3-channel, fluorescence microscopy images of Drosophila Kc(167) cultured cells that were treated with RNAi to perturb gene function. The proposed informatics system for microscopy image analysis is tested on this database. Both of the two main components, automated phenotype classification and image scoring system, were evaluated. The robustness and efficiency of our system were validated in quantitatively predicting the biological relevance of genes.

[1]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[2]  K. Schittkowski,et al.  NONLINEAR PROGRAMMING , 2022 .

[3]  Stephen T. C. Wong,et al.  Cellular Phenotype Recognition for High-Content RNAi Genome-Wide Screening , 2007 .

[4]  I. Daubechies,et al.  Biorthogonal bases of compactly supported wavelets , 1992 .

[5]  Robert F. Murphy,et al.  A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells , 2001, Bioinform..

[6]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[7]  Wilson S. Geisler,et al.  Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[9]  M. Teague Image analysis via the general theory of moments , 1980 .

[10]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[11]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[12]  Denis I. Crane,et al.  Extraction of fluorescent cell puncta by adaptive fuzzy segmentation , 2004, Bioinform..

[13]  Stephen T. C. Wong,et al.  Cellular Phenotype Recognition for High-Content RNA Interference Genome-Wide Screening , 2008, Journal of biomolecular screening.

[14]  D. Yamazaki,et al.  Regulation of cancer cell motility through actin reorganization , 2005, Cancer science.

[15]  T. W. Ridler,et al.  Picture thresholding using an iterative selection method. , 1978 .

[16]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[17]  David G. Stork,et al.  Pattern Classification , 1973 .

[18]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[19]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[20]  Xiaobo Zhou,et al.  Informatics challenges of high-throughput microscopy , 2006, IEEE Signal Processing Magazine.

[21]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Anne E Carpenter,et al.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes , 2006, Genome Biology.

[23]  Nicholas Ayache,et al.  Medical Image Analysis: Progress over Two Decades and the Challenges Ahead , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[25]  J. Settleman Rac 'n Rho: the music that shapes a developing embryo. , 2001, Developmental cell.

[26]  Lani F. Wu,et al.  Multidimensional Drug Profiling By Automated Microscopy , 2004, Science.

[27]  von F. Zernike Beugungstheorie des schneidenver-fahrens und seiner verbesserten form, der phasenkontrastmethode , 1934 .

[28]  A. Hall,et al.  Rho GTPases and their effector proteins. , 2000, The Biochemical journal.

[29]  Xiaobo Zhou,et al.  Towards Automated Cellular Image Segmentation for RNAi Genome-Wide Screening , 2005, MICCAI.

[30]  A. Coulson,et al.  A functional genomic analysis of cell morphology using RNA interference , 2003, Journal of biology.

[31]  Stephen T. C. Wong,et al.  An automated feedback system with the hybrid model of scoring and classification for solving over‐segmentation problems in RNAi high content screening , 2007, Journal of microscopy.

[32]  N. Perrimon,et al.  High-throughput RNAi screening in cultured cells: a user's guide , 2006, Nature Reviews Genetics.

[33]  Krister Wennerberg,et al.  Rho and Rac Take Center Stage , 2004, Cell.

[34]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[35]  T. Mitchison,et al.  Phenotypic screening of small molecule libraries by high throughput cell imaging. , 2003, Combinatorial chemistry & high throughput screening.

[36]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[37]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[38]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[39]  Guanglei Xiong,et al.  Automated Segmentation of Drosophila RNAi Fluorescence Cellular Images Using Deformable Models , 2006, IEEE Transactions on Circuits and Systems I: Regular Papers.

[40]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .