A reliable method for cell phenotype image classification

OBJECTIVE Image-based approaches have proven to be of great utility in the automated cell phenotype classification, it is very important to develop a method that efficiently quantifies, distinguishes and classifies sub-cellular images. METHODS AND MATERIALS In this work, the invariant locally binary patterns (LBP) are applied, for the first time, to the classification of protein sub-cellular localization images. They are tested on three image datasets (available for download), in conjunction with support vector machines (SVMs) and random subspace ensembles of neural networks. Our method based on invariant LBP provides higher accuracy than other well-known methods for feature extraction; moreover, our method does not require to (direct) crop the cells for the classification. RESULTS AND CONCLUSION The experimental results show that the random subspace ensemble of neural networks outperforms the SVM in this problem. The proposed approach based on the solely LBP features gives accuracies of 85%, 93.9% and 88.4% on the 2D HeLa dataset, LOCATE endogenous and transfected datasets, respectively, and in combination with other state-of-the-art methods for the cell phenotype image classification we obtain a classification accuracy of 94.2%, 98.4% and 96.5%.

[1]  Jelena Kovacevic,et al.  A multiresolution approach to automated classification of protein subcellular location images , 2007, BMC Bioinformatics.

[2]  Wen Gao,et al.  Ensemble of Piecewise FDA Based on Spatial Histograms of Local (Gabor) Binary Patterns for Face Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[3]  Stefan Wiemann,et al.  LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system , 2004, Nucleic Acids Res..

[4]  Radosav S. Pantelic,et al.  Automated sub-cellular phenotype classification: an introduction and recent results , 2006 .

[5]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[6]  C. Conrad,et al.  Automatic identification of subcellular phenotypes on human cell arrays. , 2004, Genome research.

[7]  Kai Huang,et al.  Automated classification of subcellular patterns in multicell images without segmentation into single cells , 2004, 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821).

[8]  Loris Nanni,et al.  Comparison among feature extraction methods for HIV-1 protease cleavage site prediction , 2006, Pattern Recognit..

[9]  Ash A. Alizadeh,et al.  Towards a novel classification of human malignancies based on gene expression patterns , 2001, The Journal of pathology.

[10]  Kai Huang,et al.  Boosting accuracy of automated classification of fluorescence microscope images for location proteomics , 2004, BMC Bioinformatics.

[11]  K. Yeow,et al.  Cellular imaging in drug discovery , 2006, Nature Reviews Drug Discovery.

[12]  Yoshihiko Hamamoto,et al.  On the Behavior of Artificial Neural Network Classifiers in High-Dimensional Spaces , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[14]  R. Murphy,et al.  Objective Clustering of Proteins Based on Subcellular Location Patterns , 2005, Journal of biomedicine & biotechnology.

[15]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[17]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[18]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Jun Kawai,et al.  LOCATE: a mouse protein subcellular localization database , 2005, Nucleic Acids Res..

[21]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[22]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[23]  Robert F Murphy,et al.  Putting proteins on the map , 2006, Nature Biotechnology.

[24]  Nicholas A. Hamilton,et al.  Fast automated cell phenotype image classification , 2007, BMC Bioinformatics.

[25]  Chun-Nan Hsu,et al.  Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization , 2007, Bioinform..