Fusion of systems for automated cell phenotype image classification

Automated cell phenotype image classification is related to the problem of determining locations of protein expression within living cells. Localization of proteins in cells is directly related to their functions and it is crucial for several applications ranging from early diagnosis of a disease to monitoring of therapeutic effectiveness of drugs. Recent advances in imaging instruments and biological reagents have allowed fluorescence microscopy to be extensively used as a tool to understand biology at the cellular level by means of the visualization of biological activity within cells. However, human classification of fluorescence cell micrographs is still subjective and very time consuming, thus an automated approach for the systematic determination of protein subcellular locations from fluorescence microscopy images is required. Existing approaches concentrated on designing a set of optimal features and then applying standard machine-learning algorithms. This paper takes into consideration the best methods proposed in the literature and focuses on the study of ensemble machine learning techniques for cell phenotype image classification. Two techniques are tested for the classification: a random subspace of Levenberg-Marquardt neural networks and a variant of the AdaBoost. Each of these two methods are tested with different feature sets, moreover the fusion between the two ensembles is studied. The best ensemble tested in this work obtains an outstanding 97.5% accuracy in the 2D-Hela dataset, which to the best of our knowledge is the best performance obtained on this dataset (the most used benchmark for comparing automated cell phenotype image classification approaches).

[1]  Kai Huang,et al.  Boosting accuracy of automated classification of fluorescence microscope images for location proteomics , 2004, BMC Bioinformatics.

[2]  Robert F. Murphy,et al.  A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells , 2001, Bioinform..

[3]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Nicholas A. Hamilton,et al.  Fast automated cell phenotype image classification , 2007, BMC Bioinformatics.

[5]  M V Boland,et al.  Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images. , 1998, Cytometry.

[6]  C. Conrad,et al.  Automatic identification of subcellular phenotypes on human cell arrays. , 2004, Genome research.

[7]  R. Murphy,et al.  Automated subcellular location determination and high-throughput microscopy. , 2007, Developmental cell.

[8]  Jelena Kovacevic,et al.  A multiresolution approach to automated classification of protein subcellular location images , 2007, BMC Bioinformatics.

[9]  Robert F. Murphy,et al.  Automated image analysis of protein localization in budding yeast , 2007, ISMB/ECCB.

[10]  Jian Li,et al.  Unifying the error-correcting and output-code AdaBoost within the margin framework , 2005, ICML.

[11]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[12]  Petra Perner,et al.  Mining knowledge for HEp-2 cell image classification , 2002, Artif. Intell. Medicine.

[13]  P Bork,et al.  Wanted: subcellular localization of proteins based on sequence. , 1998, Trends in cell biology.

[14]  Ling Li,et al.  Multiclass boosting with repartitioning , 2006, ICML.

[15]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[16]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Robert F. Murphy,et al.  Automated comparison of protein subcellular location patterns between images of normal and cancerous tissues , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[18]  M. Markey,et al.  Classification of protein localization patterns obtained via fluorescence light microscopy , 1997, Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 'Magnificent Milestones and Emerging Opportunities in Medical Engineering' (Cat. No.97CH36136).

[19]  K. Nakai,et al.  PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. , 1999, Trends in biochemical sciences.

[20]  Robert E. Schapire,et al.  Using output codes to boost multiclass learning problems , 1997, ICML.

[21]  H. Singh,et al.  UWIT: underwater image toolbox for optical image processing and mosaicking in MATLAB , 2002, Proceedings of the 2002 Interntional Symposium on Underwater Technology (Cat. No.02EX556).

[22]  R. Murphy,et al.  Objective Clustering of Proteins Based on Subcellular Location Patterns , 2005, Journal of biomedicine & biotechnology.

[23]  A. Danckaert,et al.  Automated Recognition of Intracellular Organelles in Confocal Microscope Images , 2002, Traffic.

[24]  Loris Nanni,et al.  A reliable method for cell phenotype image classification , 2008, Artif. Intell. Medicine.

[25]  Loris Nanni,et al.  Ensemble of Neural Networks for Automated Cell Phenotype Image Classification , 2010 .

[26]  Venkatesan Guruswami,et al.  Multiclass learning, boosting, and error-correcting codes , 1999, COLT '99.

[27]  Chun-Nan Hsu,et al.  Boosting multiclass learning with repeating codes and weak detectors for protein subcellular localization , 2007, Bioinform..

[28]  Kai Huang,et al.  Automated classification of subcellular patterns in multicell images without segmentation into single cells , 2004, 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821).

[29]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  R.M. Haralick,et al.  Statistical and structural approaches to texture , 1979, Proceedings of the IEEE.

[32]  Loris Nanni,et al.  Comparison among feature extraction methods for HIV-1 protease cleavage site prediction , 2006, Pattern Recognit..

[33]  Meel Velliste,et al.  Automated interpretation of subcellular patterns in fluorescence microscope images for location proteomics , 2006, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[34]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .