A methodology for morphological feature extraction and unsupervised cell classification

Cell morphology is an important indicator of cell state, function, stage of development, and fate in both normal and pathological conditions. Cell shape is among key indicators used by pathologists to identify abnormalities or malignancies. With rapid advancements in the speed and amount of biological data acquisition, including images and movies of cells, computer-assisted identification and analysis of images becomes essential. Here, we report on techniques for recognition of cells in microscopic images and automated cell shape classification. We illustrate how our unsupervised machine-learning-based approach can be used to classify distinct cell shapes from a large number of microscopic images. Technical Abstract We develop a methodology to segment cells from microscopy images and compute quantitative descriptors that characterize their morphology. Using unsupervised techniques for dimensionality reduction and density-based clustering, we perform label-free cell shape classification. Cells are identified with minimal user input using mathematical morphology and region-growing segmentation methods. Physical quantities describing cell shape and size (including area, perimeter, Feret diameters, etc.) are computed along with other features including shape factors and Hu’s image moments. Correlated features are combined to obtain a low-dimensional (2-D or 3-D) embedding of data points corresponding to individual segmented cell shapes. Finally, a hierarchical density-based clustering algorithm (HDBSCAN) is used to classify cells. We compare cell classification results obtained from different combinations of features to identify a feature set that delivers optimum classification performance for our test data consisting of phase-contrast microscopy images of a pancreatic-cancer cell line, MIA PaCa-2.

[1]  Cristina Urdiales,et al.  Non-parametric planar shape representation based on adaptive curvature functions , 2002, Pattern Recognit..

[2]  Ata Mahjoubfar,et al.  Deep Learning in Label-free Cell Classification , 2016, Scientific Reports.

[3]  Anne E Carpenter,et al.  Quantifying co-cultured cell phenotypes in high-throughput using pixel-based classification. , 2015, Methods.

[4]  Erik H. W. Meijering,et al.  Cell Segmentation: 50 Years Down the Road [Life Sciences] , 2012, IEEE Signal Processing Magazine.

[5]  Cecilia Di Ruberto,et al.  White Blood Cells Identication and Classication from Leukemic Blood Image , 2013, IWBBIO.

[6]  Maria Filomena Botelho,et al.  MIA PaCa-2 and PANC-1 – pancreas ductal adenocarcinoma cell lines with neuroendocrine differentiation and somatostatin receptors , 2016, Scientific Reports.

[7]  Hongshen Ma,et al.  Morphological Differences between Circulating Tumor Cells from Prostate Cancer Patients and Cultured Prostate Cancer Cells , 2014, PloS one.

[8]  Hubert Fonga Pattern recognition in gray-level images by Fourier analysis , 1996, Pattern Recognit. Lett..

[9]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[10]  Fernand Meyer The watershed concept and its use in segmentation : a brief history , 2012, ArXiv.

[11]  Ricardo J. G. B. Campello,et al.  Density-Based Clustering Based on Hierarchical Density Estimates , 2013, PAKDD.

[12]  Hiroyuki Honda,et al.  Morphology-Based Prediction of Osteogenic Differentiation Potential of Human Mesenchymal Stem Cells , 2013, PloS one.

[13]  Z. P. I N C U S,et al.  Comparison of quantitative methods for cell-shape analysis , 2007 .

[14]  M. Tackx,et al.  The Geometric Features, Shape Factors and Fractal Dimensions of Suspended Particulate Matter in the Scheldt Estuary (Belgium) , 1999 .

[15]  Verônica A. Grieneisen,et al.  Morphometrics of complex cell shapes: lobe contribution elliptic Fourier analysis (LOCO-EFA) , 2017, Development.

[16]  Stefan Steinerberger,et al.  Clustering with t-SNE, provably , 2017, SIAM J. Math. Data Sci..

[17]  Christoph Sommer,et al.  Machine learning in cell biology – teaching computers to recognize phenotypes , 2013, Journal of Cell Science.

[18]  Fabian Rudolf,et al.  Accurate cell segmentation in microscopy images using membrane patterns , 2014, Bioinform..

[19]  E. Olson,et al.  Particle Shape Factors and Their Use in Image Analysis – Part 1 : Theory , 2013 .

[20]  Selim Aksoy,et al.  Unsupervised segmentation and classification of cervical cell images , 2012, Pattern Recognit..

[21]  K. Inoue,et al.  A method for calculating the perimeter of objects for automatic recognition of circular defects , 1987 .

[22]  Samanthe M. Lyons,et al.  Measuring systematic changes in invasive cancer cell shape using Zernike moments. , 2016, Integrative biology : quantitative biosciences from nano to macro.

[23]  Manuele Bicego,et al.  A bioinformatics approach to 2D shape classification , 2016, Comput. Vis. Image Underst..

[24]  Yaling Pei,et al.  A Synthetic Data Generator for Clustering and Outlier Analysis , 2006 .

[25]  L. Nanni,et al.  Texture Descriptors Ensembles Enable Image-Based Classification of Maturation of Human Stem Cell-Derived Retinal Pigmented Epithelium , 2016, PloS one.

[26]  G. Dunn,et al.  Alignment of fibroblasts on grooved surfaces described by a simple geometric transformation. , 1986, Journal of cell science.

[27]  M. Macé « "Le réel à l'état passé" : passion de l'archive et reflux du récit » , 2008 .

[28]  Andreas Luch,et al.  Cell shape characterization and classification with discrete Fourier transforms and self‐organizing maps , 2018, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[29]  Xavier Vasques,et al.  Morphological Neuron Classification Using Machine Learning , 2016, Front. Neuroanat..

[30]  Csaba Legány,et al.  Cluster validity measurement techniques , 2006 .

[31]  J. Nevalainen,et al.  Morphological Clustering of Cell Cultures Based on Size, Shape, and Texture Features , 2016 .

[32]  Richard J. Prokop,et al.  A survey of moment-based techniques for unoccluded object representation and recognition , 1992, CVGIP Graph. Model. Image Process..

[33]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[34]  R. Halír Numerically Stable Direct Least Squares Fitting of Ellipses , 1998 .

[35]  A. Enis Cetin,et al.  Image Classification of Human Carcinoma Cells Using Complex Wavelet-Based Covariance Descriptors , 2013, PloS one.

[36]  Sansanee Auephanwiriyakul,et al.  Automatic cervical cell segmentation and classification in Pap smears , 2014, Comput. Methods Programs Biomed..

[37]  Jesus A. Gonzalez,et al.  Segmentation and Classification of Bone Marrow Cells Images Using Contextual Information for Medical Diagnosis of Acute Leukemias , 2015, PloS one.

[38]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[39]  Justin W. L. Wan,et al.  A combined watershed and level set method for segmentation of brightfield cell images , 2009, Medical Imaging.

[40]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[41]  Pravesh Kothari,et al.  An Analysis of the t-SNE Algorithm for Data Visualization , 2018, COLT.

[42]  H Q Sun,et al.  Adaptive watershed segmentation of binary particle image , 2009, Journal of microscopy.

[43]  Dwi Anoraganingrum,et al.  Cell segmentation with median filter and mathematical morphology operation , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[44]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[45]  J. Flusser,et al.  Numerically Stable Direct Least Squares Fitting of Ellipses , 1998 .

[46]  A. Yunis,et al.  Human pancreatic carcinoma (mia paca‐2) in continuous culture: Sensitivity to asparaginase , 1977, International journal of cancer.

[47]  Ashok Samal,et al.  A simple method for fitting of bounding rectangle to closed regions , 2007, Pattern Recognit..

[48]  Martin Wattenberg,et al.  How to Use t-SNE Effectively , 2016 .