Critical texture pattern feature assessment for characterizing colonies of induced pluripotent stem cells through machine learning techniques

The objectives of this study are to assess various automated texture features obtained from the segmented colony regions of induced pluripotent stem cells (iPSCs) and confirm their potential for characterizing the colonies using different machine learning techniques. One hundred and fifty-one features quantified using shape-based, moment-based, statistical and spectral texture feature groups are extracted from phase-contrast microscopic colony images of iPSCs. The forward stepwise regression model is implemented to select the most appropriate features required for categorizing the colonies. Support vector machine (SVM), random forest (RF), multilayer perceptron (MLP), decision tree (DT), and adaptive boosting (Adaboost) classifiers are used with ten-fold cross-validation to evaluate the texture features within each texture feature group and fused-features group to characterize healthy and unhealthy colonies of iPSCs. Overall, based on the classification performances of the four texture feature groups using the five classifier models, statistical features always exhibit a high predictive capacity (>87.5%). However, the classification performance using fused texture patterns with statistical, shape-based, and moment-based features was found to be robust and reliable with fewer false positive and false negative values compared to the features when either one is used for the classification of colonies of iPSCs. Furthermore, the results showcase that the SVM, RF and Adaboost classifiers deliver better classification performances than DT and MLP. Our findings suggest that the proposed automated fused statistical, shape-based, and moment-based texture pattern features trained with machine learning techniques are potentially more appropriate and helpful to biologists for characterizing colonies of stem cells.

[1]  Nils Daniel Forkert,et al.  Classifiers for Ischemic Stroke Lesion Segmentation: A Comparison Study , 2015, PloS one.

[2]  Shereen M. El-Metwally,et al.  Decision tree classifiers for automated medical diagnosis , 2013, Neural Computing and Applications.

[3]  T. Ichisaka,et al.  Generation of germline-competent induced pluripotent stem cells , 2007, Nature.

[4]  C. J. Huberty,et al.  Applied Discriminant Analysis , 1994 .

[5]  Hiroto Sasaki,et al.  Visualization of morphological categories of colonies for monitoring of effect on induced pluripotent stem cell culture status , 2017, Regenerative therapy.

[6]  Zheng Rong Yang,et al.  Biological applications of support vector machines , 2004, Briefings Bioinform..

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Bir Bhanu,et al.  Dynamic Low-Level Context for the Detection of Mild Traumatic Brain Injury , 2015, IEEE Transactions on Biomedical Engineering.

[9]  Sidong Liu,et al.  Cross-View Neuroimage Pattern Analysis in Alzheimer's Disease Staging , 2016, Front. Aging Neurosci..

[10]  Kazufumi Kaneda,et al.  Automatic Detection of Good/Bad Colonies of iPS Cells Using Local Features , 2015, MLMI.

[11]  Ryan Stables,et al.  Assisted Differentiated Stem Cell Classification in Infrared Spectroscopy Using Auditory Feedback , 2014 .

[12]  Seong-Hoon Kim,et al.  X-ray Image Classification Using Random Forests with Local Wavelet-Based CS-Local Binary Patterns , 2011, Journal of Digital Imaging.

[13]  G. Makris,et al.  Image analysis and multi‐layer perceptron artificial neural networks for the discrimination between benign and malignant endometrial lesions , 2017, Diagnostic cytopathology.

[14]  Belur V. Dasarathy,et al.  Medical Image Fusion: A survey of the state of the art , 2013, Inf. Fusion.

[15]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[16]  Dietmar Saupe,et al.  Chaos and fractals - new frontiers of science , 1992 .

[17]  D. Roop,et al.  Differentiation of human induced pluripotent stem cells into a keratinocyte lineage. , 2014, Methods in molecular biology.

[18]  Michael Unser,et al.  Sum and Difference Histograms for Texture Classification , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  David G. Stork,et al.  Pattern Classification , 1973 .

[20]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[22]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[23]  Gonzalo R. Arce,et al.  Nonlinear Signal Processing - A Statistical Approach , 2004 .

[24]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[25]  B. Bhanu,et al.  Evaluating Cell Processes, Quality, and Biomarkers in Pluripotent Stem Cells Using Video Bioinformatics , 2016, PloS one.

[26]  R. Durbin,et al.  A high-content platform to characterise human induced pluripotent stem cell lines , 2016, Methods.

[27]  S. Pavlopoulos,et al.  A decision tree – based method for the differential diagnosis of Aortic Stenosis from Mitral Regurgitation using heart sounds , 2004, Biomedical engineering online.

[28]  Liana G. Apostolova,et al.  Comparison of AdaBoost and Support Vector Machines for Detecting Alzheimer's Disease Through Automated Hippocampal Segmentation , 2010, IEEE Transactions on Medical Imaging.

[29]  G. Falco,et al.  Automated Identification and Location Analysis of Marked Stem Cells Colonies in Optical Microscopy Images , 2013, PloS one.

[30]  Jyrki Rasku,et al.  Machine Learning Approach to Automated Quality Identification of Human Induced Pluripotent Stem Cell Colony Images , 2016, Comput. Math. Methods Medicine.

[31]  Wei-bang Chen,et al.  An automated bacterial colony counting and classification system , 2009, Inf. Syst. Frontiers.

[32]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[33]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..