Automated ranking of stem cell colonies by translating biological rules to computational models

This paper addresses the problem of automating an image ranking process for stem cell colonies. We automate the manual process in a novel way: instead of fitting off-the-shelf image features and colony ranks to prediction models, we define a new feature set that uniquely characterizes the visual clues from images of the colonies and biological rules experts use to rank colonies from image data. Our automation considers several factors: the inconsistency of manually assigned stem cell colony ranks, the type of image segmentation to detect stem cell colonies (manual and automated), the type of image feature set (off-the-shelf vs. custom designed), and an underlying relationship between input colony features and output stem cell colony ranks (linear and nonlinear). The novelty of our work lies in automating stem cell colony ranking while preserving the connection between visually perceived quality characteristics of stem cell colonies, and image colony features combined with a computational prediction model. The main contribution of our work is in demonstrating the benefits of direct interpretation of biological rules to automation of stem cell colony ranking. We also outline a method for establishing relationships between the commonly used Haralick features and our custom-designed features.

[1]  Lluís A. Belanche Muñoz,et al.  Feature selection algorithms: a survey and experimental evaluation , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[2]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[3]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[4]  Johannes E. Schindelin,et al.  Fiji: an open-source platform for biological-image analysis , 2012, Nature Methods.

[5]  Anil K. Jain,et al.  Feature Selection: Evaluation, Application, and Small Sample Performance , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[7]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Huan Liu,et al.  Toward integrating feature selection algorithms for classification and clustering , 2005, IEEE Transactions on Knowledge and Data Engineering.

[9]  Kai Huang,et al.  Boosting accuracy of automated classification of fluorescence microscope images for location proteomics , 2004, BMC Bioinformatics.

[10]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[11]  C. Bradhurst Monitoring mesenchymal stem cell cultures using image processing and pattern recognition techniques , 2010 .

[12]  P. Groves,et al.  Methodology For Hyperspectral Band Selection , 2004 .

[13]  Mukund Desai,et al.  NON-INVASIVE IMAGE BASED SUPPORT VECTOR MACHINE CLASSIFICATION OF HUMAN EMBRYONIC STEM CELLS , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.