CellProfiler Analyst: interactive data exploration, analysis and classification of large biological image sets

Abstract

Summary: CellProfiler Analyst allows the exploration and visualization of image-based data, together with the classification of complex biological phenotypes, via an interactive user interface designed for biologists and data scientists. CellProfiler Analyst 2.0, completely rewritten in Python, builds on these features and adds enhanced supervised machine learning capabilities (Classifier), as well as visualization tools to overview an experiment (Plate Viewer and Image Gallery).

Availability and Implementation: CellProfiler Analyst 2.0 is free and open source, available at http://www.cellprofiler.org and from GitHub (https://github.com/CellProfiler/CellProfiler-Analyst) under the BSD license. It is available as a packaged application for Mac OS X and Microsoft Windows and can be compiled for Linux. We implemented an automatic build process that supports nightly updates and regular release cycles for the software.

Contact: anne@broadinstitute.org

Supplementary information: Supplementary data are available at Bioinformatics online.
