CellH5: a format for data exchange in high-content screening

Summary: High-throughput microscopy data require a diversity of analytical approaches. However, the construction of workflows that use algorithms from different software packages is difficult owing to a lack of interoperability. To overcome this limitation, we present CellH5, an HDF5 data format for cell-based assays in high-throughput microscopy, which stores high-dimensional image data along with inter-object relations in graphs. CellH5Browser, an interactive gallery image browser, demonstrates the versatility and performance of the file format on live imaging data of dividing human cells. CellH5 provides new opportunities for integrated data analysis by multiple software platforms. Availability: Source code is freely available at www.github.com/cellh5 under the GPL license and at www.bioconductor.org/packages/release/bioc/html/rhdf5.html under the Artistic-2.0 license. Demo datasets and the CellH5Browser are available at www.cellh5.org. A Fiji importer for cellh5 will be released soon. Contact: daniel.gerlich@imba.oeaw.ac.at or christoph.sommer@imba.oeaw.ac.at Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Bernd Fischer,et al.  CellCognition: time-resolved phenotype annotation in high-throughput live cell imaging , 2010, Nature Methods.

[2]  Joachim M Buhmann,et al.  Unsupervised modeling of cell morphology dynamics for time-lapse microscopy , 2012, Nature Methods.

[3]  Mario Niepel,et al.  Adaptive informatics for multi-factorial and high content biological data , 2011, Nature Methods.

[4]  Polina Golland,et al.  CellProfiler Analyst: data exploration and analysis software for complex image-based screens , 2008, BMC Bioinformatics.

[5]  C. Conrad,et al.  Automated microscopy for high-content RNAi screening , 2010, The Journal of cell biology.

[6]  B. S. Manjunath,et al.  Biological imaging software tools , 2012, Nature Methods.

[7]  Anne E Carpenter,et al.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes , 2006, Genome Biology.

[8]  C. Rueden,et al.  Metadata matters: access to image data in the real world , 2010, The Journal of cell biology.

[9]  Ullrich Köthe,et al.  Ilastik: Interactive learning and segmentation toolkit , 2011, 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[10]  Emden R. Gansner,et al.  Graphviz - Open Source Graph Drawing Tools , 2001, GD.

[11]  U. Brandes,et al.  GraphML Progress Report ? Structural Layer Proposal , 2001 .

[12]  Jan Ellenberg,et al.  Automatic identification and clustering of chromosome phenotypes in a genome wide RNAi screen by time-lapse imaging. , 2010, Journal of structural biology.

[13]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[14]  Robert F. Murphy,et al.  A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells , 2001, Bioinform..