Algorithm sensitivity analysis and parameter tuning for tissue image segmentation pipelines

Motivation: Sensitivity analysis and parameter tuning are important processes in large‐scale image analysis. They are very costly because the image analysis workflows are required to be executed several times to systematically correlate output variations with parameter changes or to tune parameters. An integrated solution with minimum user interaction that uses effective methodologies and high performance computing is required to scale these studies to large imaging datasets and expensive analysis workflows. Results: The experiments with two segmentation workflows show that the proposed approach can (i) quickly identify and prune parameters that are non‐influential; (ii) search a small fraction (about 100 points) of the parameter search space with billions to trillions of points and improve the quality of segmentation results (Dice and Jaccard metrics) by as much as 1.42× compared to the results from the default parameters; (iii) attain good scalability on a high performance cluster with several effective optimizations. Conclusions: Our work demonstrates the feasibility of performing sensitivity analyses, parameter studies and auto‐tuning with large datasets. The proposed framework can enable the quantification of error estimations and output variations in image segmentation pipelines. Availability and Implementation: Source code: https://github.com/SBU‐BMI/region‐templates/. Contact: teodoro@unb.br Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Richard J. Beckman,et al.  A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output From a Computer Code , 2000, Technometrics.

[2]  I. Sobola,et al.  Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates , 2001 .

[3]  AJ Pretorius,et al.  Visual parameter optimisation for biomedical image processing , 2015, BMC Bioinformatics.

[4]  Joel H. Saltz,et al.  Hierarchical nucleus segmentation in digital pathology images , 2016, SPIE Medical Imaging.

[5]  Vahid Tabatabaee,et al.  Parallel Parameter Tuning for Applications with Performance Variability , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[6]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[7]  Max D. Morris,et al.  Factorial sampling plans for preliminary computational experiments , 1991 .

[8]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Joel H. Saltz,et al.  Machine-Based Morphologic Analysis of Glioblastoma Using Whole-Slide Pathology Images Uncovers Clinically Relevant Molecular Correlates , 2013, PloS one.

[10]  Nikolaos V. Sahinidis,et al.  Derivative-free optimization: a review of algorithms and comparison of software implementations , 2013, J. Glob. Optim..

[11]  Andrea Saltelli,et al.  An effective screening design for sensitivity analysis of large models , 2007, Environ. Model. Softw..

[12]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[13]  Derek Hoiem,et al.  Learning CRFs Using Graph Cuts , 2008, ECCV.

[14]  Thomas Schultz,et al.  Open-Box Spectral Clustering: Applications to Medical Image Analysis , 2013, IEEE Transactions on Visualization and Computer Graphics.

[15]  Carlo Meloni,et al.  Uncertainty Management in Simulation-Optimization of Complex Systems : Algorithms and Applications , 2015 .

[16]  Stefano Tarantola,et al.  Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models , 2004 .

[17]  M. D. McKay,et al.  A comparison of three methods for selecting values of input variables in the analysis of output from a computer code , 2000 .

[18]  P. Suñé,et al.  Positive Outcomes Influence the Rate and Time to Publication, but Not the Impact Factor of Publications of Clinical Trial Results , 2013, PloS one.

[19]  Salvatore Di Gregorio,et al.  Modelling, computer-assisted simulations, and mapping of dangerous phenomena for hazard assessment , 2007, Environ. Model. Softw..

[20]  Jun Kong,et al.  Region templates: Data representation and management for high-throughput image analysis , 2014, Parallel Comput..

[21]  Hailong Zhu,et al.  Predicting protein functions using incomplete hierarchical labels , 2015, BMC Bioinformatics.

[22]  William J. Rider,et al.  Sensitivity analysis techniques applied to a system of hyperbolic conservation laws , 2012, Reliab. Eng. Syst. Saf..

[23]  Bruno Sareni,et al.  Fitness sharing and niching methods revisited , 1998, IEEE Trans. Evol. Comput..

[24]  B. Iooss,et al.  A Review on Global Sensitivity Analysis Methods , 2014, 1404.2405.

[25]  A. Saltelli,et al.  Making best use of model evaluations to compute sensitivity indices , 2002 .

[26]  Hans-Christian Hege,et al.  Tuner: Principled Parameter Finding for Image Segmentation Algorithms Using Visual Response Surface Exploration , 2011, IEEE Transactions on Visualization and Computer Graphics.

[27]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[28]  Ghassan Hamarneh,et al.  Is a Single Energy Functional Sufficient? Adaptive Energy Functionals and Automatic Initialization , 2007, MICCAI.