Incorporating Priors on Expert Performance Parameters for Segmentation Validation and Label Fusion: A Maximum a Posteriori STAPLE

In order to evaluate the quality of segmentations of an image and assess intra- and inter-expert variability in segmentation performance, an Expectation Maximization (EM) algorithm for Simultaneous Truth And Performance Level Estimation (STAPLE) was recently developed. This algorithm, originally presented for segmentation validation, has since been used for many applications, such as atlas construction and decision fusion. However, the manual delineation of structures of interest is a very time consuming and burdensome task. Further, as the time required and burden of manual delineation increase, the accuracy of the delineation is decreased. Therefore, it may be desirable to ask the experts to delineate only a reduced number of structures or the segmentation of all structures by all experts may simply not be achieved. Fusion from data with some structures not segmented by each expert should be carried out in a manner that accounts for the missing information. In other applications, locally inconsistent segmentations may drive the STAPLE algorithm into an undesirable local optimum, leading to misclassifications or misleading experts performance parameters. We present a new algorithm that allows fusion with partial delineation and which can avoid convergence to undesirable local optima in the presence of strongly inconsistent segmentations. The algorithm extends STAPLE by incorporating prior probabilities for the expert performance parameters. This is achieved through a Maximum A Posteriori formulation, where the prior probabilities for the performance parameters are modeled by a beta distribution. We demonstrate that this new algorithm enables dramatically improved fusion from data with partial delineation by each expert in comparison to fusion with STAPLE.

[1]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[5]  Jean Meunier,et al.  Average Brain Models: A Convergence Study , 2000, Comput. Vis. Image Underst..

[6]  Guido Gerig,et al.  Valmet: A New Validation Tool for Assessing and Improving 3D Object Segmentation , 2001, MICCAI.

[7]  Wiro J. Niessen,et al.  Medical Image Computing and Computer-Assisted Intervention – MICCAI 2001 , 2001, Lecture Notes in Computer Science.

[8]  Ron Kikinis,et al.  Statistical validation of image segmentation quality based on a spatial overlap index. , 2004, Academic radiology.

[9]  Torsten Rohlfing,et al.  Evaluation of atlas selection strategies for atlas-based image segmentation with application to confocal microscopy images of bee brains , 2004, NeuroImage.

[10]  William M. Wells,et al.  Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation , 2004, IEEE Transactions on Medical Imaging.

[11]  Grégoire Malandain,et al.  Atlas-based delineation of lymph node levels in head and neck computed tomography images. , 2008, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[12]  Jerry L. Prince,et al.  Efficient Anatomical Labeling by Statistical Recombination of Partially Label Datasets , 2009 .

[13]  Leon G. Higley,et al.  Forensic Entomology: An Introduction , 2009 .

[14]  Simon K. Warfield,et al.  Automatic segmentation of newborn brain MRI , 2009, NeuroImage.