Automating image segmentation verification and validation by learning test oracles

An image segmentation algorithm delineates (an) object(s) of interest in an image. Its output is referred to as a segmentation. Developing these algorithms is a manual, iterative process involving repetitive verification and validation tasks. This process is time-consuming and depends on the availability of experts, who may be a scarce resource (e.g., medical experts). We propose a framework referred to as Image Segmentation Automated Oracle (ISAO) that uses machine learning to construct an oracle, which can then be used to automatically verify the correctness of image segmentations, thus saving substantial resources and making the image segmentation verification and validation task significantly more efficient. The framework also gives informative feedback to the developer as the segmentation algorithm evolves and provides a systematic means of testing different parametric configurations of the algorithm. During the initial learning phase, segmentations from the first few (optimally two) versions of the segmentation algorithm are manually verified by experts. The similarity of successive segmentations of the same images is also measured in various ways. This information is then fed to a machine learning algorithm to construct a classifier that distinguishes between consistent and inconsistent segmentation pairs (as determined by an expert) based on the values of the similarity measures associated with each segmentation pair. Once the accuracy of the classifier is deemed satisfactory to support a consistency determination, the classifier is then used to determine whether the segmentations that are produced by subsequent versions of the algorithm under test, are (in)consistent with already verified segmentations from previous versions. This information is then used to automatically draw conclusions about the correctness of the segmentations. We have successfully applied this approach to 3D segmentations of the cardiac left ventricle obtained from CT scans and have obtained promising results (accuracies of 95%). Even though more experiments are needed to quantify the effectiveness of the approach in real-world applications, ISAO shows promise in increasing the quality and testing efficiency of image segmentation algorithms.

[1]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[2]  Jerry L. Prince,et al.  A Survey of Current Methods in Medical Image Segmentation , 1999 .

[3]  A. Jefferson Offutt,et al.  Book review: Introduction to Software Testing written by Paul Amman & Jeff Offutt. and published by CUP, 2008, 978-0-521-88038 322 pp., 0-471-20282-7 , 2008, SOEN.

[4]  David G. Stork,et al.  Evaluating Classifiers by Means of Test Data with Noisy Labels , 2003, IJCAI.

[5]  Noel E. O'Connor,et al.  Image segmentation evaluation using an integrated framework , 2007 .

[6]  Brian B. Avants,et al.  A learning-based wrapper method to correct systematic errors in automatic image segmentation: Consistently improved performance in hippocampus, cortex and brain segmentation , 2011, NeuroImage.

[7]  David Malah,et al.  A study of edge detection algorithms , 1982, Comput. Graph. Image Process..

[8]  Hélène Laurent,et al.  Unsupervised and Supervised Image Segmentation Evaluation , 2006 .

[9]  Yu-Jin Zhang,et al.  Advances in image and video segmentation , 2006 .

[10]  Paul Ammann,et al.  Introduction to Software Testing: Introduction , 2008 .

[11]  Cemal Yilmaz,et al.  Software Metrics , 2008, Wiley Encyclopedia of Computer Science and Engineering.

[12]  Gail E. Kaiser,et al.  Automatic system testing of programs without test oracles , 2009, ISSTA.

[13]  I.E. Abdou,et al.  Quantitative design and evaluation of enhancement/thresholding edge detectors , 1979, Proceedings of the IEEE.

[14]  Aditya P. Mathur,et al.  Foundations of Software Testing , 2007 .

[15]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Hui Zhang,et al.  Meta-Evaluation of Image Segmentation Using Machine Learning , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  Elaine J. Weyuker,et al.  Pseudo-oracles for non-testable programs , 1981, ACM '81.

[18]  Nancy G. Leveson,et al.  An experimental evaluation of the assumption of independence in multiversion programming , 1986, IEEE Transactions on Software Engineering.

[19]  Jerry L Prince,et al.  Current methods in medical image segmentation. , 2000, Annual review of biomedical engineering.

[20]  Kambiz Frounchi Learning a test Oracle towards automating image segmentation evaluation , 2009 .

[21]  Hugues Benoit-Cattin,et al.  Scalable discrepancy measures for segmentation evaluation , 2002, Proceedings. International Conference on Image Processing.

[22]  Shing-Chi Cheung,et al.  PAT: A pattern classification approach to automatic reference oracles for the testing of mesh simplification programs , 2009, J. Syst. Softw..

[23]  Jayaram K. Udupa,et al.  A framework for evaluating image segmentation algorithms , 2006, Comput. Medical Imaging Graph..

[24]  Tinku Acharya,et al.  Image Processing: Principles and Applications , 2005, J. Electronic Imaging.

[25]  Debra J. Richardson,et al.  Specification-based test oracles for reactive systems , 1992, International Conference on Software Engineering.

[26]  Lei Zhu,et al.  On Simulating Subjective Evaluation Using Combined Objective Metrics for Validation of 3D Tumor Segmentation , 2007, MICCAI.

[27]  Geoff Holmes,et al.  Benchmarking Attribute Selection Techniques for Discrete Class Data Mining , 2003, IEEE Trans. Knowl. Data Eng..

[28]  Paul Ammann,et al.  Data Diversity: An Approach to Software Fault Tolerance , 1988, IEEE Trans. Computers.

[29]  Oscar Camara,et al.  Generalized Overlap Measures for Evaluation and Validation in Medical Image Analysis , 2006, IEEE Transactions on Medical Imaging.

[30]  Aurelio Matamoros Image Processing in Radiology: Current Applications , 2008, Journal of Nuclear Medicine.

[31]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[32]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[33]  Ian Witten,et al.  Data Mining , 2000 .

[34]  Elaine J. Weyuker,et al.  On Testing Non-Testable Programs , 1982, Comput. J..

[35]  A. Baddeley An Error Metric for Binary Images , 1992 .

[36]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[37]  Tamar Peli,et al.  A study of edge detection algorithms , 1982, Comput. Graph. Image Process..

[38]  Hong Sun,et al.  Investigating the use of analysis contracts to improve the testability of object‐oriented code , 2003, Softw. Pract. Exp..

[39]  Azriel Rosenfeld,et al.  Digital geometry - geometric methods for digital picture analysis , 2004 .

[40]  Wilkerson de L. Andrade,et al.  The Oracle Problem for Testing against Quantified Properties , 2007, Seventh International Conference on Quality Software (QSIC 2007).

[41]  Tinku Acharya,et al.  Image processing , 2005 .

[42]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.