Geometry in Active Learning for Binary and Multi-class Image Segmentation

Abstract We propose an active learning approach to image segmentation that exploits geometric priors to speed up and streamline the annotation process. It applies to both background–foreground and multi-class segmentation tasks in 2D images and 3D image volumes. Our approach combines geometric smoothness priors in the image space with more traditional uncertainty measures to estimate which pixels or voxels are the most informative and should therefore be annotated next. For multi-class settings, we additionally introduce two novel uncertainty criteria. In the 3D case, we use the resulting uncertainty measure to select voxels lying on a planar patch, which makes batch annotation much more convenient for the end user than when the voxels to annotate are randomly distributed in the volume. The planar patch is found using a branch-and-bound algorithm that searches the 3D volume for the 2D patch containing the most informative instances. We evaluate our approach on Electron Microscopy and Magnetic Resonance image volumes, as well as on regular images of horses and faces. We demonstrate a substantial performance increase over other approaches thanks to the use of geometric priors.
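The idea of combining a per-pixel uncertainty measure with a smoothness prior over neighboring pixels can be illustrated with a minimal sketch. This is not the formulation used in the paper: the function names, the 4-connected neighbor averaging, the equal weighting, and the use of Shannon entropy for both terms are all assumptions made here for illustration only.

```python
import numpy as np


def shannon_entropy(probs, eps=1e-12):
    """Shannon entropy of class probabilities along the last axis."""
    p = np.clip(probs, eps, 1.0)
    return -np.sum(p * np.log(p), axis=-1)


def neighbor_average(probs):
    """Mean class probabilities of the 4-connected neighbors on a 2D grid.

    probs has shape (H, W, C). Borders are edge-padded, so a border pixel
    counts itself in place of each missing neighbor (an assumption).
    """
    padded = np.pad(probs, ((1, 1), (1, 1), (0, 0)), mode="edge")
    up = padded[:-2, 1:-1]
    down = padded[2:, 1:-1]
    left = padded[1:-1, :-2]
    right = padded[1:-1, 2:]
    return (up + down + left + right) / 4.0


def combined_uncertainty(probs, weight=0.5):
    """Blend classifier entropy with entropy of the neighbor-implied labels.

    `weight` is a hypothetical mixing parameter, not taken from the paper.
    """
    feature_unc = shannon_entropy(probs)
    geometric_unc = shannon_entropy(neighbor_average(probs))
    return (1.0 - weight) * feature_unc + weight * geometric_unc


def most_informative(probs, k=1):
    """Return the (row, col) indices of the k highest-scoring pixels."""
    scores = combined_uncertainty(probs)
    order = np.argsort(scores, axis=None)[::-1][:k]
    return [
        tuple(int(i) for i in np.unravel_index(idx, scores.shape))
        for idx in order
    ]
```

In this sketch a pixel scores high either when the classifier itself is unsure or when its neighbors disagree about the label, which is the intuition the abstract describes; the actual geometric term and selection strategy would need to be taken from the paper.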
