论文信息 - Volumetric Semantic Segmentation Using Pyramid Context Features

Volumetric Semantic Segmentation Using Pyramid Context Features

We present an algorithm for the per-voxel semantic segmentation of a three-dimensional volume. At the core of our algorithm is a novel "pyramid context" feature, a descriptive representation designed such that exact per-voxel linear classification can be made extremely efficient. This feature not only allows for efficient semantic segmentation but enables other aspects of our algorithm, such as novel learned features and a stacked architecture that can reason about self-consistency. We demonstrate our technique on 3D fluorescence microscopy data of Drosophila embryos for which we are able to produce extremely accurate semantic segmentations in a matter of minutes, and for which other algorithms fail due to the size and high-dimensionality of the data, or due to the difficulty of the task.

[1] P Perona,et al. Preattentive texture discrimination with early vision mechanisms. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[2] Prof. Dr. José A. Campos-Ortega,et al. The Embryonic Development of Drosophila melanogaster , 1997, Springer Berlin Heidelberg.

[3] Jitendra Malik,et al. Recognizing surfaces using three-dimensional textons , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4] Jitendra Malik,et al. Geometric blur for template matching , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[6] Jitendra Malik,et al. Shape matching and object recognition using low distortion correspondences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8] Jianguo Zhang,et al. The PASCAL Visual Object Classes Challenge , 2006 .

[9] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[10] Sylvain Paris,et al. Real-time edge-aware image processing with the bilateral grid , 2007, ACM Trans. Graph..

[11] Jiawen Chen,et al. Real-time edge-aware image processing with the bilateral grid , 2007, SIGGRAPH 2007.

[12] Charless C. Fowlkes,et al. A Quantitative Spatiotemporal Atlas of Gene Expression in the Drosophila Blastoderm , 2008, Cell.

[13] E. Myers,et al. A 3D Digital Atlas of C. elegans and Its Application To Single-Cell Analyses , 2009, Nature Methods.

[14] Koen E. A. van de Sande,et al. Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Ronen Basri,et al. Co-clustering of image segments using convex optimization applied to EM neuronal reconstruction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16] Vincent Lepetit,et al. DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Jitendra Malik,et al. Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[18] Cristian Sminchisescu,et al. Object Recognition by Sequential Figure-Ground Ranking , 2011, International Journal of Computer Vision.

[19] Joost van de Weijer,et al. Harmony Potentials , 2011, International Journal of Computer Vision.

[20] Andrew Y. Ng,et al. The Importance of Encoding Versus Training with Sparse Coding and Vector Quantization , 2011, ICML.

[21] Guan-Yu Chen,et al. Three-Dimensional Reconstruction of Brain-wide Wiring Networks in Drosophila at Single-Cell Resolution , 2011, Current Biology.

[22] Joost van de Weijer,et al. Fusing Global and Local Scale for Semantic Image Segmentation , 2011 .

[23] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[24] Vladlen Koltun,et al. Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[25] Eric L. Miller,et al. Segmentation fusion for connectomics , 2011, 2011 International Conference on Computer Vision.

[26] Ullrich Köthe,et al. Globally Optimal Closed-Surface Segmentation for Connectomics , 2012, ECCV.

[27] Jitendra Malik,et al. Semantic segmentation using regions and parts , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Cristian Sminchisescu,et al. Semantic Segmentation with Second-Order Pooling , 2012, ECCV.

[29] François Fleuret,et al. Exact Acceleration of Linear Object Detectors , 2012, ECCV.