Culling for Extreme-Scale Segmentation Volumes: A Hybrid Deterministic and Probabilistic Approach

With the rapid increase in raw volume data sizes, such as terabyte-sized microscopy volumes, the corresponding segmentation label volumes have become extremely large as well. We focus on integer label data, whose efficient representation in memory, as well as fast random data access, pose an even greater challenge than the raw image data. Often, it is crucial to be able to rapidly identify which segments are located where, whether for empty space skipping for fast rendering, or for spatial proximity queries. We refer to this process as culling. In order to enable efficient culling of millions of labeled segments, we present a novel hybrid approach that combines deterministic and probabilistic representations of label data in a data-adaptive hierarchical data structure that we call the label list tree. In each node, we adaptively encode label data using either a probabilistic constant-time access representation for fast conservative culling, or a deterministic logarithmic-time access representation for exact queries. We choose the best data structures for representing the labels of each spatial region while building the label list tree. At run time, we further employ a novel query-adaptive culling strategy. While filtering a query down the tree, we prune it successively, and in each node adaptively select the representation that is best suited for evaluating the pruned query, depending on its size. We show an analysis of the efficiency of our approach with several large data sets from connectomics, including a brain scan with more than 13 million labeled segments, and compare our method to conventional culling approaches. Our approach achieves significant reductions in storage size as well as faster query times.

[1]  Amelio Vázquez Reina,et al.  Large-Scale Automatic Reconstruction of Neuronal Processes from Electron Microscopy Images , 2013, Medical Image Anal..

[2]  Roberto Scopigno,et al.  Multiresolution volume visualization with a texture-based octree , 2001, The Visual Computer.

[3]  Markus Hadwiger,et al.  State‐of‐the‐Art in GPU‐Based Large‐Scale Volume Visualization , 2015, Comput. Graph. Forum.

[4]  Markus Hadwiger,et al.  High-quality two-level volume rendering of segmented data sets on consumer graphics hardware , 2003, IEEE Visualization, 2003. VIS 2003..

[5]  Arie E. Kaufman,et al.  Towards a comprehensive volume visualization system , 1992, Proceedings Visualization '92.

[6]  Mario Botsch,et al.  Feature sensitive surface extraction from volume data , 2001, SIGGRAPH.

[7]  William Pugh,et al.  Skip Lists: A Probabilistic Alternative to Balanced Trees , 1989, WADS.

[8]  Ricardo S. Avila,et al.  A hardware acceleration method for volumetric ray tracing , 1995, Proceedings Visualization '95.

[9]  Justin Chu,et al.  ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter , 2016, bioRxiv.

[10]  Bin Fan,et al.  Cuckoo Filter: Practically Better Than Bloom , 2014, CoNEXT.

[11]  Rüdiger Westermann,et al.  Accelerated volume ray-casting using texture mapping , 2001, Proceedings Visualization, 2001. VIS '01..

[12]  Sheng-Chuan Wang,et al.  The Neuron Navigator: Exploring the information pathway through the neural maze , 2011, 2011 IEEE Pacific Visualization Symposium.

[13]  Hanspeter Pfister,et al.  Compresso: Efficient Compression of Segmentation Data for Connectomics , 2017, MICCAI.

[14]  Luca Deri,et al.  Roaring bitmaps: Implementation of an optimized software library , 2017, Softw. Pract. Exp..

[15]  Christopher Williamson,et al.  Dynamic queries for information exploration: an implementation and evaluation , 1992, CHI.

[16]  John Shalf,et al.  Query-driven visualization of large data sets , 2005, VIS 05. IEEE Visualization, 2005..

[17]  Steven K. Feiner,et al.  Computer graphics: principles and practice (2nd ed.) , 1990 .

[18]  Xing Mei,et al.  Simple Empty-Space Removal for Interactive Volume Rendering , 2008, J. Graph. Tools.

[19]  Arie Shoshani,et al.  Breaking the Curse of Cardinality on Bitmap Indexes , 2008, SSDBM.

[20]  Kalpathi R. Subramanian,et al.  Applying space subdivision techniques to volume rendering , 1990, Proceedings of the First IEEE Conference on Visualization: Visualization `90.

[21]  Li Fan,et al.  Summary cache: a scalable wide-area web cache sharing protocol , 2000, TNET.

[22]  Markus Hadwiger,et al.  ConnectomeExplorer: Query-Guided Visual Analysis of Large Volumetric Neuroscience Data , 2013, IEEE Transactions on Visualization and Computer Graphics.

[23]  Markus Hadwiger,et al.  SparseLeap: Efficient Empty Space Skipping for Large-Scale Volume Rendering , 2018, IEEE Transactions on Visualization and Computer Graphics.

[24]  Markus Hadwiger,et al.  Real-time volume graphics , 2006, Eurographics.

[25]  Stefan Bruckner,et al.  BrainGazer - Visual Queries for Neurobiology Research , 2009, IEEE Transactions on Visualization and Computer Graphics.

[26]  Owen Kaser,et al.  Consistently faster and smaller compressed bitmaps with Roaring , 2016, Softw. Pract. Exp..

[27]  Owen Kaser,et al.  Better bitmap performance with Roaring bitmaps , 2014, Softw. Pract. Exp..

[28]  Burton H. Bloom,et al.  Space/time trade-offs in hash coding with allowable errors , 1970, CACM.

[29]  Markus Hadwiger,et al.  Real‐Time Ray‐Casting and Advanced Shading of Discrete Isosurfaces , 2005, Comput. Graph. Forum.

[30]  Bernd Hamann,et al.  Multiresolution techniques for interactive texture-based volume visualization , 1999, Proceedings Visualization '99 (Cat. No.99CB37067).

[31]  Steven F. Roth,et al.  An interactive visual query environment for exploring data , 1997, UIST '97.

[32]  Yoonsuck Choe,et al.  Fast macro-scale transmission imaging of microvascular networks using KESM , 2011, Biomedical optics express.

[33]  Eva Dyllong,et al.  A modified reliable distance algorithm for octree‐encoded objects , 2007 .

[34]  Enrico Gobbetti,et al.  A single-pass GPU ray casting framework for interactive out-of-core rendering of massive volumetric datasets , 2008, The Visual Computer.

[35]  William R. Gray Roncal,et al.  Saturated Reconstruction of a Volume of Neocortex , 2015, Cell.

[36]  Michael A. Bender,et al.  Don't Thrash: How to Cache Your Hash on Flash , 2011, Proc. VLDB Endow..

[37]  Markus Hadwiger,et al.  Interactive Volume Exploration of Petascale Microscopy Data Streams Using a Visualization-Driven Virtual Memory Approach , 2012, IEEE Transactions on Visualization and Computer Graphics.

[38]  Rüdiger Westermann,et al.  Acceleration techniques for GPU-based volume rendering , 2003, IEEE Visualization, 2003. VIS 2003..

[39]  Sylvain Lefebvre,et al.  GigaVoxels: ray-guided streaming for efficient and detailed voxel rendering , 2009, I3D '09.

[40]  Brian A. Wandell,et al.  Exploring connectivity of the brain's white matter with dynamic queries , 2005, IEEE Transactions on Visualization and Computer Graphics.

[41]  Markus Hadwiger,et al.  Exploring the Connectome: Petascale Volume Visualization of Microscopy Data Streams , 2013, IEEE Computer Graphics and Applications.