Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney

The performance of machine learning algorithms used for the segmentation of 3D biomedical images lags behind that of the algorithms employed in the classification of 2D photos. This may be explained by the comparative lack of high-volume, high-quality training datasets, which require state-of-the art imaging facilities, domain experts for annotation and large computational and personal resources to create. The HR-Kidney dataset presented in this work bridges this gap by providing 1.7 TB of artefact-corrected synchrotron radiation-based X-ray phase-contrast microtomography images of whole mouse kidneys and validated segmentations of 33 729 glomeruli, which represents a 1-2 orders of magnitude increase over currently available biomedical datasets. The dataset further contains the underlying raw data, classical segmentations of renal vasculature and uriniferous tubules, as well as true 3D manual annotations. By removing limits currently imposed by small training datasets, the provided data open up the possibility for disruptions in machine learning for biomedical image analysis.

[1]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[2]  D. Smeak,et al.  In vitro holding security of six friction knots used as a first throw in the creation of a vascular ligation. , 2014, Journal of the American Veterinary Medical Association.

[3]  Steven G. Johnson,et al.  The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.

[4]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[5]  Thomas Brox,et al.  3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation , 2016, MICCAI.

[6]  Florian Jung,et al.  Evaluation of prostate segmentation algorithms for MRI: The PROMISE12 challenge , 2014, Medical Image Anal..

[7]  Stéphane Mallat,et al.  Invariant Scattering Convolution Networks , 2012, IEEE transactions on pattern analysis and machine intelligence.

[8]  Richard C. Pais,et al.  The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans. , 2011, Medical physics.

[9]  M. Reyes,et al.  Cutting-edge microangio-CT: new dimensions in vascular imaging and kidney morphometry. , 2018, American journal of physiology. Renal physiology.

[10]  D. Attwell,et al.  Capillary pericytes regulate cerebral blood flow in health and disease , 2014, Nature.

[11]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[12]  S. Wilkins,et al.  Simultaneous phase and amplitude extraction from a single defocused image of a homogeneous object , 2002, Journal of microscopy.

[13]  Johannes E. Schindelin,et al.  Fiji: an open-source platform for biological-image analysis , 2012, Nature Methods.

[14]  P. Rüegsegger,et al.  A new method for the model‐independent assessment of thickness in three‐dimensional images , 1997 .

[15]  Manuel Guizar-Sicairos,et al.  Efficient subpixel image registration algorithms. , 2008, Optics letters.

[16]  B. Müller,et al.  Evaluation of metal nanoparticle- and plastic resin-based x-ray contrast agents for kidney capillary imaging , 2019, Developments in X-Ray Tomography XII.

[17]  Jan Czogalla,et al.  The Mouse Isolated Perfused Kidney Technique. , 2016, Journal of visualized experiments : JoVE.

[19]  E. Candès,et al.  Continuous curvelet transform , 2003 .

[20]  Fionn Murtagh,et al.  Gray and color image contrast enhancement by the curvelet transform , 2003, IEEE Trans. Image Process..

[21]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[22]  John G. Sled,et al.  A perfusion procedure for imaging of the mouse cerebral vasculature by X-ray micro-CT , 2014, Journal of Neuroscience Methods.

[23]  Max Welling,et al.  3D Steerable CNNs: Learning Rotationally Equivariant Features in Volumetric Data , 2018, NeurIPS.

[24]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[25]  Emmanuel Brun,et al.  PyHST2: an hybrid distributed code for high speed tomographic reconstruction with iterative reconstruction and a priori knowledge capabilities , 2013, ArXiv.

[26]  Max Welling,et al.  Steerable CNNs , 2016, ICLR.