Patch Clustering for Representation of Histopathology Images

Whole slide imaging (WSI) has become an important topic during the last decade. Although significant progress has been made in both medical image processing and computational resources, several problems in WSI remain unsolved. A major challenge is the scan size: the dimensions of digitized tissue samples may exceed 100,000 by 100,000 pixels, which creates memory and efficiency obstacles for real-time processing. The main contribution of this work is to represent a WSI by a small number of selected patches for algorithmic processing (e.g., indexing and search). As a result, we reduced search time and storage by $50\%$ to $90\%$ while losing only a few percentage points in patch retrieval accuracy. A self-organizing map (SOM) was applied to local binary patterns (LBP) and deep features of the KimiaPath24 dataset in order to cluster patches that share the same characteristics. We then used a Gaussian mixture model (GMM) to represent each class with a rather small portion ($10\%$ to $50\%$) of its patches. The results showed that LBP features can outperform deep features. By selecting only $50\%$ of all patches after SOM clustering and GMM patch selection, we obtained $65\%$ accuracy for retrieval of the best match, whereas the maximum accuracy (using all patches) was $69\%$.
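
The cluster-then-select idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the feature vectors are synthetic stand-ins for LBP or deep features, the SOM is a small NumPy version, and a single diagonal Gaussian per SOM cluster is used as a simplified stand-in for the GMM-based selection. The function names (`train_som`, `assign_clusters`, `select_patches`), the grid size, and the 50% keep fraction are all assumptions made for illustration.

```python
import numpy as np


def train_som(features, grid=(3, 3), epochs=20, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small SOM; returns node weights of shape (rows * cols, dim)."""
    rng = np.random.default_rng(seed)
    rows, cols = grid
    # Grid coordinates of every node, used by the neighbourhood function.
    coords = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialise node weights from randomly chosen samples.
    weights = features[rng.choice(len(features), rows * cols, replace=True)].astype(float)
    n_steps = epochs * len(features)
    step = 0
    for _ in range(epochs):
        for i in rng.permutation(len(features)):
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3     # shrinking neighbourhood radius
            x = features[i]
            # Best-matching unit: the node whose weights are closest to x.
            bmu = np.argmin(((weights - x) ** 2).sum(axis=1))
            grid_dist2 = ((coords - coords[bmu]) ** 2).sum(axis=1)
            h = np.exp(-grid_dist2 / (2.0 * sigma ** 2))
            weights += lr * h[:, None] * (x - weights)
            step += 1
    return weights


def assign_clusters(features, weights):
    """Label each patch with the index of its best-matching SOM node."""
    d2 = ((features[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def select_patches(features, labels, keep_fraction=0.5):
    """Keep the most typical patches of each cluster.

    Assumption: one diagonal Gaussian per cluster replaces the GMM here.
    """
    selected = []
    for k in np.unique(labels):
        idx = np.flatnonzero(labels == k)
        x = features[idx]
        mean = x.mean(axis=0)
        var = x.var(axis=0) + 1e-6  # small floor avoids division by zero
        log_lik = -0.5 * (((x - mean) ** 2) / var + np.log(var)).sum(axis=1)
        n_keep = max(1, int(round(keep_fraction * len(idx))))
        best = np.argsort(log_lik)[::-1][:n_keep]  # highest likelihood first
        selected.extend(idx[best].tolist())
    return sorted(selected)


# Synthetic stand-in for patch features: 120 patches, 8 dimensions, 3 groups.
rng = np.random.default_rng(1)
demo_features = np.vstack([rng.normal(c, 0.3, size=(40, 8)) for c in (0.0, 3.0, 6.0)])
som = train_som(demo_features)
labels = assign_clusters(demo_features, som)
kept = select_patches(demo_features, labels, keep_fraction=0.5)
print(f"kept {len(kept)} of {len(demo_features)} patches")
```

Indexing and search would then run only on the patches listed in `kept`, which is where the reduction in search time and storage comes from; the accuracy trade-off reported above depends on the real features and selection procedure, not on this sketch.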
