Parallel multiple instance learning for extremely large histopathology image analysis

BackgroundHistopathology images are critical for medical diagnosis, e.g., cancer and its treatment. A standard histopathology slice can be easily scanned at a high resolution of, say, 200,000×200,000 pixels. These high resolution images can make most existing imaging processing tools infeasible or less effective when operated on a single machine with limited memory, disk space and computing power.ResultsIn this paper, we propose an algorithm tackling this new emerging “big data” problem utilizing parallel computing on High-Performance-Computing (HPC) clusters. Experimental results on a large-scale data set (1318 images at a scale of 10 billion pixels each) demonstrate the efficiency and effectiveness of the proposed algorithm for low-latency real-time applications.ConclusionsThe framework proposed an effective and efficient system for extremely large histopathology image analysis. It is based on the multiple instance learning formulation for weakly-supervised learning for image classification, segmentation and clustering. When a max-margin concept is adopted for different clusters, we obtain further improvement in clustering performance.

[1]  Tony Pan,et al.  ImageMiner: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology , 2011, J. Am. Medical Informatics Assoc..

[2]  Anant Madabhushi,et al.  A boosted distance metric: application to content based image retrieval and classification of digitized histopathology , 2009, Medical Imaging.

[3]  S. Akhter,et al.  Multi-core programming , 2006 .

[4]  Ben Taskar,et al.  Probabilistic Classification and Clustering in Relational Data , 2001, IJCAI.

[5]  Richard Bowden,et al.  A boosted classifier tree for hand shape detection , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[6]  Nicolas Loménie,et al.  Point set morphological filtering and semantic spatial configuration modeling: Application to microscopic image and bio-structure analysis , 2012, Pattern Recognit..

[7]  Dale Schuurmans,et al.  Maximum Margin Clustering , 2004, NIPS.

[8]  Mrinal K. Mandal,et al.  Automated analysis and diagnosis of skin melanoma on whole slide histopathological images , 2015, Pattern Recognit..

[9]  Mohammed J. Zaki,et al.  Parallel classification for data mining on shared-memory multiprocessors , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[10]  Wen Gao,et al.  Locally Assembled Binary (LAB) feature with feature-centric cascade for fast and accurate face detection , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[12]  Yair Censor,et al.  Component averaging: An efficient iterative parallel algorithm for large and sparse unstructured problems , 2001, Parallel Comput..

[13]  Yuhua Tang,et al.  Parallization of Adaboost Algorithm through Hybrid MPI/OpenMP and Transactional Memory , 2011, 2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[14]  Zhuowen Tu,et al.  Multiple clustered instance learning for histopathology cancer image classification, segmentation and clustering , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Robert C. Wolpert,et al.  A Review of the , 1985 .

[16]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[17]  Yi-Ping Hung,et al.  Multi-class multi-instance boosting for part-based human detection , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[18]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[19]  Zhi-Hua Zhou,et al.  Multi-instance clustering with applications to multi-instance prediction , 2009, Applied Intelligence.

[20]  Zhuowen Tu,et al.  Weakly supervised histopathology cancer image segmentation and classification , 2014, Medical Image Anal..

[21]  Jacob Whitehill,et al.  Haar features for FACS AU recognition , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[22]  Ayhan Demiriz,et al.  Exploiting unlabeled data in ensemble methods , 2002, KDD.

[23]  Ben Taskar,et al.  Max-Margin Markov Networks , 2003, NIPS.

[24]  Jun Kong,et al.  Computer-aided prognosis of neuroblastoma on whole-slide images: Classification of stromal development , 2009, Pattern Recognit..

[25]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Xiaojun Qi,et al.  Incorporating multiple SVMs for automatic image annotation , 2007, Pattern Recognit..

[27]  Murat Dundar,et al.  Multiple Instance Learning for Computer Aided Diagnosis , 2006, NIPS.

[28]  Paul A. Viola,et al.  Multiple Instance Boosting for Object Detection , 2005, NIPS.

[29]  Anant Madabhushi,et al.  Automated gland and nuclei segmentation for grading of prostate and breast cancer histopathology , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[30]  Jeongjin Lee,et al.  Interactive GPU-based maximum intensity projection of large medical data sets using visibility culling based on the initial occluder and the visible block classification , 2012, Comput. Medical Imaging Graph..

[31]  Mrinal K. Mandal,et al.  Detection of melanocytes in skin histopathological images using radial line scanning , 2013, Pattern Recognit..

[32]  Nicolas Loménie,et al.  Time-efficient sparse analysis of histopathological whole slide images , 2011, Comput. Medical Imaging Graph..

[33]  Thomas G. Dietterich,et al.  Solving the Multiple Instance Problem with Axis-Parallel Rectangles , 1997, Artif. Intell..

[34]  Peter S. Pacheco Parallel programming with MPI , 1996 .

[35]  Adrien Depeursinge,et al.  Automated classification of brain tumor type in whole-slide digital pathology images using local representative tiles , 2016, Medical Image Anal..

[36]  Xiu-Shen Wei,et al.  Scalable Algorithms for Multi-Instance Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Etienne Grossmann AdaTree: Boosting a Weak Classifier into a Decision Tree , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[38]  Trevor Darrell,et al.  Constrained Convolutional Neural Networks for Weakly Supervised Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[39]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[40]  Anant Madabhushi,et al.  Automated grading of breast cancer histopathology using spectral clustering with textural and architectural image features , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[41]  Matthias Reumann,et al.  Performance of Hybrid Programming Models for Multiscale Cardiac Simulations: Preparing for Petascale Computation , 2011, IEEE Transactions on Biomedical Engineering.

[42]  Jinbo Bi,et al.  Multiple Instance Learning of Pulmonary Embolism Detection with Geodesic Distance along Vascular Structure , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Luo Si,et al.  M3IC: Maximum Margin Multiple Instance Clustering , 2009, IJCAI.

[44]  Thomas Hofmann,et al.  Support Vector Machines for Multiple-Instance Learning , 2002, NIPS.

[45]  Bidyut Baran Chaudhuri,et al.  An efficient method based on watershed and rule-based merging for segmentation of 3-D histo-pathological images , 2001, Pattern Recognit..

[46]  Wei Chu,et al.  Multi-category Classification by Soft-Max Combination of Binary Classifiers , 2003, Multiple Classifier Systems.

[47]  Fuzhen Zhuang,et al.  A parallel incremental extreme SVM classifier , 2011, Neurocomputing.

[48]  Serge J. Belongie,et al.  Simultaneous Learning and Alignment: Multi-Instance and Multi-Pose Learning ? , 2008 .

[49]  Han Xiao Towards Parallel and Distributed Computing in Large-Scale Data Mining : A Survey , 2010 .

[50]  Andrew Janowczyk,et al.  A weighted mean shift, normalized cuts initialized color gradient based geodesic active contour model: applications to histopathology image segmentation , 2010, Medical Imaging.

[51]  Theodore Kalamboukis,et al.  Applying latent semantic analysis to large-scale medical image databases , 2015, Comput. Medical Imaging Graph..

[52]  Jon D. Patrick,et al.  Research and applications: Supervised machine learning and active learning in classification of radiology reports , 2014, J. Am. Medical Informatics Assoc..

[53]  Jianghai Hu,et al.  New region feature descriptor-based image registration method , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[54]  Fabio A. González,et al.  Histopathology Image Classification Using Bag of Features and Kernel Functions , 2009, AIME.

[55]  Henning Müller,et al.  Evaluating performance of biomedical image retrieval systems - An overview of the medical image retrieval task at ImageCLEF 2004-2013 , 2015, Comput. Medical Imaging Graph..

[56]  Yixin Chen,et al.  MILES: Multiple-Instance Learning via Embedded Instance Selection , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Zhuowen Tu,et al.  Context-Constrained Multiple Instance Learning for Histopathology Image Segmentation , 2012, MICCAI.

[58]  Fei Xue,et al.  Image Class Segmentation via Conditional Random Field over Weighted Histogram Classifier , 2011, 2011 Sixth International Conference on Image and Graphics.

[59]  郭秉璋 基於Histogram of Oriented Gradients之課堂舉手辨識研究 , 2012 .

[60]  Ignacio Blanquer,et al.  A Parallel Implementation of the K Nearest Neighbours Classifier in Three Levels: Threads, MPI Processes and the Grid , 2006, VECPAR.

[61]  E. Wes Bethel,et al.  MPI-hybrid Parallelism for Volume Rendering on Large, Multi-core Systems , 2010, EGPGV@Eurographics.

[62]  Horst Bischof,et al.  An active boosting-based learning framework for real-time hand detection , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[63]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[64]  A. Madabhushi Digital pathology image analysis: opportunities and challenges. , 2009, Imaging in medicine.