Out-of-Sample Extrapolation utilizing Semi-Supervised Manifold Learning (OSE-SSL): Content Based Image Retrieval for Histopathology Images

Content-based image retrieval (CBIR) retrieves the database images most similar to a query image by (1) extracting quantitative image descriptors and (2) calculating the similarity between the database and query image descriptors. Recently, manifold learning (ML) has been used to perform CBIR in a low dimensional representation of the high dimensional image descriptor space, in order to avoid the curse of dimensionality. However, ML schemes are computationally expensive, requiring an eigenvalue decomposition (EVD) for every new query image to learn its low dimensional representation. We present out-of-sample extrapolation utilizing semi-supervised ML (OSE-SSL) to learn the low dimensional representation without recomputing the EVD for each query image. OSE-SSL incorporates semantic information, in the form of partial class labels, into an ML scheme such that the low dimensional representation co-localizes semantically similar images. In the context of prostate histopathology, gland morphology is an integral component of the Gleason score, which enables discrimination between levels of prostate cancer aggressiveness. Images are therefore represented by shape features extracted from the prostate glands. On prostate histology obtained from 58 patient studies, CBIR with OSE-SSL yielded an area under the precision-recall curve (AUPRC) of 0.53 ± 0.03; in comparison, CBIR using Principal Component Analysis (PCA) to learn the low dimensional space yielded an AUPRC of 0.44 ± 0.01.
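The idea of embedding a query without repeating the EVD can be illustrated with a generic Nyström-style out-of-sample extension. The sketch below is an assumption-laden illustration, not the OSE-SSL algorithm itself: it uses a plain Gaussian kernel on the descriptors, omits the semi-supervised label term, and all function names and parameters (`rbf_kernel`, `fit_embedding`, `embed_query`, `retrieve`, `sigma`, `n_dims`) are hypothetical. The EVD is computed once on the database; each query is then projected using only kernel evaluations against the database images.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    """Gaussian affinity between the rows of A and the rows of B."""
    d2 = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma ** 2))

def fit_embedding(X, n_dims, sigma):
    """One-time EVD of the database affinity matrix (assumed, uncentered)."""
    K = rbf_kernel(X, X, sigma)
    evals, evecs = np.linalg.eigh(K)            # eigenvalues in ascending order
    top = np.argsort(evals)[::-1][:n_dims]      # keep the largest n_dims
    evals, evecs = evals[top], evecs[:, top]
    Y = evecs * np.sqrt(evals)                  # database embedding, shape (n, n_dims)
    return Y, evals, evecs

def embed_query(q, X, evals, evecs, sigma):
    """Nystrom extension: project a query with no new EVD."""
    k = rbf_kernel(q[None, :], X, sigma)        # affinities to database, shape (1, n)
    return (k @ evecs / np.sqrt(evals)).ravel()

def retrieve(q, X, Y, evals, evecs, sigma, top_k):
    """Rank database images by Euclidean distance in the embedding."""
    yq = embed_query(q, X, evals, evecs, sigma)
    dist = np.linalg.norm(Y - yq, axis=1)
    return np.argsort(dist)[:top_k]

# Synthetic stand-in for image descriptors (not real histopathology data).
rng = np.random.default_rng(0)
X_db = rng.normal(size=(40, 5))
Y_db, lam, V = fit_embedding(X_db, n_dims=3, sigma=2.0)
query = X_db[3] + 0.01 * rng.normal(size=5)
nearest = retrieve(query, X_db, Y_db, lam, V, sigma=2.0, top_k=5)
```

A useful sanity check of this construction is that applying `embed_query` to a database image reproduces that image's row of `Y_db` up to floating-point error, since the Nyström formula coincides with the eigenvector equation on the training points.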
