Biological interpretation of morphological patterns in histopathological whole-slide images

We propose a framework for studying visual morphological patterns across histopathological whole-slide images (WSIs). Image representation is an important component of computer-aided decision support systems for histopathological cancer diagnosis. Such systems extract hundreds of quantitative image features from digitized tissue biopsy slides and produce models for prediction. The performance of these models depends on the identification of informative features for selection of appropriate regions-of-interest (ROIs) from heterogeneous WSIs and for development of models. However, identification of informative features is hindered by the semantic gap between human interpretation of visual morphological patterns and quantitative image features. We address this challenge by using data mining and information visualization tools to study spatial patterns formed by features extracted from sub-sections of WSIs. Using ovarian serous cystadenocarcinoma (OvCa) WSIs provided by the cancer genome atlas (TCGA), we show that (1) individual and (2) multivariate image features correspond to biologically relevant ROIs, and (3) supervised image feature selection can map histopathology domain knowledge to quantitative image features.

[1]  May D. Wang,et al.  Extraction of informative cell features by segmentation of densely clustered tissue images , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[2]  Tim W. Nattkemper,et al.  WHIDE—a web tool for visual data mining colocation patterns in multivariate bioimages , 2012, Bioinform..

[3]  Tim W. Nattkemper,et al.  A method for linking computed image features to histological semantics in neuropathology , 2007, J. Biomed. Informatics.

[4]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  S. Palokangas,et al.  Segmentation of Folds in Tissue Section Images , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[6]  Patrick Hurban,et al.  Application of Visualization Tools to the Analysis of Histopathological Data Enhances Biological Insight and Interpretation , 2006, Toxicologic pathology.

[7]  Tim W. Nattkemper,et al.  Multivariate image mining , 2011, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[8]  Joel H. Saltz,et al.  Integrative, Multimodal Analysis of Glioblastoma Using TCGA Molecular Data, Pathology Images, and Clinical Outcomes , 2011, IEEE Transactions on Biomedical Engineering.

[9]  Ash A. Alizadeh,et al.  Software tools for high-throughput analysis and archiving of immunohistochemistry staining data obtained with tissue microarrays. , 2002, The American journal of pathology.

[10]  J R Iglesias-Rozas,et al.  Histological heterogeneity of human glioblastomas investigated with an unsupervised neural network (SOM). , 2005, Histology and histopathology.

[11]  Anant Madabhushi,et al.  Consensus of Ambiguity: Theory and Application of Active Learning for Biomedical Image Analysis , 2010, PRIB.

[12]  Fabio A. González,et al.  Visual pattern mining in histology image collections using bag of features , 2011, Artif. Intell. Medicine.

[13]  Xiaobo Zhou,et al.  Imaging informatics for personalised medicine: applications and challenges , 2009, Int. J. Funct. Informatics Pers. Medicine.

[14]  A. Marghoob,et al.  Histologic classification of tumor-infiltrating lymphocytes in primary cutaneous malignant melanoma. A study of interobserver agreement. , 2001, American journal of clinical pathology.

[15]  May D. Wang,et al.  Histological Image Feature Mining Reveals Emergent Diagnostic Properties for Renal Cancer , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[16]  Ilya Shmulevich,et al.  Integrated Analysis of Gene Expression and Tumor Nuclear Image Profiles Associated with Chemotherapy Response in Serous Ovarian Carcinoma , 2012, PloS one.

[17]  Tommi S. Jaakkola,et al.  Fast optimal leaf ordering for hierarchical clustering , 2001, ISMB.

[18]  Bahram Parvin,et al.  Morphometic analysis of TCGA glioblastoma multiforme , 2011, BMC Bioinformatics.

[19]  Jun Kong,et al.  Integrated morphologic analysis for the identification and characterization of disease subtypes , 2012, J. Am. Medical Informatics Assoc..

[20]  Narciso Olvera,et al.  Morphologic patterns associated with BRCA1 and BRCA2 genotype in ovarian carcinoma , 2012, Modern Pathology.

[21]  José Vassallo,et al.  Digital slides: present status of a tool for consultation, teaching, and quality control in pathology. , 2009, Pathology, research and practice.

[22]  Nasir M. Rajpoot,et al.  BioIMAX: A Web 2.0 approach for easy exploratory and collaborative access to multivariate bioimage data , 2011, BMC Bioinformatics.

[23]  May D. Wang,et al.  Automated cell counting and cluster segmentation using concavity detection and ellipse fitting techniques , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[24]  Rudolf Hanka,et al.  Histological image retrieval based on semantic content analysis , 2003, IEEE Transactions on Information Technology in Biomedicine.

[25]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[26]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[27]  Todd H. Stokes,et al.  Automatic batch-invariant color segmentation of histological cancer images , 2011, 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[28]  Horace Ho-Shing Ip,et al.  Semantic content analysis and annotation of histological images , 2008, Comput. Biol. Medicine.

[29]  George M Yousef,et al.  Informatics for practicing anatomical pathologists: marking a new era in pathology practice , 2010, Modern Pathology.