An unsupervised feature learning framework for basal cell carcinoma image analysis

OBJECTIVE The paper addresses the problem of automatic detection of basal cell carcinoma (BCC) in histopathology images. In particular, it proposes a framework to both, learn the image representation in an unsupervised way and visualize discriminative features supported by the learned model. MATERIALS AND METHODS This paper presents an integrated unsupervised feature learning (UFL) framework for histopathology image analysis that comprises three main stages: (1) local (patch) representation learning using different strategies (sparse autoencoders, reconstruct independent component analysis and topographic independent component analysis (TICA), (2) global (image) representation learning using a bag-of-features representation or a convolutional neural network, and (3) a visual interpretation layer to highlight the most discriminant regions detected by the model. The integrated unsupervised feature learning framework was exhaustively evaluated in a histopathology image dataset for BCC diagnosis. RESULTS The experimental evaluation produced a classification performance of 98.1%, in terms of the area under receiver-operating-characteristic curve, for the proposed framework outperforming by 7% the state-of-the-art discrete cosine transform patch-based representation. CONCLUSIONS The proposed UFL-representation-based approach outperforms state-of-the-art methods for BCC detection. Thanks to its visual interpretation layer, the method is able to highlight discriminative tissue regions providing a better diagnosis support. Among the different UFL strategies tested, TICA-learned features exhibited the best performance thanks to its ability to capture low-level invariances, which are inherent to the nature of the problem.

[1]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[2]  Christopher Malon,et al.  Identifying histological elements with convolutional neural networks , 2008, CSTST.

[3]  Fabio A. González,et al.  Automatic detection of invasive ductal carcinoma in whole slide images with convolutional neural networks , 2014, Medical Imaging.

[4]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[5]  Nigam H. Shah,et al.  The Stanford Tissue Microarray Database , 2007, Nucleic Acids Res..

[6]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[7]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[8]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Marc'Aurelio Ranzato,et al.  Large Scale Distributed Deep Networks , 2012, NIPS.

[10]  Angel Cruz-Roa,et al.  Hybrid image representation learning model with invariant features for basal cell carcinoma detection , 2013, Other Conferences.

[11]  Wei Chu,et al.  Multi-category Classification by Soft-Max Combination of Binary Classifiers , 2003, Multiple Classifier Systems.

[12]  Aapo Hyvärinen,et al.  Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[16]  A. Madabhushi Digital pathology image analysis: opportunities and challenges. , 2009, Imaging in medicine.

[17]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[18]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[19]  Eduardo Romero,et al.  Histopathological Image Classification Using Stain Component Features on a pLSA Model , 2010, CIARP.

[20]  Aapo Hyvärinen,et al.  Topographic Independent Component Analysis , 2001, Neural Computation.

[21]  Clément Farabet,et al.  Torch7: A Matlab-like Environment for Machine Learning , 2011, NIPS 2011.

[22]  Fabio A. González,et al.  A Semantic Content-Based Retrieval Method for Histopathology Images , 2008, AIRS.

[23]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[24]  Anant Madabhushi,et al.  Multi-Field-of-View Framework for Distinguishing Tumor Grade in ER+ Breast Cancer From Entire Histopathology Slides , 2013, IEEE Transactions on Biomedical Engineering.

[25]  C. Fletcher,et al.  Diagnostic histopathology of tumors , 2013 .

[26]  Razvan Pascanu,et al.  Pylearn2: a machine learning research library , 2013, ArXiv.

[27]  Quoc V. Le,et al.  Learning invariant features of tumor signatures , 2012, 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI).

[28]  Fabio A. González,et al.  Cascaded ensemble of convolutional neural networks and handcrafted features for mitosis detection , 2014, Medical Imaging.

[29]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[30]  B. Yener,et al.  Cell-Graph Mining for Breast Tissue Modeling and Classification , 2007, 2007 29th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[31]  Maria S. Kulikova,et al.  Mitosis detection in breast cancer histological images An ICPR 2012 contest , 2013, Journal of pathology informatics.

[32]  Anant Madabhushi,et al.  AUTOMATED GRADING OF PROSTATE CANCER USING ARCHITECTURAL AND TEXTURAL IMAGE FEATURES , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[33]  Tao Wang,et al.  Deep learning with COTS HPC systems , 2013, ICML.

[34]  Angel Cruz-Roa,et al.  A framework for semantic analysis of histopathological images using nonnegative matrix factorization , 2011, 2011 6th Colombian Computing Congress (CCC).

[35]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[36]  I. Jolliffe Principal Component Analysis , 2002 .

[37]  L. Rodney Long,et al.  Histology image analysis for carcinoma detection and grading , 2012, Comput. Methods Programs Biomed..

[38]  Quoc V. Le,et al.  ICA with Reconstruction Cost for Efficient Overcomplete Feature Learning , 2011, NIPS.

[39]  Angel Cruz-Roa,et al.  Mitosis detection in breast cancer pathology images by combining handcrafted and convolutional neural network features , 2014, Journal of medical imaging.

[40]  Anant Madabhushi,et al.  A Boosting Cascade for Automated Detection of Prostate Cancer from Digitized Histology , 2006, MICCAI.

[41]  G. Weinstein,et al.  Cell proliferation in human basal cell carcinoma. , 1970, Cancer research.

[42]  Shengen Yan,et al.  Deep Image: Scaling up Image Recognition , 2015, ArXiv.

[43]  Yann LeCun,et al.  Learning Invariant Feature Hierarchies , 2012, ECCV Workshops.

[44]  Angel Cruz-Roa,et al.  Automatic annotation of histopathological images using a latent topic model based on non-negative matrix factorization , 2011, Journal of pathology informatics.

[45]  Eduardo Romero,et al.  A supervised visual model for finding regions of interest in basal cell carcinoma images , 2011, Diagnostic pathology.

[46]  Jun Kong,et al.  Computer-aided prognosis of neuroblastoma on whole-slide images: Classification of stromal development , 2009, Pattern Recognit..

[47]  Qinmu Peng,et al.  Cell Nucleus Segmentation in Color Histopathological Imagery Using Convolutional Networks , 2010, 2010 Chinese Conference on Pattern Recognition (CCPR).

[48]  David J. Field,et al.  What The Statistics Of Natural Images Tell Us About Visual Coding , 1989, Photonics West - Lasers and Applications in Science and Engineering.

[49]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Fabio A. González,et al.  A Visual Latent Semantic Approach for Automatic Analysis and Interpretation of Anaplastic Medulloblastoma Virtual Slides , 2012, MICCAI.

[51]  Fabio A. González,et al.  Histopathology Image Classification Using Bag of Features and Kernel Functions , 2009, AIME.

[52]  R Albert,et al.  Three-dimensional image processing for morphometric analysis of epithelium sections. , 1992, Cytometry.

[53]  B. S. Manjunath,et al.  Object- and spatial-level quantitative analysis of multispectral histopathology images for detection and characterization of cancer , 2008 .

[54]  Joachim M. Buhmann,et al.  Computational Pathology: Challenges and Promises for Tissue Analysis , 2015, Comput. Medical Imaging Graph..

[55]  Eduardo Romero,et al.  Micro‐structural tissue analysis for automatic histopathological image annotation , 2012, Microscopy research and technique.

[56]  Eric Cosatto,et al.  Classification of mitotic figures with convolutional neural networks and seeded blob features , 2013, Journal of pathology informatics.

[57]  Dong Yu,et al.  Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.

[58]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59]  Nasir M. Rajpoot,et al.  Unsupervised Learning of Shape Manifolds , 2007, BMVC.

[60]  Fabio A. González,et al.  Visual pattern mining in histology image collections using bag of features , 2011, Artif. Intell. Medicine.

[61]  Honglak Lee,et al.  Sparse deep belief net model for visual area V2 , 2007, NIPS.

[62]  Fabio A. González,et al.  A Deep Learning Architecture for Image Representation, Visual Interpretability and Automated Basal-Cell Carcinoma Cancer Detection , 2013, MICCAI.

[63]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[64]  Luca Maria Gambardella,et al.  Mitosis Detection in Breast Cancer Histology Images with Deep Neural Networks , 2013, MICCAI.

[65]  A. Oseroff,et al.  Basal cell and squamous cell skin cancers. , 2010, Journal of the National Comprehensive Cancer Network : JNCCN.

[66]  Quoc V. Le,et al.  Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis , 2011, CVPR 2011.

[67]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.