Dimensionality Reduction Strategies for CNN-Based Classification of Histopathological Images

Features from pre-trained Convolutional Neural Networks (CNNs) have proven effective for many tasks, such as object, scene and face recognition. Compared with traditional, hand-designed image descriptors, CNN-based features typically have higher dimensionality. In applications where the number of samples may be limited, as is the case with histopathological images, high dimensionality can cause overfitting and introduce redundancy in the information to be processed and stored. Feature reduction methods can mitigate these problems, at the cost of a moderate reduction in discrimination accuracy. In this paper we investigate dimensionality reduction schemes for CNN-based features applied to computer-assisted classification of histopathological images, with the aim of finding the best trade-off between accuracy and dimensionality. Specifically, we test two well-known techniques (Principal Component Analysis and Gaussian Random Projection) and propose a novel reduction strategy based on the cross-correlation between the components of the feature vector. The results show that CNN-based features can be reduced by a high ratio with only a moderate decrease in accuracy relative to the original, full-length features.
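As an illustration of the three kinds of reduction named above, the following is a minimal NumPy sketch, not the implementation used in the paper. The function names, the greedy pruning rule, the 0.9 threshold and the synthetic data are all assumptions introduced here; in particular, the abstract does not specify how the cross-correlation strategy selects components, so `correlation_prune` is only one plausible reading of it.

```python
import numpy as np


def pca_reduce(X, k):
    """Project the rows of X onto the top-k principal components."""
    Xc = X - X.mean(axis=0)  # centre each feature
    # Rows of Vt are the principal directions, ordered by decreasing variance.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T


def gaussian_random_projection(X, k, seed=0):
    """Project X onto k random directions with i.i.d. N(0, 1/k) entries."""
    rng = np.random.default_rng(seed)
    R = rng.normal(0.0, 1.0 / np.sqrt(k), size=(X.shape[1], k))
    return X @ R


def correlation_prune(X, threshold=0.9):
    """Hypothetical greedy rule (an assumption, not the paper's method):
    keep a feature only if its absolute Pearson correlation with every
    feature kept so far is below `threshold`."""
    C = np.abs(np.corrcoef(X, rowvar=False))  # feature-by-feature correlations
    kept = []
    for j in range(X.shape[1]):
        # Constant features yield NaN correlations and are therefore dropped.
        if all(C[j, i] < threshold for i in kept):
            kept.append(j)
    return X[:, kept], kept


# Synthetic stand-in for CNN features: 60 samples, 8 informative features,
# plus 8 near-duplicates that are almost perfectly correlated with them.
rng = np.random.default_rng(42)
base = rng.normal(size=(60, 8))
X = np.hstack([base, base + 0.01 * rng.normal(size=(60, 8))])

Z_pca = pca_reduce(X, k=4)
Z_grp = gaussian_random_projection(X, k=4)
X_pruned, kept = correlation_prune(X, threshold=0.9)
```

On this toy data the pruning step discards the eight near-duplicate columns and keeps the eight original ones. Unlike PCA and random projection, which mix all input components, a selection rule of this kind retains a subset of the original features, so the reduced vector stays directly interpretable in terms of the CNN outputs.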
