Semi-Supervised Robust Mixture Models in RKHS for Abnormality Detection in Medical Images

Abnormality detection in medical images is a one-class classification problem for which existing methods typically involve variants of kernel principal component analysis or one-class support vector machines. However, existing methods rely on highly-curated training sets with full supervision, often using heuristics for model fitting or ignore the variances of the data within principal subspaces. In contrast, we propose novel methods that can work with imperfectly curated datasets using robust statistical learning, by extending the multivariate generalized-Gaussian distribution to a reproducing kernel Hilbert space (RKHS) and employing it within a mixture model. We propose a novel semi-supervised extension of our learning scheme, showing that a small amount of expert feedback through high-quality labeled data of the outlier class can boost performance. We propose expectation maximization for our semi-supervised robust mixture-model learning in RKHS, using solely the Gram matrix and without the explicit lifting map. Our methods incorporate optimal component means, principal directions, and variances for abnormality detection. Results on four large public datasets on retinopathy and cancer, compared against a variety of contemporary methods, show that our method gives benefits over the state of the art in one-class classification for abnormality detection.

[1]  Philippe Forster,et al.  Covariance Structure Maximum-Likelihood Estimates in Compound Gaussian Noise: Existence and Algorithm Analysis , 2008, IEEE Transactions on Signal Processing.

[2]  Joachim M. Buhmann,et al.  A minimum entropy approach to adaptive image polygonization , 2003, IEEE Trans. Image Process..

[3]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[4]  Richard M. Leahy,et al.  An Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Shenghua Gao,et al.  Multi-Cell Multi-Task Convolutional Neural Networks for Diabetic Retinopathy Grading , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  Benjamin Thyreau,et al.  Detecting outliers in high-dimensional neuroimaging datasets with robust covariance estimators , 2012, Medical Image Anal..

[7]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[8]  Achim Tresch,et al.  Automatic post-picking using MAPPOS improves particle image detection from Cryo-EM micrographs , 2011, Journal of structural biology.

[9]  Alexander Binder,et al.  Deep One-Class Classification , 2018, ICML.

[10]  Fei-Fei Li,et al.  Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[12]  Anil K. Jain,et al.  Markov Random Field Texture Models , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Subhransu Maji,et al.  Deep filter banks for texture recognition and segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Jitendra Malik,et al.  Learning to detect natural image boundaries using local brightness, color, and texture cues , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Haluk Derin,et al.  Modeling and Segmentation of Noisy and Textured Images Using Gibbs Random Fields , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  R. Maronna Robust $M$-Estimators of Multivariate Location and Scatter , 1976 .

[18]  Paul F. Whelan,et al.  Using filter banks in Convolutional Neural Networks for texture classification , 2016, Pattern Recognit. Lett..

[19]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[20]  Gang Hua,et al.  Learning Discriminative Reconstructions for Unsupervised Outlier Removal , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21]  Suyash P. Awate,et al.  A Bayesian Neural Net to Segment Images with Uncertainty Estimates and Good Calibration , 2019, IPMI.

[22]  Nitin Kumar,et al.  Kernel Generalized-Gaussian Mixture Model for Robust Abnormality Detection , 2017, MICCAI.

[23]  Andrew Zisserman,et al.  A Statistical Approach to Material Classification Using Image Patch Exemplars , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[25]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[26]  Leon A. Gatys,et al.  Texture Synthesis Using Convolutional Neural Networks , 2015, NIPS.

[27]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Nitin Kumar,et al.  Kernel generalized Gaussian and robust statistical learning for abnormality detection in medical images , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[29]  Suyash P. Awate,et al.  Sparse Kernel PCA for Outlier Detection , 2018, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA).

[30]  S. N. Merchant,et al.  MS-Net: Mixed-Supervision Fully-Convolutional Networks for Full-Resolution Segmentation , 2018, MICCAI.

[31]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Heiko Hoffmann,et al.  Kernel PCA for novelty detection , 2007, Pattern Recognit..

[33]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[34]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[35]  Tülay Adali,et al.  A Complex Generalized Gaussian Distribution— Characterization, Generation, and Estimation , 2010, IEEE Transactions on Signal Processing.

[36]  Tim Verdonck,et al.  Robust kernel principal component analysis and classification , 2010, Adv. Data Anal. Classif..

[37]  Randy C. Paffenroth,et al.  Anomaly Detection with Robust Deep Autoencoders , 2017, KDD.

[38]  Marco Loog,et al.  Robust semi-supervised least squares classification by implicit constraints , 2015, Pattern Recognit..

[39]  Michael Unser,et al.  Texture classification and segmentation using wavelet frames , 1995, IEEE Trans. Image Process..

[40]  H. Vincent Poor,et al.  Complex Elliptically Symmetric Distributions: Survey, New Results and Applications , 2012, IEEE Transactions on Signal Processing.

[41]  Marco Loog,et al.  Semi-Supervised Nearest Mean Classification Through a Constrained Log-Likelihood , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[42]  Stéphane Mallat,et al.  Invariant Scattering Convolution Networks , 2012, IEEE transactions on pattern analysis and machine intelligence.

[43]  Francesca Bovolo,et al.  Semisupervised One-Class Support Vector Machines for Classification of Remote Sensing Data , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[45]  Suyash P. Awate,et al.  Multiatlas Segmentation as Nonparametric Regression , 2014, IEEE Transactions on Medical Imaging.

[46]  Yongmin Li,et al.  On incremental and robust subspace learning , 2004, Pattern Recognit..

[47]  Yan-Wei Pang,et al.  An Iterative Algorithm for Robust Kernel Principal Component Analysis , 2007, 2007 International Conference on Machine Learning and Cybernetics.

[48]  David Elworthy,et al.  Does Baum-Welch Re-estimation Help Taggers? , 1994, ANLP.

[49]  Sanjay Chawla,et al.  Robust, Deep and Inductive Anomaly Detection , 2017, ECML/PKDD.

[50]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[51]  Nojun Kwak,et al.  Principal Component Analysis by $L_{p}$ -Norm Maximization , 2014, IEEE Transactions on Cybernetics.

[52]  Mikhail Belkin,et al.  Laplacian Support Vector Machines Trained in the Primal , 2009, J. Mach. Learn. Res..

[53]  Lior Shamir,et al.  CHLOE: A Software Tool for Automatic Novelty Detection in Microscopy Image Datasets , 2014 .

[54]  Wenkai Li,et al.  A Positive and Unlabeled Learning Algorithm for One-Class Classification of Remote-Sensing Data , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[55]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[56]  Chunmei Zhang,et al.  Adaptive robust kernel PCA algorithm , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[57]  Yu-Hong Dai,et al.  Fast Algorithms for Projection on an Ellipsoid , 2006, SIAM J. Optim..

[58]  Suyash P. Awate,et al.  Bayesian Segmentation of Atrium Wall Using Globally-Optimal Graph Cuts on 3D Meshes , 2013, IPMI.

[59]  Yair Weiss,et al.  Segmentation using eigenvectors: a unifying view , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[60]  Francesca Odone,et al.  Histogram intersection kernel for image classification , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[61]  Xiaogang Wang,et al.  Zoom-in-Net: Deep Mining Lesions for Diabetic Retinopathy Detection , 2017, MICCAI.

[62]  Suyash P. Awate,et al.  Unsupervised Texture Segmentation with Nonparametric Neighborhood Statistics , 2006, ECCV.

[63]  Jürgen Weese,et al.  Four challenges in medical image analysis from an industrial perspective , 2016, Medical Image Anal..

[64]  Robert D. Nowak,et al.  Unsupervised progressive parsing of Poisson fields using minimum description length criteria , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[65]  Konstantin Bauman,et al.  One-Class Semi-supervised Learning , 2017, Braverman Readings in Machine Learning.