Learning Low-dimensional Manifolds for Scoring of Tissue Microarray Images

Tissue microarray (TMA) images have emerged as an important highthroughput tool for cancer study and the validation of biomarkers. Efforts have been dedicated to further improve the accuracy of TACOMA, a cutting-edge automatic scoring algorithm for TMA images. One major advance is due to deepTacoma, an algorithm that incorporates suitable deep representations of a group nature. Inspired by the recent advance in semi-supervised learning and deep learning, we proposemfTacoma to learn alternative deep representations in the context of TMA image scoring. In particular, mfTacoma learns the low-dimensional manifolds, a common latent structure in high dimensional data. Deep representation learning and manifold learning typically requires large data. By encoding deep representation of the manifolds as regularizing features, mfTacoma effectively leverages the manifold information that is potentially crude due to small data. Our experiments show that deep features by manifolds outperforms two alternatives—deep features by linear manifolds with principal component analysis or by leveraging the group property.

[1]  Haitao Zhao,et al.  A novel incremental principal component analysis and its application for face recognition , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[3]  Rich Caruana,et al.  An empirical evaluation of supervised learning in high dimensions , 2008, ICML '08.

[4]  Robert Pless,et al.  A Survey of Manifold Learning for Images , 2009, IPSJ Trans. Comput. Vis. Appl..

[5]  P. Furmanski,et al.  A rapid and efficient method for testing immunohistochemical reactivity of monoclonal antibodies against multiple tissue samples simultaneously. , 1987, Journal of immunological methods.

[6]  P. Bickel,et al.  Sparsity and the Possibility of Inference , 2008 .

[7]  J. Giltnane,et al.  Technology Insight: identification of biomarkers with tissue microarray technology , 2004, Nature Clinical Practice Oncology.

[8]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[9]  David L Rimm,et al.  Subjective differences in outcome are seen as a function of the immunohistochemical method used on a colorectal cancer tissue microarray. , 2002, Clinical colorectal cancer.

[10]  X. Huo,et al.  A Survey of Manifold-Based Learning Methods , 2007 .

[11]  Fatima Soliman,et al.  Imaging genetics and development: Challenges and promises , 2010, Human brain mapping.

[12]  Spyro Mousses,et al.  Clinical validation of candidate genes associated with prostate cancer progression in the CWR22 model system using tissue microarrays. , 2002, Cancer research.

[13]  Hung Chiang,et al.  Interobserver reproducibility of Her-2/neu protein overexpression in invasive breast carcinoma using the DAKO HercepTest. , 2002, American journal of clinical pathology.

[14]  Francesca M. Buffa,et al.  Multiple biomarker tissue microarrays: bioinformatics and practical approaches , 2008, Cancer and Metastasis Reviews.

[15]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[16]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[17]  Hongyuan Zha,et al.  Principal Manifolds and Nonlinear Dimension Reduction via Local Tangent Space Alignment , 2002, ArXiv.

[18]  Peng Gong,et al.  A comparison of spatial feature extraction algorithms for land-use classification with SPOT HRV data , 1992 .

[19]  D. Donoho,et al.  Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Johannes Schmidt-Hieber,et al.  Nonparametric regression using deep neural networks with ReLU activation function , 2017, The Annals of Statistics.

[21]  D. Rimm,et al.  A decade of tissue microarrays: progress in the discovery and validation of cancer biomarkers. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[22]  John J Spinelli,et al.  HER-2/neu in Breast Cancer: Interobserver Variability and Performance of Immunohistochemistry with 4 Antibodies Compared with Fluorescent In Situ Hybridization , 2001, Modern Pathology.

[23]  O. Cussenot,et al.  Tissue microarray analysis of the prognostic value of E-cadherin, Ki67, p53, p27, survivin and MSH2 expression in upper urinary tract transitional cell carcinoma. , 2005, European urology.

[24]  R M Levenson,et al.  Quantification of immunohistochemistry—issues concerning methods, utility and semiquantitative assessment II , 2006, Histopathology.

[25]  Shing-Tung Yau,et al.  Geometric Understanding of Deep Learning , 2018, ArXiv.

[26]  Andrew H. Beck,et al.  Systematic Analysis of Breast Cancer Morphology Uncovers Stromal Features Associated with Survival , 2011, Science Translational Medicine.

[27]  Stanley Osher,et al.  Low Dimensional Manifold Model for Image Processing , 2017, SIAM J. Imaging Sci..

[28]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[29]  Lawrence Cayton,et al.  Algorithms for manifold learning , 2005 .

[30]  D. Rimm,et al.  Automated subcellular localization and quantification of protein expression in tissue microarrays , 2002, Nature Medicine.

[31]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[32]  M. Meilă,et al.  Non-linear dimensionality reduction: Riemannian metric estimation and the problem of geometric discovery , 2013, 1305.7255.

[33]  J. Kononen,et al.  Tissue microarrays for high-throughput molecular profiling of tumor specimens , 1998, Nature Medicine.

[34]  Jian Zou,et al.  Incorporating Deep Features in the Analysis of Tissue Microarray Images , 2018, Statistics and its interface.

[35]  R. Walker,et al.  Quantification of immunohistochemistry—issues concerning methods, utility and semiquantitative assessment I , 2006, Histopathology.

[36]  T. Nielsen,et al.  Tissue microarrays in clinical oncology. , 2008, Seminars in radiation oncology.

[37]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[38]  Cyrus Shahabi,et al.  Real-time Pattern Isolation and Recognition Over Immersive Sensor Data Streams , 2003, MMM.

[39]  Jonathon Shlens,et al.  A Tutorial on Principal Component Analysis , 2014, ArXiv.

[40]  Hans Vrolijk,et al.  Automated acquisition of stained tissue microarrays for high-throughput evaluation of molecular targets. , 2003, The Journal of molecular diagnostics : JMD.

[41]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[42]  Susan Holmes,et al.  An Interactive Java Statistical Image Segmentation System: GemIdent. , 2009, Journal of statistical software.

[43]  K. A. DiVito,et al.  Tissue microarrays – automated analysis and future directions , 2005 .

[44]  M.,et al.  Statistical and Structural Approaches to Texture , 2022 .

[45]  David L. Neuhoff,et al.  Quantization , 2022, IEEE Trans. Inf. Theory.

[46]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[47]  Pei Wang,et al.  Statistical Methods for Tissue Array Images - Algorithmic Scoring and Co-training. , 2011, The annals of applied statistics.

[48]  M. Basik,et al.  Tissue microarrays: emerging standard for biomarker validation. , 2008, Current opinion in biotechnology.

[49]  Nigam H. Shah,et al.  The Stanford Tissue Microarray Database , 2007, Nucleic Acids Res..

[50]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[51]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[52]  B. Nadler,et al.  Diffusion Maps - a Probabilistic Interpretation for Spectral Embedding and Clustering Algorithms , 2008 .

[53]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[54]  Peter M. Atkinson,et al.  A comparison of texture measures for the per-field classification of Mediterranean land cover , 2004 .

[55]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[56]  Chinmay Hegde,et al.  Random Projections for Manifold Learning , 2007, NIPS.

[57]  P. Thompson,et al.  Multilocus Genetic Analysis of Brain Images , 2011, Front. Gene..

[58]  Victor G Prieto,et al.  Automated quantitative analysis of activator protein-2alpha subcellular expression in melanoma tissue microarrays correlates with survival prediction. , 2005, Cancer research.