Virtual Microscopy and Grid-Enabled Decision Support for Large-Scale Analysis of Imaged Pathology Specimens

Breast cancer accounts for about 30% of all cancers and 15% of cancer deaths in women. Advances in computer-assisted analysis hold promise for classifying subtypes of disease and improving prognostic accuracy. We introduce a grid-enabled decision support system for performing automatic analysis of imaged breast tissue microarrays. To date, we have processed more than 1 00 000 digitized specimens (1200 times 1200 pixels each) on IBM's World Community Grid (WCG). As a part of the Help Defeat Cancer (HDC) project, we have analyzed that the data returned from WCG along with retrospective patient clinical profiles for a subset of 3744 breast tissue samples, and have reported the results in this paper. Texture-based features were extracted from the digitized specimens, and isometric feature mapping was applied to achieve nonlinear dimension reduction. Iterative prototyping and testing were performed to classify several major subtypes of breast cancer. Overall, the most reliable approach was gentle AdaBoost using an eight-node classification and regression tree as the weak learner. Using the proposed algorithm, a binary classification accuracy of 89% and the multiclass accuracy of 80% were achieved. Throughout the course of the experiments, only 30% of the dataset was used for training.

[1]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[2]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[3]  David J. Foran,et al.  A prototype for unsupervised analysis of tissue microarrays for cancer research and diagnostics , 2004, IEEE Transactions on Information Technology in Biomedicine.

[4]  J. Kononen,et al.  Tissue microarrays for high-throughput molecular profiling of tumor specimens , 1998, Nature Medicine.

[5]  M. Salto‐Tellez,et al.  Reliability of Tissue Microarrays in Detecting Protein Expression and Gene Amplification in Breast Cancer , 2003, Modern Pathology.

[6]  D. Feng,et al.  IEEE transactions on information technology in biomedicine: special issue on advances in clinical and health-care knowledge management , 2005 .

[7]  Bohn Stafleu van Loghum,et al.  Online … , 2002, LOG IN.

[8]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[9]  U. G. Dailey Cancer,Facts and Figures about. , 2022, Journal of the National Medical Association.

[10]  Robert Marti,et al.  A Comparison of Breast Tissue Classification Techniques , 2006, MICCAI.

[11]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[12]  M. Rubin,et al.  Neuroendocrine expression in metastatic prostate cancer: evaluation of high throughput tissue microarrays to detect heterogeneous protein expression. , 2000, Human pathology.

[13]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[14]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[15]  C. Cardinez,et al.  United States cancer statistics; 2003 incidence and mortality , 2006 .

[16]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[17]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[18]  Anil K. Jain,et al.  Texture classification and segmentation using multiresolution simultaneous autoregressive models , 1992, Pattern Recognit..

[19]  James R. Bergen,et al.  Pyramid-based texture analysis/synthesis , 1995, Proceedings., International Conference on Image Processing.

[20]  H. Moch,et al.  Tissue microarrays for rapid linking of molecular changes to clinical endpoints. , 2001, The American journal of pathology.

[21]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[22]  Kristin J. Dana,et al.  3D Texture Recognition Using Bidirectional Feature Histograms , 2004, International Journal of Computer Vision.

[23]  Andrew Zisserman,et al.  Classifying Images of Materials: Achieving Viewpoint and Illumination Independence , 2002, ECCV.

[24]  Cordelia Schmid,et al.  Constructing models for content-based image retrieval , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[25]  D L Rimm,et al.  Tissue microarray: a new technology for amplification of tissue resources. , 2001, Cancer journal.

[26]  Peter Meer,et al.  Unsupervised segmentation based on robust estimation and color active contour models , 2005, IEEE Transactions on Information Technology in Biomedicine.

[27]  D Butler,et al.  Biopsy of a biopsy: validation of immunoprofiling in gastric cancer biopsy tissue microarrays , 2003, Histopathology.

[28]  Mary M. Galloway,et al.  Texture analysis using gray level run lengths , 1974 .

[29]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[30]  Rangaraj M. Rangayyan,et al.  Recent Advances in Breast Imaging, Mammography, and Computer-Aided Diagnosis of Breast Cancer , 2006 .

[31]  MalikJitendra,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001 .

[32]  Lin Yang,et al.  Multiple Class Segmentation Using A Unified Framework over Mean-Shift Patches , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).