ImageMiner: a software system for comparative analysis of tissue microarrays using content-based image retrieval, high-performance computing, and grid technology

OBJECTIVE AND DESIGN The design and implementation of ImageMiner, a software platform for performing comparative analysis of expression patterns in imaged microscopy specimens such as tissue microarrays (TMAs), is described. ImageMiner is a federated system of services that provides a reliable set of analytical and data management capabilities for investigative research applications in pathology. It provides a library of image processing methods, including automated registration, segmentation, feature extraction, and classification, all of which have been tailored, in these studies, to support TMA analysis. The system is designed to leverage high-performance computing machines so that investigators can rapidly analyze large ensembles of imaged TMA specimens. To support deployment in collaborative, multi-institutional projects, ImageMiner features grid-enabled, service-based components so that multiple instances of ImageMiner can be accessed remotely and federated. RESULTS The experimental evaluation shows that: (1) ImageMiner is able to support reliable detection and feature extraction of tumor regions within imaged tissues; (2) images and analysis results managed in ImageMiner can be searched for and retrieved on the basis of image-based features, classification information, and any correlated clinical data, including any metadata that have been generated to describe the specified tissue and TMA; and (3) the system is able to reduce computation time of analyses by exploiting computing clusters, which facilitates analysis of larger sets of tissue samples.

[1]  J. Rao,et al.  Protein expression analysis using quantitative fluorescence image analysis on tissue microarray slides. , 2002, BioTechniques.

[2]  D J Foran,et al.  A network-based prototype for interactive telemedicine & automated management of distributed, clinical databases. , 1996, Journal of clinical engineering.

[3]  Mark James,et al.  Biomedical Informatics Research Network: Building a National Collaboratory to Hasten the Derivation of New Understanding and Treatment of Disease , 2005, HealthGrid.

[4]  D. Rimm,et al.  A decade of tissue microarrays: progress in the discovery and validation of cancer biomarkers. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[5]  D. Rimm,et al.  Automated subcellular localization and quantification of protein expression in tissue microarrays , 2002, Nature Medicine.

[6]  Jitendra Malik,et al.  Recognizing surfaces using three-dimensional textons , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  David G. Nohle,et al.  The tissue microarray data exchange specification: A document type definition to validate and enhance XML data , 2005, BMC Medical Informatics Decis. Mak..

[8]  Richard McClatchey,et al.  MammoGrid: Large-Scale Distributed Mammogram Analysis , 2004, MIE.

[9]  M. Rubin,et al.  Neuroendocrine expression in metastatic prostate cancer: evaluation of high throughput tissue microarrays to detect heterogeneous protein expression. , 2000, Human pathology.

[10]  Lin Yang,et al.  Virtual Microscopy and Grid-Enabled Decision Support for Large-Scale Analysis of Imaged Pathology Specimens , 2009, IEEE Transactions on Information Technology in Biomedicine.

[11]  David R. Westhead,et al.  TmaDB: a repository for tissue microarray data , 2005, BMC Bioinformatics.

[12]  Wenjin Chen,et al.  Advances in cancer tissue microarray technology: Towards improved understanding and diagnostics. , 2006, Analytica chimica acta.

[13]  Arthur W. Wetzel Computational Aspects of Pathology Image Classification and Retrieval , 1997, The Journal of Supercomputing.

[14]  Richard McClatchey,et al.  A Grid Information Infrastructure for Medical Image Analysis , 2004, ArXiv.

[15]  Wei He,et al.  Image mining for investigative pathology using optimized feature extraction and data fusion , 2005, Comput. Methods Programs Biomed..

[16]  James R. Bergen,et al.  Pyramid-based texture analysis/synthesis , 1995, Proceedings., International Conference on Image Processing.

[17]  Yu Rang Park,et al.  The tissue microarray object model: a data model for storage, analysis, and exchange of tissue microarray experimental data. , 2006, Archives of pathology & laboratory medicine.

[18]  Lin Yang,et al.  High Throughput Analysis of Breast Cancer Specimens on the Grid , 2007, MICCAI.

[19]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[20]  Andrea Clematis,et al.  Tissue MicroArray: a Distributed Grid Approach for Image Analysis , 2007, HealthGrid.

[21]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[22]  Ashok Patel,et al.  The tissue microarray data exchange specification: implementation by the Cooperative Prostate Cancer Tissue Resource , 2004, BMC Bioinformatics.

[23]  A Solomonides,et al.  MammoGrid and eDiamond: Grids Applications in Mammogram Analysis , 2003 .

[24]  Brian E Matysiak,et al.  Simple, Inexpensive Method for Automating Tissue Microarray Production Provides Enhanced Microarray Reproducibility , 2003, Applied immunohistochemistry & molecular morphology : AIMM.

[25]  Jun Hu,et al.  A caGrid-enabled, learning based image segmentation method for histopathology specimens , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[26]  Anna Frolov,et al.  The prolyl isomerase Pin1 is a novel prognostic marker in human prostate cancer. , 2003, Cancer research.

[27]  F. Schnorrenberg,et al.  Content-based retrieval of breast cancer biopsy slides. , 2000, Technology and health care : official journal of the European Society for Engineering and Medicine.

[28]  C. Street,et al.  The Cancer Biomedical Informatics Grid (caBIGTM) , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[29]  Kristin J. Dana,et al.  Compact representation of bidirectional texture functions , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[30]  Ivan Merelli,et al.  Ontology-based, Tissue MicroArray oriented, image centered tissue bank , 2008, BMC Bioinformatics.

[31]  K. Buetow,et al.  The Cancer Biomedical Informatics Grid (caBIGTM): Creating a Platform for Personalized, Molecular Medicine , 2008 .

[32]  Dorin Comaniciu,et al.  Image-guided decision support system for pathology , 1999, Machine Vision and Applications.

[33]  Martina Uray,et al.  TAMEE: data management and analysis for tissue microarrays , 2007, BMC Bioinformatics.

[34]  Joel H. Saltz,et al.  Processing large-scale multi-dimensional data in parallel and distributed environments , 2002, Parallel Comput..

[35]  Thomas Martin Deserno,et al.  A Generic Concept for the Implementation of Medical Image Retrieval Systems , 2005, MIE.

[36]  Bela Julesz,et al.  A theory of preattentive texture discrimination based on first-order statistics of textons , 2004, Biological Cybernetics.

[37]  Todd H. Stokes,et al.  Development of an automatic quantification method for cancer tissue microarray study , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[38]  David L Rimm,et al.  Quantitative determination of expression of the prostate cancer protein alpha-methylacyl-CoA racemase using automated quantitative analysis (AQUA): a novel paradigm for automated and continuous biomarker measurements. , 2004, The American journal of pathology.

[39]  Rafael Jimenez,et al.  Use and validation of epithelial recognition and fields of view algorithms on virtual slides to guide TMA construction. , 2009, BioTechniques.

[40]  H. Moch,et al.  High-throughput tissue microarray analysis to evaluate genes uncovered by cDNA microarray screening in renal cell carcinoma. , 1999, The American journal of pathology.

[41]  D. Rubin,et al.  The Annotation and Image Mark-up project. , 2009, Radiology.

[42]  Wendy A. Rogers,et al.  Dollars, debts and duties: lessons from funding Australian general practice. , 2000, Health & social care in the community.

[43]  Joel H. Saltz,et al.  caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid , 2006, Bioinform..

[44]  S. Baredes,et al.  Abstract 786: Alterations of TGFβ/Smad signaling in human head and neck squamous cell carcinomas , 2010 .

[45]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[46]  Joel H. Saltz,et al.  Distributed processing of very large datasets with DataCutter , 2001, Parallel Comput..

[47]  Jun Kong,et al.  Computerized Pathological Image Analysis For Neuroblastoma Prognosis , 2007, AMIA.

[48]  David J Foran,et al.  Therapeutic starvation and autophagy in prostate cancer: A new paradigm for targeting metabolism in cancer therapy , 2008, The Prostate.

[49]  M H Ellisman,et al.  Web-based telemicroscopy. , 1999, Journal of structural biology.

[50]  Joel H. Saltz,et al.  Model Formulation: caGrid 1.0: An Enterprise Grid Infrastructure for Biomedical Research , 2008, J. Am. Medical Informatics Assoc..

[51]  Jules J. Berman,et al.  The tissue microarray data exchange specification: A community-based, open source tool for sharing tissue microarray data , 2003, BMC Medical Informatics Decis. Mak..

[52]  Dorin Comaniciu,et al.  Cell image segmentation for diagnostic pathology , 2001 .

[53]  J. Kononen,et al.  Tissue microarrays for high-throughput molecular profiling of tumor specimens , 1998, Nature Medicine.

[54]  H. Moch,et al.  Tissue microarrays for rapid linking of molecular changes to clinical endpoints. , 2001, The American journal of pathology.

[55]  Kurt Zatloukal,et al.  Automated evaluation and normalization of immunohistochemistry on tissue microarrays with a DNA microarray scanner. , 2003, BioTechniques.

[56]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[57]  V.S. Kumar,et al.  Large Image Correction and Warping in a Cluster Environment , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[58]  Barbara Horner-Miller,et al.  Proceedings of the 2006 ACM/IEEE conference on Supercomputing , 2006 .

[59]  Ilan Shimshoni,et al.  Mean shift based clustering in high dimensions: a texture classification example , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[60]  Peter Meer,et al.  Unsupervised segmentation based on robust estimation and color active contour models , 2005, IEEE Transactions on Information Technology in Biomedicine.

[61]  James Zijun Wang,et al.  Multiresolution browsing of pathology images using wavelets , 1999, AMIA.

[62]  Ann Zimmerman,et al.  The Biomedical Informatics Research Network , 2008 .

[63]  David L Rimm,et al.  Quantitative analysis of breast cancer tissue microarrays shows that both high and normal levels of HER2 expression are associated with poor outcome. , 2003, Cancer research.

[64]  Lei Zheng,et al.  Design and analysis of a content-based pathology image retrieval system , 2003, IEEE Transactions on Information Technology in Biomedicine.

[65]  Cordelia Schmid,et al.  Constructing models for content-based image retrieval , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[66]  Daniel L. Rubin,et al.  The caBIG™ Annotation and Image Markup Project , 2009, Journal of Digital Imaging.

[67]  Kerry K Kakazu,et al.  The Cancer Biomedical Informatics Grid (caBIG): pioneering an expansive network of information and tools for collaborative cancer research. , 2004, Hawaii medical journal.

[68]  David J. Foran,et al.  Unsupervised imaging, registration and archiving of tissue microarrays , 2002, AMIA.

[69]  Mark H. Ellisman,et al.  Medical Data Federation , 2004, The Grid 2, 2nd Edition.

[70]  J. Suri,et al.  Advanced algorithmic approaches to medical image segmentation: state-of-the-art application in cardiology, neurology, mammography and pathology , 2001 .

[71]  David J. Foran,et al.  A prototype for unsupervised analysis of tissue microarrays for cancer research and diagnostics , 2004, IEEE Transactions on Information Technology in Biomedicine.

[72]  Daniel L. Rubin,et al.  Annotation and query of tissue microarray data using the NCI Thesaurus , 2007, BMC Bioinformatics.