Unsupervised Analysis Uncovers Changes in Histopathologic Diagnosis in Supervised Genomic Studies

Human gastrointestinal stromal tumors (GISTs) have recently emerged as a distinct mesenchymal tumor type with a unique phenotype characterized by gain-of-function mutations in c-kit. In contrast, leiomyosarcomas (LMSs) of the gastrointestinal tract or retroperitoneum, which were previously classified together with GISTs as gastrointestinal sarcomas, harbor c-kit mutations much less frequently. We performed microarray analyses to gain a comprehensive understanding of the differences between these two types of soft-tissue sarcoma at the level of gene expression. Microarray experiments were performed on 30 GISTs and 30 LMSs collected at the time of surgical resection. The tumors were categorized according to the histopathologic diagnosis recorded in our institutional database. Before searching for genes that are differentially expressed between the two tumor types, we carried out an unsupervised analysis using multidimensional scaling (MDS) to determine whether the two groups show marked overall differences in gene expression. Initially, MDS did not reveal a good separation between the two groups. We then re-reviewed the histopathology of these tumors and realized that some of the cases in our study had been acquired 10 years earlier, when gastrointestinal sarcoma was diagnosed by histopathologic criteria alone, without immunohistochemistry for c-kit. An experienced pathologist reviewed all of the specimens and found that a number of the GIST cases had been classified as LMS in the clinical database. Correcting the histopathologic diagnoses and relabeling the samples resulted in a much more pronounced separation of GIST and LMS in the MDS analysis. This study underscores the need to re-review histopathology as tumor classifications are revised. Although updating the clinical database may be desirable, it is usually impractical. For molecular studies that use archival samples, it is therefore critical to have the samples re-reviewed by a pathologist. Further, unsupervised analysis often proves to be a critical quality-control step for identifying structural problems in the data, such as mislabeled samples. Finally, the MDS analysis further supports the view that GIST is a distinct type of sarcoma.
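The quality-control idea described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the study's actual pipeline: the expression matrix is synthetic, the number of mislabeled cases (8) is arbitrary, the distance is a correlation-based one, the embedding is classical (Torgerson) MDS implemented with NumPy, and the `separation` score is a hypothetical helper. The abstract does not specify the MDS variant, distance measure, or any numeric separation criterion.

```python
import numpy as np

def classical_mds(dist, n_components=2):
    """Classical (Torgerson) MDS: embed samples from a pairwise distance matrix."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-center the squared distances to obtain a Gram matrix.
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, order] * np.sqrt(np.clip(eigvals[order], 0.0, None))

def separation(coords, labels):
    """Hypothetical score: centroid distance divided by mean within-group spread."""
    labels = np.asarray(labels)
    groups = [coords[labels == g] for g in np.unique(labels)]
    gap = np.linalg.norm(groups[0].mean(axis=0) - groups[1].mean(axis=0))
    spread = np.mean([np.linalg.norm(g - g.mean(axis=0), axis=1).mean() for g in groups])
    return gap / spread

# Synthetic stand-in for the data: 30 "GIST" and 30 "LMS" profiles over 200 genes.
rng = np.random.default_rng(0)
n_per_group, n_genes = 30, 200
reviewed_labels = np.array(["GIST"] * n_per_group + ["LMS"] * n_per_group)
expr = rng.normal(size=(2 * n_per_group, n_genes))
expr[reviewed_labels == "GIST", :40] += 2.0  # assumed group-specific signal

# Simulated database labels in which some GIST cases are recorded as LMS.
database_labels = reviewed_labels.copy()
database_labels[:8] = "LMS"

# Correlation-based distance between samples (an assumed choice of metric).
corr = np.corrcoef(expr)
dist = np.sqrt(np.clip(2.0 * (1.0 - corr), 0.0, None))
coords = classical_mds(dist)

# The embedding never sees the labels; only the grouping used to judge it changes.
sep_database = separation(coords, database_labels)
sep_reviewed = separation(coords, reviewed_labels)
print(f"separation with database labels: {sep_database:.2f}")
print(f"separation with reviewed labels: {sep_reviewed:.2f}")
```

Because MDS is unsupervised, the coordinates are identical in both cases. Only the labels laid over them differ, which is why a poor separation can point to a labeling problem and not to a lack of biological difference.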
