Discordance in the histopathologic diagnosis of melanoma and melanocytic nevi between expert pathologists.

The reliability of a diagnostic test depends on the reproducibility of the result. Many clinical diagnostic tests can be quantified with established ranges and standard deviations. Other tests are more subjective, such as those that depend on analysis of a visual image with an increased possibility of variance in the result. To study this variance, the authors analyzed the performance of expert pathologists in the interpretation of cutaneous melanocytic tumors. A panel of expert pathologists was convened to review anatomic pathology specimens from melanocytic tumors. Each pathologist submitted five specimens, from which 37 were selected for review. Only one slide was used for each case. All specimens were interpreted by each pathologist without consultation with each other. In addition to standard diagnostic terms, each specimen was designated as benign, malignant, or indeterminate. Statistical analysis was used to determine the degree of concordance. The combined kappa statistic for the eight observers and three possible outcomes (benign, malignant, or indeterminate) was 0.50. A kappa statistic of this magnitude, is defined as being moderate. In 62% of the specimens, there was unanimous agreement or only one discordant designation. Thirty-eight percent had two or more discordant interpretations. No single pathologist had a disproportionate number of discordant designations. This study mimics the consultation practice of anatomic pathology and shows the variability and discordance in diagnostic language and designation of biological behavior. The results suggest the criteria for the diagnosis of melanomas and melanocytic nevi need to be refined and more consistently applied.

[1]  P. Colloby,et al.  Observer variation in the measurement of Breslow depth and Clark's level in thin cutaneous malignant melanoma , 1991, The Journal of pathology.

[2]  M. Mihm,et al.  Histopathologic recognition and grading of dysplastic melanocytic nevi: an interobserver agreement study. , 1993, The Journal of investigative dermatology.

[3]  W. Clark,et al.  Comparison of the classification by microscopic level (stage) of malignant melanoma by three independent groups of pathologists , 1977, Cancer.

[4]  J. Elmore,et al.  Variability in radiologists' interpretations of mammograms. , 1994, The New England journal of medicine.

[5]  J. Kirkwood,et al.  Interclinician agreement on the recognition of selected gross morphologic features of pigmented lesions. Studies of melanocytic nevi V. , 1992, Journal of the American Academy of Dermatology.

[6]  M. Skolnick,et al.  Histopathologic characteristics of dysplastic nevi. Limited association of conventional histologic criteria with melanoma risk group. , 1990, Journal of the American Academy of Dermatology.

[7]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[8]  C. Holman,et al.  An improved method of analysis of observer variation between pathologists , 1982, Histopathology.

[9]  W. Clark,et al.  Early melanoma. Histologic terms. , 1991, The American Journal of dermatopathology.

[10]  R. Sanson-Fisher,et al.  Reliability of naevus counts in identifying individuals at high risk of malignant melanoma , 1994, The British journal of dermatology.

[11]  J. Richard Landis,et al.  Large sample variance of kappa in the case of different sets of raters. , 1979 .

[12]  C. Holman,et al.  Inter‐observer variation between pathologists in the classification of cutaneous malignant melanoma in Western Australia , 1984, Histopathology.

[13]  C. Lewis,et al.  A multiobserver, population-based analysis of histologic dysplasia in melanocytic nevi. , 1994, Journal of the American Academy of Dermatology.

[14]  O. Clemmensen,et al.  Dysplastic naevus: histological criteria and their inter‐observer reproducibility , 1994, Histopathology.

[15]  A. Kopf,et al.  Diagnostic accuracy in malignant melanoma. , 1975, Archives of dermatology.

[16]  A. Cochran,et al.  Difficulties encountered in the application of clark classification and the Breslow thickness measurement in cutaneous malignant melanoma , 1980, International journal of cancer.

[17]  R. A. Carlsen,et al.  Cutaneous T-cell lymphoma. Evaluation of pretreatment skin biopsy specimens by a panel of pathologists. , 1992, Archives of Dermatology.

[18]  J. R. Landis,et al.  A one-way components of variance model for categorical data , 1977 .