Analyzing Patterns of Staining in Immunohistochemical Studies: Application to a Study of Prostate Cancer Recurrence

Background: Immunohistochemical studies use antibodies to stain tissues with the goal of quantifying protein expression. However, protein expression is often heterogeneous, resulting in variable degrees and patterns of staining. This problem is particularly acute in prostate cancer, where tumors are infiltrative and heterogeneous in nature. In this article, we introduce analytic approaches that explicitly consider both the frequency and the intensity of tissue staining.

Methods: Compositional data analysis is a technique for analyzing vectors of proportions that sum to one, such as those obtained from soil sample studies or species abundance surveys. We summarized each specimen's staining pattern by the proportion of cells staining at mild, moderate, and intense levels and used compositional data analysis to summarize and compare the resulting staining profiles.

Results: In a study of Syndecan-1 staining patterns among 44 localized prostate cancer cases with Gleason score 7 disease, compositional data analysis did not detect a statistically significant difference between the staining patterns of recurrent (n = 22) and nonrecurrent (n = 22) patients. Results indicated only a modest increase in the proportion of cells staining at moderate intensity in the recurrent group. In contrast, an analysis that compared quantitative scores across groups indicated a borderline significant increase in staining in the recurrent group (P = 0.05, t test).

Conclusions: Compositional data analysis offers a novel analytic approach for immunohistochemical studies, providing greater insight into differences in staining patterns between groups but possibly lower statistical power than existing score-based methods. When appropriate, we recommend conducting a compositional data analysis in addition to a standard score-based analysis.
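To illustrate how staining profiles can be treated as compositions, the following is a minimal sketch, not the analysis performed in the study. It assumes each specimen is summarized by three proportions (mild, moderate, intense) that sum to one, uses an additive log-ratio transform with the intense category as the reference part, and summarizes each group by its closed geometric mean. The function names and all data values are hypothetical and chosen only for illustration; the abstract does not specify which transform or summary was used.

```python
import numpy as np

def closure(x):
    """Rescale each row so that its parts sum to one."""
    x = np.asarray(x, dtype=float)
    return x / x.sum(axis=-1, keepdims=True)

def alr(comp):
    """Additive log-ratio transform using the last part as the reference.

    Maps a D-part composition to D-1 unconstrained real coordinates.
    Every part must be strictly positive; zero proportions would need
    a replacement strategy, which this sketch does not implement.
    """
    comp = closure(comp)
    return np.log(comp[..., :-1] / comp[..., -1:])

def compositional_mean(comps):
    """Closed geometric mean, a common center for a set of compositions."""
    log_mean = np.log(closure(comps)).mean(axis=0)
    return closure(np.exp(log_mean))

# Hypothetical staining profiles: columns are (mild, moderate, intense).
recurrent = np.array([
    [0.50, 0.30, 0.20],
    [0.40, 0.40, 0.20],
    [0.30, 0.50, 0.20],
])
nonrecurrent = np.array([
    [0.60, 0.20, 0.20],
    [0.50, 0.25, 0.25],
    [0.55, 0.25, 0.20],
])

# Group-level summary profiles, each summing to one.
print("recurrent center:   ", compositional_mean(recurrent))
print("nonrecurrent center:", compositional_mean(nonrecurrent))

# Log-ratio coordinates, suitable for standard multivariate comparisons.
print("recurrent alr:\n", alr(recurrent))
print("nonrecurrent alr:\n", alr(nonrecurrent))
```

Working in log-ratio coordinates removes the unit-sum constraint, so the two groups can then be compared with ordinary multivariate methods applied to the transformed values. Which test is appropriate depends on the study design and is left open here.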