Paper information - How Many Annotators Do We Need? - A Study on the Influence of Inter-Observer Variability on the Reliability of Automatic Mitotic Figure Assessment

The density of mitotic figures in histologic sections is a prognostically relevant characteristic for many tumours. Because inter-pathologist variability is high, deep learning-based algorithms are a promising solution to improve tumour prognostication. Pathologists are the gold standard for database development; however, labelling errors may hamper the development of accurate algorithms. In the present work, we evaluated the benefit of multi-expert consensus (n = 3, 5, 7, 9, 17) on algorithmic performance. While training with individual databases resulted in highly variable $F_1$ scores, performance was notably higher and more consistent when using the consensus of three annotators. Adding more annotators resulted in only minor improvements. We conclude that databases by few pathologists with high label precision may be the best compromise between high algorithmic performance and time investment.
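To make the two quantities in the abstract concrete, the following is a minimal sketch of how a multi-annotator consensus label and an $F_1$ score could be computed. The abstract does not state how the consensus was formed, so simple majority voting is an assumption here, and the function names `majority_consensus` and `f1_score` as well as the toy votes are hypothetical illustrations, not the authors' pipeline or data.

```python
from typing import Sequence


def majority_consensus(votes: Sequence[Sequence[int]]) -> list[int]:
    """Return a per-candidate consensus label by strict majority vote.

    votes[a][i] is 1 if annotator a labelled candidate i as a mitotic
    figure and 0 otherwise. Majority voting is an assumption; the source
    does not specify the consensus rule.
    """
    n_annotators = len(votes)
    n_candidates = len(votes[0])
    return [
        1 if 2 * sum(v[i] for v in votes) > n_annotators else 0
        for i in range(n_candidates)
    ]


def f1_score(reference: Sequence[int], predicted: Sequence[int]) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), computed over binary labels."""
    tp = sum(1 for r, p in zip(reference, predicted) if r == 1 and p == 1)
    fp = sum(1 for r, p in zip(reference, predicted) if r == 0 and p == 1)
    fn = sum(1 for r, p in zip(reference, predicted) if r == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy example (made-up votes): three annotators, five candidate cells.
votes = [
    [1, 1, 0, 0, 1],
    [1, 0, 0, 1, 1],
    [1, 1, 1, 0, 0],
]
consensus = majority_consensus(votes)

# Hypothetical model output scored against the consensus labels.
predictions = [1, 0, 0, 1, 1]
score = f1_score(consensus, predictions)
```

All annotator counts evaluated in the study (3, 5, 7, 9, 17) are odd, so a strict majority rule of this kind would never produce a tie.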
