Do We Need Annotation Experts? A Case Study in Celiac Disease Classification

Inference of clinically-relevant findings from the visual appearance of images has become an essential part of processing pipelines for many problems in medical imaging. Typically, a sufficient amount labeled training data is assumed to be available, provided by domain experts. However, acquisition of this data is usually a time-consuming and expensive endeavor. In this work, we ask the question if, for certain problems, expert knowledge is actually required. In fact, we investigate the impact of letting non-expert volunteers annotate a database of endoscopy images which are then used to assess the absence/presence of celiac disease. Contrary to previous approaches, we are not interested in algorithms that can handle the label noise. Instead, we present compelling empirical evidence that label noise can be compensated by a sufficiently large corpus of training data, labeled by the non-experts.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[3]  Andreas Uhl,et al.  Modeling the Marginal Distributions of Complex Wavelet Coefficient Magnitudes for the Classification of Zoom-Endoscopy Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[4]  Matti Pietikäinen,et al.  Multi-scale Binary Patterns for Texture Analysis , 2003, SCIA.

[5]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[6]  Ata Kabán,et al.  Label-Noise Robust Logistic Regression and Its Applications , 2012, ECML/PKDD.

[7]  Matti Pietikäinen,et al.  Gray Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2000, ECCV.

[8]  Peter A. Flach,et al.  Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[9]  Carla E. Brodley,et al.  Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..

[10]  Leszek Wojnar,et al.  Image Analysis , 1998 .

[11]  Greg Mori,et al.  Handling Uncertain Tags in Visual Recognition , 2013, 2013 IEEE International Conference on Computer Vision.

[12]  Yang Song,et al.  Handling label noise in video classification via multiple instance learning , 2011, 2011 International Conference on Computer Vision.

[13]  M. Topi,et al.  Robust texture classification by subsets of local binary patterns , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[14]  G. Oberhuber,et al.  The histopathology of coeliac disease: time for a standardized report scheme for pathologists. , 1999, European journal of gastroenterology & hepatology.

[15]  Joachim M. Buhmann,et al.  Weakly supervised semantic segmentation of Crohn's disease tissues from abdominal MRI , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[16]  Florent Perronnin,et al.  Fisher Kernels on Visual Vocabularies for Image Categorization , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  William Dickey,et al.  Prevalence of celiac disease and its endoscopic markers among patients having routine upper gastrointestinal endoscopy , 1999, American Journal of Gastroenterology.

[18]  Matti Pietikäinen,et al.  Robust Texture Classification by Subsets of Local Binary Patterns , 2000, ICPR.